Gene Information

Name : EC55989_3279 (EC55989_3279)
Accession : YP_002404249.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Virulence
Product : Serine protease pic precursor (ShMu)
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 3360967 - 3365085 bp
Length : 4119 bp
Strand : -
Note : Evidence 2a : Function of homologous gene experimentally demonstrated in an other organism; Product type e : enzyme

DNA sequence :
GTGAATAAAGTTTATTCTCTTAAATATTGCCCCGTCACCGGGGGGCTTATTGCTGTCTCTGAACTTGCCCGCAGGGTAAT
AAAAAAGACATGCCGAAGATTAACGCATATTCTTCTGGCTGGCATTCCAGCAATCTGTCTGTGTTACTCTCAGATATCCC
AGGCGGGTATTGTCCGTTCCGATATTGCCTATCAGATTTATCGTGATTTCGCCGAAAACAAAGGGCTTTTTGTACCTGGT
GCCAATGATATTCCGGTATATGATAAGGACGGAAAACTTGTGGGAAGACTGGGTAAAGCCCCAATGGCCGATTTCAGCAG
TGTGAGCTCAAATGGCGTTGCTACGCTTGTATCGCCTCAGTATATCGTCAGCGTAAAGCATAACGGAGGATATCGGAGTG
TGAGCTTTGGTAATGGGAAAAATACATATTCCCTTGTTGACCGTAATAACCACCCTTCTATTGACTTCCATGCTCCACGT
CTGAATAAACTGGTTACAGAAGTTATTCCCTCAGCGGTAACATCAGAAGGAACCAAAGCCAATGCTTATAAATACACTGA
ACGTTACACCGCTTTTTATCGGGTGGGTAGTGGTACGCAGTACACTAAGGACAAGGACGGAAATTTAGTTAAGGTTGCCG
GTGGATATGCTTTTAAAACAGGAGGAACCACAGGAGTTCCTCTGATATCTGATGCAACAATAGTCTCTAATCCCGGGCAA
ACTTATAATCCTGTAAACGGCCCTTTACCTGACTATGGAGCCCCTGGGGATAGTGGTTCTCCTTTGTTTGCTTATGATAA
ACAACAAAAAAAATGGGTTATTGTTGCTGTATTAAGAGCATATGCAGGTATTAATGGTGCTACGAACTGGTGGAATGTCA
TACCAACAGATTATCTGAACCAGGTTATGCAGGACGATTTCGATGCCCCCGTAGACTTTGTTTCCGGACTGGGCCCCCTG
AACTGGACATACGACAAAACATCAGGCACAGGTACCCTGAGCCAGGGCAGTAAAAACTGGACCATGCACGGGCAGAAAGA
CAATGACCTCAATGCCGGTAAAAATCTGGTATTCAGCGGGCAGAATGGTGCAATTATCCTGAAAGACAGTGTGACTCAGG
GTGCCGGTTATCTCGAATTTAAAGACAGTTACACCGTATCTGCTGAATCCGGAAAAACATGGACGGGTGCCGGCATTATT
ACTGACAAGGGGACGAATGTAACCTGGAAGGTCAACGGCGTTGCCGGTGACAACTTGCATAAGCTGGGGGAAGGAACCCT
GACCATAAACGGAACAGGTGTAAACCCGGGAGGACTGAAAACGGGAGACGGTATCGTTGTACTTAACCAGCAGGCAGACA
CTGCAGGTAATATCCAGGCCTTCAGTTCAGTGAACCTCGCCAGCGGACGTCCGACCGTGGTGCTCGGGGATGCCCGTCAG
GTCAATCCGGATAACATTTCATGGGGATACCGGGGAGGTAAGCTTGACCTTAATGGTAATGCCGTTACCTTCACCCGACT
GCAGGCTGCTGATTACGGGGCGGTGATTACAAATAATGCACAGCAAAAATCCCAGCTTTTACTGGATCTTAAGGCTCAGG
ATACAAATGTCAGTGAACCGACGATTGGAAATATATCCCCCTTTGGTGGTACCGGCACTCCAGGAAACCTGTACAGCATG
ATACTCAACAGCCAGACCCGCTTCTATATTCTGAAATCTGCCAGCTATGGTAACACTCTGTGGGGGAACAGCCTGAATGA
TCCGGCTCAGTGGGAGTTTGTTGGCATGGACAAAAACAAAGCAGTTCAGACAGTAAAAGATAGGATCCTGGCCGGGCGGG
CAAAACAACCCGTTATCTTTCATGGTCAGCTGACCGGGAATATGGATGTCGCCATTCCACAGGTGCCGGGGGGAAGAAAG
GTCATCTTTGATGGTAGCGTGAACCTGCCGGAAGGTACCCTGAGTCAGGACAGTGGCACCCTGATATTCCAGGGACATCC
GGTTATCCATGCCTCCATCAGTGGCAGTGCACCGGTCAGCCTGAACCAGAAAGACTGGGAAAACCGTCAGTTTACAATGA
AAACACTGTCGCTGAAAGACGCTGACTTCCATCTTTCACGTAACGCCTCGCTGAACAGTGACATTAAGTCGGATAACAGC
CATATCACACTGGGAAGTGACAGGGCATTTGTGGATAAAAATGACGGAACAGGAAATTATGTCATTCCGGAGGAAGGTAC
CTCTGTCCCGGACACCGTGAATGACAGGAGCCAGTATGAAGGGAATATTACGCTGAACCATAACTCAGCCCTGGATATCG
GCAGCAGGTTCACCGGGGGGATTGACGCTTATGACAGTGCCGTCAGCATCACCTCTCCGGACGTCCTGTTGACAGCCCCG
GGTGCTTTTGCCGGCAGTTCACTGACAGTGCATGATGGCGGTCATCTTACAGCACTGAACGGTCTTTTCAGCGACGGGCA
TATTCAGGCCGGTAAGAACGGCAAAATCACCCTGAGCGGTACACCGGTTAAAGATACGGCTAATCAGTATGCCCCTGCTG
TATATCTGACGGACGGATATGACCTGACCGGCGATAACGCAGCACTGGAAATTACCCGTGGAGCACATGCTTCCGGTGAT
ATTCATGCCTCTGCGGCATCAACAGTTACCATCGGGTCTGACACGCCGGCAGAACTGGCTTCTGCGGAAACGGCTGCATC
GGCGTTTGCCGGCAGTCTTCTTGAGGGCTATAACGCAGCATTCAATGGTGCCATAACCGGTGGCAGGGCTGATGTCAGTA
TGCATAATGCACTGTGGACTCTGGGTGGGGACTCTGCCATCCACAGTCTTACCGTCAGAAACAGCCGTATTAGTTCTGAA
GGAGACCGTACATTCCGTACCCTGACGGTGAATAAACTGGATGCAACAGGCAGTGATTTTGTTTTGCGTACGGACCTGAA
AAATGCCGATAAAATTAATGTGACTGAAAAAGCCACTGGTTCAGATAACAGCCTGAACGTCAGCTTTATGAATAATCCTG
CTCAGGGACAGGCCCTGAATATTCCTCTGGTCACGGCACCGGCGGGAACTTCAGCAGAGATGTTTAAGGCCGGCACCCGG
GTGACAGGTTTCAGTCGGGTGACCCCAACCCTGCATGTTGATACCAGTGGTGGCAATACGAAGTGGATACTGGATGGTTT
TAAAGCGGAGGCTGATAAAGCCGCTGCCGCGAAGGCTGACAGTTTCATGAATGCCGGGTATAAAAACTTCATGACGGAAG
TTAACAATCTGAACAAACGTATGGGTGACCTGCGTGACACAAACGGTGATGCCGGTGCCTGGGCGCGCATCATGAGTGGT
GCCGGTTCTGCAGACGGTGGTTACAGTGATAATTACACCCATGTTCAGGTCGGCTTTGACAAAAAACATGAACTGGACGG
TGTGGACCTGTTTACCGGTGTCACGATGACCTATACCGACAGCAGTGCAGACAGCCATGCATTCAGCGGAAAGACGAAAT
CGGTGGGGGGCGGTCTGTATGCTTCAGCATTGTTTGAGTCCGGTGCCTATATCGATTTGATTGGTAAATATATTCACCAT
GACAATGATTACACAGGTAACTTTGCTAGCCTGGGAACGAAACACTACAACACCCATTCCTGGTATGCCGGTGCTGAAAC
GGGTTACCGCTATCACCTGACAGAGGACACGTTCATTGAGCCGCAGGCTGAACTGGTTTACGGCGCCGTGTCCGGGAAAA
CATTCCGCTGGAAAGACGGTGATATGGACCTGAGCATGAAGAACAGGGACTTCAGTCCGCTGGTTGGAAGAACAGGGGTT
GAACTGGGCAAGACCTTCAGTGGTAAGGACTGGAGTGTGACGGCCCGTGCCGGAACCAGCTGGCAGTTTGACCTGCTGAA
TAATGGAGAGACCGTACTGCGTGATGCGTCCGGGGAGAAACGGATAAAAGGAGAGAAGGACAGCCGGATGCTGTTTAATG
TTGGTATGAATGCGCAGATAAAGGACAATATGCGCTTTGGTCTGGAGTTTGAGAAGTCAGCCTTTGGTAAATATAACGTG
GATAATGCGGTAAACGCGAATTTCCGGTATATGTTCTGA

Protein sequence :
MNKVYSLKYCPVTGGLIAVSELARRVIKKTCRRLTHILLAGIPAICLCYSQISQAGIVRSDIAYQIYRDFAENKGLFVPG
ANDIPVYDKDGKLVGRLGKAPMADFSSVSSNGVATLVSPQYIVSVKHNGGYRSVSFGNGKNTYSLVDRNNHPSIDFHAPR
LNKLVTEVIPSAVTSEGTKANAYKYTERYTAFYRVGSGTQYTKDKDGNLVKVAGGYAFKTGGTTGVPLISDATIVSNPGQ
TYNPVNGPLPDYGAPGDSGSPLFAYDKQQKKWVIVAVLRAYAGINGATNWWNVIPTDYLNQVMQDDFDAPVDFVSGLGPL
NWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIILKDSVTQGAGYLEFKDSYTVSAESGKTWTGAGII
TDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGIVVLNQQADTAGNIQAFSSVNLASGRPTVVLGDARQ
VNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSQLLLDLKAQDTNVSEPTIGNISPFGGTGTPGNLYSM
ILNSQTRFYILKSASYGNTLWGNSLNDPAQWEFVGMDKNKAVQTVKDRILAGRAKQPVIFHGQLTGNMDVAIPQVPGGRK
VIFDGSVNLPEGTLSQDSGTLIFQGHPVIHASISGSAPVSLNQKDWENRQFTMKTLSLKDADFHLSRNASLNSDIKSDNS
HITLGSDRAFVDKNDGTGNYVIPEEGTSVPDTVNDRSQYEGNITLNHNSALDIGSRFTGGIDAYDSAVSITSPDVLLTAP
GAFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNGKITLSGTPVKDTANQYAPAVYLTDGYDLTGDNAALEITRGAHASGD
IHASAASTVTIGSDTPAELASAETAASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLGGDSAIHSLTVRNSRISSE
GDRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMNNPAQGQALNIPLVTAPAGTSAEMFKAGTR
VTGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMGDLRDTNGDAGAWARIMSG
AGSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYASALFESGAYIDLIGKYIHH
DNDYTGNFASLGTKHYNTHSWYAGAETGYRYHLTEDTFIEPQAELVYGAVSGKTFRWKDGDMDLSMKNRDFSPLVGRTGV
ELGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKDNMRFGLEFEKSAFGKYNV
DNAVNANFRYMF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
she AAB58244.1 mucinase Virulence SHI-1 Protein 0.0 99
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 0.0 99
pic NP_708747.3 serine protease Not tested SHI-1 Protein 0.0 99
pic AAK00464.1 Pic Virulence SHI-1 Protein 0.0 99
unnamed CAC39286.1 hypothetical protein Not tested LPA Protein 0.0 53
vat YP_851472.1 vacuolating autotransporter Not tested PAI III APEC-O1 Protein 0.0 48
vat AAO21903.1 vacuolating autotransporter toxin Virulence Not named Protein 0.0 48
unnamed CAD66214.1 putative hemoglobin protease Not tested PAI III 536 Protein 0.0 47

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EC55989_3279 YP_002404249.1 Serine protease pic precursor (ShMu) VFG0861 Protein 0.0 99
EC55989_3279 YP_002404249.1 Serine protease pic precursor (ShMu) VFG0635 Protein 0.0 99
EC55989_3279 YP_002404249.1 Serine protease pic precursor (ShMu) VFG0903 Protein 0.0 97
EC55989_3279 YP_002404249.1 Serine protease pic precursor (ShMu) VFG1689 Protein 0.0 47
EC55989_3279 YP_002404249.1 Serine protease pic precursor (ShMu) VFG0904 Protein 0.0 47