Gene Information

Name : ECABU_c03110 (ECABU_c03110)
Accession : YP_006104447.1
Strain : Escherichia coli ABU 83972
Genome accession: NC_017631
Putative virulence/resistance : Virulence
Product : IgA-specific serine endopeptidase precursor
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 319666 - 323781 bp
Length : 4116 bp
Strand : -
Note : -
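
The Position and Length fields above are consistent with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, as the record does not state its coordinate convention. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# Coordinates as listed in the Position field.
# Assumption: 1-based, inclusive on both ends.
start, end = 319666, 323781

gene_length_bp = end - start + 1       # inclusive span in base pairs
codon_count = gene_length_bp // 3      # includes the terminal stop codon
protein_length_aa = codon_count - 1    # the stop codon is not translated

print(gene_length_bp, protein_length_aa)
```

Under that assumption the span matches the listed Length of 4116 bp, and the implied protein length matches the residue count of the protein sequence given in the record.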

DNA sequence :
GTGAATAAAGTTTATTCTCTTAAATATTGCCCTGTCACCGGGGGACTTATTGTTGTCTCTGAACTTGCCAGCAGGGTAAT
AAAAAAGACATGCCGAAGATTAACGCATATTCTTCTGGCTGGCATTCCAGCAGTCTATCTGTATTACCCTCAGATATCCC
AGGCGGGCATTGTCCGTTCCGATATTGCCTATCAGATTTATCGTGATTTCGCTGAAAACAAAGGGCTTTTTGTACCTGGT
GCCACAGATATTCCGGTATATGATAAGGACGGAAAACTTGTGGGGAGACTGGATAAAGCCCCAATGGCCGATTTCAGCAG
TGTGAGCTCAAATGGCGTTGCTACGCTTGTATCACCTCAGTATATCGTCAGCGTAAAGCATAACGGAGGATATCAGAGTG
TGAGCTTTGGTAATGGGAAAAATACATATTCCCTTGTTGACCGTAATAACCACTCTTCTGTTGACTTCCATGCTCCACGT
CTGAATAAACTGGTTACAGAAGTTATTCCCTCAGCGATAACATCAGAAGGAACCAAAGCCAATGCTTATAAAGACACTGA
ACGTTACACCGCTTTTTATCGGGTGGGTAGTGGTACGCAGTACACTAAGGACAAGGACGGAAATTTAGTTAAGGTTGCCG
GCGGATATGCTTTTAAAACAGGAGGAACCACAGGAGTTCCTCTGATATCTGATGCAACAATAGTCTCTAATCCCGGGCAA
ACCTATAATCCTGTAAACGGACCTTTACCTGACTATGGTGCCCCTGGGGACAGTGGTTCTCCTTTGTTTGCTTATGATGA
ACAACAAAAAAAATGGGTTATTGTTGCTGTATTAAGAGCATATGCAGGTATTAATGGTGCTACGAACTGGTGGAATGTCA
TACCAACAGATTATCTGAACCAGGTTATGCAGGACGATTTCGATGCCCCCGTGGACTTTGTTTCCGGACTGCCCCCCCTG
AACTGGACATACGACAAAACATCAGGCACAGGCACCCTGAGCCAGGGCAGTAAAAACTGGACCATGCACGGGCAGAAAGA
CAATGATCTCAATGCCGGTAAAAATCTGGTATTCAGCGGGCAGAACGGTGCAATTGTCCTGAAAGACAGTGTGACTCAGG
GGGCCGGTTATCTCGAATTTAAAGACAGTTACACCGTATCTGCTGAATCCGGAAAAACCTGGACGGGTGCCGGCATTATT
ACTGACAAGGGGACGAATGTGACCTGGAAGGTCAACGGGGTTGCCGGTGACAACCTGCATAAATTGGGGGAAGGAACCCT
GACCATAAACGGAACAGGTGTAAACCCGGGAGGACTGAAAACGGGAGACGGTACCGTTGTACTTAACCAGCAGGCAGACA
CTGCAGGTAATGTTCAGGCCTTCAGTTCCGTGAACCTCGCCAGCGGACGACCGACCGTGGTGCTCGGAGATGCCCGTCAG
GTCAATCCGGATAACATTTCATGGGGATACCGGGGAGGTAAGCTTGACCTTAATGGTAATGCCGTTACCTTCACCCGACT
GCAGGCTGCCGATTACGGGGCGGTGATTACAAATAATGCACAGCAAAAATCCCGGCTTTTACTGGATCTTAAGGCTCAGG
ATACAAATGTCAGTGTTCCGATTGGCAGTATATCCCCCTTTGGCGGTACCGGCACACCGGGAAACCTATACAGCATGATA
CTCAACGGCCAGACCCGCTTCTATATTCTGAAATCTGCCAGCTATGGTAACACCCTGTGGGGGAACAGCCTGAATGACCC
GGCTCAGTGGGAATTTGTTGGCACGGACAAAAACAAAGCAGTTCAGACAGTAAAAGACCGGATCCTGGCCGGGCGGGCAA
AACAACCCGTTATCTTTCATGGTCAGCTGACCGGGAATATGGATGTCACCATTCCACAGCTGCCGGGGGGAAGAAAGGTC
ATCCTTGATGGTAGCGTGAACCTGCCGGAAGGTACCCTGAGTGAGGACAGTGGCACCCTGATATTCCAGGGGCATCCGGT
TATCCACGCCTCCGTCAGTGGCAGTGCGCCGGTCAGCCTGAACCAGAAAGACTGGGAAAACCGCCAGTTCATAATGAAAA
CACTGTCGCTGAAAGATGCTGACTTCCATCTTTCACGTAACGCCTCGCTGAACAGTGACATTAAGTCGGATAACAGCCAT
ATCACACTGGGAAGTGACAGGGTATTTGTGGATAAAAATGACGGAACAGGAAATTATGTCATTCTGGAGGAAGGTACCTC
TGTCCCGGACACCGTGAATGACAGGAGCCAGTATGAAGGGAATATTACGCTGGACCATAACTCAACCCTGGATATCGGCA
GCCGGTTCACCGGAGGGATTGAAGCTTATGACAGTGCCGTCAGTATCACCTCTCCGGACGTCCTGTTAACAGCCCCGGGT
GCTTTTGCCGGCAGTTCACTGACAGTGCATGATGGCGGTCATCTTACAGCACTGAACGGTCTTTTCAGCGACGGGCATAT
TCAGGCCGGTAAGAACAGCAAAATCACCCTGAGCGGTACACCGGTTAAAGATACGGCTAACCAGTATGCCCCTGCTGTAT
ATCTGACGGACGGATATGACCTGACCGGCGATAACGCAACACTGGAAATTACCCGTGGAGCACATGCTTCCGGTGATATT
CATGCCTCTGCGGCATCAACAGTTACCATCGGGTCTGACACGCCGGCAGAACTGGCTTCTGCGGAAACGACTGCATCGGC
GTTTGCCGGCAGTCTTCTTGAGGGCTATAACGCAGCATTCAATGGTGCCATAACCGGTGGCAGGGCTGATGTCAGTATGC
ATAATGCACTGTGGACTCTGGGTGGGGACTCCGCCATCCACACTCTTACCGTCAGAAACAGCCGTATCAGTTCTGAAGGA
GACCGTACTTTCCGTACCCTGACGGTGAATAAACTGGATGCAACAGGCAGTGATTTTGTTCTGCGTACGGACCTGAAAAA
TGCCGATAAAATTAATGTGACTGAAAAAGCCACGGGTTCAGATAACAGCCTGAACGTCAGCTTTATGAAGGATCCGGCTC
AGGGACAGTCCCTGAATATTCCTCTGGTCACGGCACCGGCGGGAACTTCAGCAGAGATGTTTAAGGCCGGCACCCGGATG
ATAGGTTTCAGTCGGGTGACTCCAACCCTGCATGTTGACACCAGTGGTGGCAATACGAAGTGGATACTGGACGGTTTTAA
AGCGGAGGCTGATAAAGCCGCTGCCGCGAAGGCTGACAGTTTCATGAATGCCGGGTATAAAAACTTCATGACGGAAGTCA
ACAATCTGAACAAACGTATGGGTGACCTGCGTGACACAAACGGTGATGCCGGAGCCTGGGCGCGCATCATGAGTGGTGCC
GGTTCTGCAGACGGTGGTTACAGTGATAATTACACCCATGTTCAGGTCGGCTTTGACAAAAAACATGAACTGGACGGAGT
GGACCTGTTTACCGGTGTCACGATGACCTATACCGACAGCAGTGCAGACAGCCATGCGTTCAGCGGCAAGACGAAATCGG
TGGGGGGCGGTCTGTATGCTTCAGCATTGTTTGAGTCCGGTGCCTATATCGATTTGATTGGTAAATATATTCACCATGAC
AATGATTACACAGGTAACTTTGCCGGTCTGGGAACGAAACACTACAACACCCATTCCTGGTATGCCGGTGCTGAAACGGG
TTACCGCTATCACCTGACAGAGGAAACGTTCATTGAGCCGCAGGCTGAACTGGTTTACGGCGCCGTGTCCGGGAAAACAT
TCCGCTGGAAAGACGGTGATATGGACCTGAGCATGAAGAACAGGGACTTCAGTCCGCTGATTGGAAGAACAGGGATTGAA
CTGGGCAAGACCTTCAGTGGTAAGGACTGGAGTGTGACGGCCCGTGCCGGGACCAGCTGGCAGTTTGACCTACTGAATAA
TGGTGAGACGGTACTGCGTGATGCGTCCGGGGAGAAACGGATAAAAGGCGAGAAAGACAGCCGGATGCTGTTTAATGTTG
GTATGAATGCGCAGATAAAGGACAATATGCGCTTTGGTCTGGAGTTTGAGAAGTCAGCCTTTGGTAAATATAACGTGGAT
AATGCGGTAAACGCAAATTTCCGGTATATGTTCTGA

Protein sequence :
MNKVYSLKYCPVTGGLIVVSELASRVIKKTCRRLTHILLAGIPAVYLYYPQISQAGIVRSDIAYQIYRDFAENKGLFVPG
ATDIPVYDKDGKLVGRLDKAPMADFSSVSSNGVATLVSPQYIVSVKHNGGYQSVSFGNGKNTYSLVDRNNHSSVDFHAPR
LNKLVTEVIPSAITSEGTKANAYKDTERYTAFYRVGSGTQYTKDKDGNLVKVAGGYAFKTGGTTGVPLISDATIVSNPGQ
TYNPVNGPLPDYGAPGDSGSPLFAYDEQQKKWVIVAVLRAYAGINGATNWWNVIPTDYLNQVMQDDFDAPVDFVSGLPPL
NWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIVLKDSVTQGAGYLEFKDSYTVSAESGKTWTGAGII
TDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGTVVLNQQADTAGNVQAFSSVNLASGRPTVVLGDARQ
VNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSRLLLDLKAQDTNVSVPIGSISPFGGTGTPGNLYSMI
LNGQTRFYILKSASYGNTLWGNSLNDPAQWEFVGTDKNKAVQTVKDRILAGRAKQPVIFHGQLTGNMDVTIPQLPGGRKV
ILDGSVNLPEGTLSEDSGTLIFQGHPVIHASVSGSAPVSLNQKDWENRQFIMKTLSLKDADFHLSRNASLNSDIKSDNSH
ITLGSDRVFVDKNDGTGNYVILEEGTSVPDTVNDRSQYEGNITLDHNSTLDIGSRFTGGIEAYDSAVSITSPDVLLTAPG
AFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNSKITLSGTPVKDTANQYAPAVYLTDGYDLTGDNATLEITRGAHASGDI
HASAASTVTIGSDTPAELASAETTASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLGGDSAIHTLTVRNSRISSEG
DRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMKDPAQGQSLNIPLVTAPAGTSAEMFKAGTRM
IGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMGDLRDTNGDAGAWARIMSGA
GSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYASALFESGAYIDLIGKYIHHD
NDYTGNFAGLGTKHYNTHSWYAGAETGYRYHLTEETFIEPQAELVYGAVSGKTFRWKDGDMDLSMKNRDFSPLIGRTGIE
LGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKDNMRFGLEFEKSAFGKYNVD
NAVNANFRYMF
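
The relationship between the DNA and protein sequences above can be checked with a short translation sketch. It assumes the DNA is given 5' to 3' on the coding strand (the record lists Strand as "-", but the sequence begins with GTG and ends with TGA, which suggests it is already reverse-complemented), and that GTG acts as a start codon read as methionine, as in the bacterial genetic code. The function and constant names are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: alternative bacterial start codons are read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First 30 nucleotides of the DNA sequence in this record.
print(translate("GTGAATAAAGTTTATTCTCTTAAATATTGC"))  # MNKVYSLKYC
# Last 12 nucleotides of the DNA sequence in this record.
print(translate("TATATGTTCTGA"))  # YMF
```

The two outputs agree with the first ten and last three residues of the protein sequence listed above.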

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
pic | NP_838464.1 | serine protease precursor | Virulence | SHI-1 | Protein | 0.0 | 97
pic | NP_708747.3 | serine protease | Not tested | SHI-1 | Protein | 0.0 | 97
she | AAB58244.1 | mucinase | Virulence | SHI-1 | Protein | 0.0 | 97
pic | AAK00464.1 | Pic | Virulence | SHI-1 | Protein | 0.0 | 97
unnamed | CAC39286.1 | hypothetical protein | Not tested | LPA | Protein | 0.0 | 53
unnamed | CAD66214.1 | putative hemoglobin protease | Not tested | PAI III 536 | Protein | 0.0 | 48
vat | YP_851472.1 | vacuolating autotransporter | Not tested | PAI III APEC-O1 | Protein | 0.0 | 48
vat | AAO21903.1 | vacuolating autotransporter toxin | Virulence | Not named | Protein | 0.0 | 48

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-value | Identity (%)
ECABU_c03110 | YP_006104447.1 | IgA-specific serine endopeptidase precursor | VFG0903 | Protein | 0.0 | 100
ECABU_c03110 | YP_006104447.1 | IgA-specific serine endopeptidase precursor | VFG0635 | Protein | 0.0 | 97
ECABU_c03110 | YP_006104447.1 | IgA-specific serine endopeptidase precursor | VFG0861 | Protein | 0.0 | 97
ECABU_c03110 | YP_006104447.1 | IgA-specific serine endopeptidase precursor | VFG0904 | Protein | 0.0 | 48
ECABU_c03110 | YP_006104447.1 | IgA-specific serine endopeptidase precursor | VFG1689 | Protein | 0.0 | 48