Gene Information

Name : ECO103_4914 (ECO103_4914)
Accession : YP_003224732.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Virulence
Product : Efa1/LifA-like protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 5095047 - 5104718 bp
Length : 9672 bp
Strand : -
Note : Integrative element ECO103_IE05
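
The Position and Length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though the record's own numbers agree with it); the helper name feature_length is hypothetical and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(5095047, 5104718)
print(length_bp)         # 9672, the value in the Length field

# A complete coding sequence is a whole number of codons; the last one is the stop codon.
codons = length_bp // 3
print(codons - 1)        # 3223 amino acids expected in the translated product
```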

DNA sequence :
ATGAGACTGCCAGAGAAAGTTCTTTTTCCTCCTGTCACTAGTGGCCTGTCAGGGCAGGAAAAACAAAAAAAACCGAAGAG
CATTACCGGATTTCAGGAAAATTATCAACGCAATATCAGGCCAATCAAAACAGCATCAGAAGCCCGACTACGCTTCTTTG
ATAAAATGGTTTCGAAAGAAAACTCTCTGGAAGATGTTGTTTCTTTAGGTGAAATGATTCAGAAGGAAATTTATGGGCAT
AAACAAAGAACATTTTCACCAGTTCATCATACAGGTAACTGGAAATCATCATTGTTACACAACGCGCTCCTTGGTCTGGC
AAATGTTTATAATGGCTTACGGGAAACAGAATACCCTAACACTTTCAACAGAGATGGTATAAAAAGTACTAACTCTTTTA
GAGATAACTCATTGACAAAAACAAGAACTCCCAGAGATAATTTTGAGGAAGGAATAAAACATCCTGAACATGCAACAATA
CCATATGACAACGACAATGAAAGTAATAAATTGCTAAAAGCAGGAAAGATAGCTGGTAACAATAACGAGCTGTTGATGGA
AATAAAAAAGGAATCCCAAAGCGACCATCAAATCCCCCTGTCAGATAAGTTCCTGAAAAGGAAAAAACGATCTCCTGTAG
CTGAAGATAAAGTTCAAAACTCGTTAACACCAGAAAATTTTGTTCAGAAAATTTCACTTAGTGATGAGCTTAAAACAAAA
TATGCAAATGAAATTATAGAGATAAAAAGAATAATGGGAGAATACAATCTTTTACCGGATAAAAACAGTCGTAATGGTCT
AAAACTTCTACAGAAGCAAGCTGATTTACTAAAAATAATCATGGAGGATACCTCTGTTACAGAAAATACCTTTAAAAACA
TAGAGATGGCTATAGCAGATATAAAAAGGGAGTACTATTCGCATACAGTTGATATTGAGAAAAATATTCATGCCATATGG
GTTGCGGGTTCCCCACCTGAAAGCATCTCGGACTATATTAAGACTTTTCTCAAAACCTACAAAGAGTTTACTTACTATCT
TTGGGTTGATGAAAAAGCATTTGGAGCCGCGAAATTTACCAGTGTCTTAAAACAAATTGCATTTGATTTAGCATGCAGAA
CTATACAGCAGAATACTCCACAAAAAAATATTGATTTTATTAATCTATATAATGAAATAAGGAAGAAATACAACAACAAC
CCATCGGGACAACAAGAGTACTTAAACAAACTCAGAGAGCTTTATGCTACTTATCAAAAAATATCCACCCCTCTGAAACA
CATGTTTAATTCATTTTTTTTAGAAAACATGATTAAACTTCAGGATAATTTCTTCAACTATTGCATTGTCAAAGGTGTTA
CAGAGATTAATGATGAGTTACGAATAAACTACCTTAAAAATGTAATAAAACTGTCAGACGATGACATTGGTAATTATCAG
AAAACAATTAACGACAATAAAGATAGAGTAAAAAAACTAATTCTTGATTTACAGAAACAATTTGGTGAAAACCGCATTTC
AATTAAAGATGTTAACTCTTTAACCTCTCTTTCAAAATCAGAAAATAATCACAATTATCAAACTGAAATGTTGCTTCGAT
GGAACTATCCTGCCGCCTCAGACCTGCTCAGGATGTATATCCTTAAAGAGCATGGTGGTATTTATACAGACACAGATATG
ATGCCTGCATACTCTAAACAAGTAATTTTTAAAATTATGATGCAGACAAACGGAGATAATCGTTTCTTGGAAGATTTGAA
ACTACGTCGTGCAATATCTGATGGTGTATTAAGATATGTTAATAACCAAAATATTGATGAAGTTAACTATAATGAAATCA
GTGATGCAGATAAAAACATTATCAAGAAGATATTAACAGAAATATCTAAAATGCCAGAAGATAGTATTTTTACTAAGATC
AATACAAGGATTCCTCGAGACACAATGCCCATCCTTCGTCGTTATCACCTATGGCCTGATGGATGGAATATTCGTGGGCT
CAATGGATTCATGCTATCACATAAAGGTAGTGAAGTGATTGATGCTGTTATCGCAGGCCAGAATCAGGCTTACAGGGAAC
TAAGAAGAATAAGAGATAATATTCATAGTGAAATATACTTCAAACAAACTGATGAATTGTCCTCACTTCCAGATACAGAC
AAAATTGGAGGGATTCTGGTAAAAAAATACCTTTCAGGAAGTCTTTTTTCAAAATTCAGACAAGATACTATTATCCCGGA
GGCATTGAGCACCCTCCAGATATCAGGTCCTGACTTGATTCAAAGAAAGATGTTGCAATTTTTCAGGAGTAGAGGGGTGT
TAGGTGAAGAGTTCATTAATGAAAGAAAACTGAGTGATAAAGCTTATATTGGTGTCTACAAAACAACTGGCACAGGGAAA
TATGACTGGTTAACCCCTGAATCGATCGGCGTTAATGATGTCACGCCCGCAGATGAAAGTACCTGGTGTATAGGAAAAGG
CCGGTGTGTTGATGACTTCCTGTTCAAAGATGTTTCAACACTAAAAACAGAAAATCTTCCAGAATTATTCTTAACAAAAA
TAGATACTGATACGTTTTTCTCTCAGTGGTCAACCAAAACCAAGAAAGATCTGCAAAAAAAAATACAAGACCTTACTGTA
CGTTATAATGAGCTAATTGACTCGTCGACTATCGACTTTAAAAATCTATATGAAATAGATCAAATGCTCCATATGATTAT
GCTAGAGATGAATGATGATATAGCCAAAAGGTCATTGTTTTCATTGCAAGTTCAAATAGCCGAAAAAATTCGGAGGATGA
CCATTCCTGTAGACAATATAATTAACATCTATCCTGATCTACATAAAAAAAATGACAATGATCTGAGTATGTCCATAAAA
GGCTTTCTTGCGAGTAATCCACATACAAAAATAAATATTCTTTATAGCAATAAGACTGAGCATAATATTTTTATAAAGGA
TTTATTCTCCTTCGCAGTTATGGAAAATGAGTTAAGAGACATTATCAATAACATGAGCAAAGATAAGACTCCTGAAAACT
GGGAAGGGAGGGTAATGTTACAAAGATATCTCGAATTAAAAATGAAAGATCATCTTAGTTTGCAATCTTCTCAGGAAGCA
AATGAGTTTCTTGAAATATCTACTTTTATTTATGAGAATGATTTCTTGAGAGAAAAGATTGAAGCAGTAAAAAACAAAAT
GAATTCTCACGAACTTTATTTTGAAAAAATAAAAAAAGAACAAAACACATGGCAGGATCTGTCCACAAAAGAACAAAAAT
TACAGCTTATTAAAGCATTGAAAGAAATTTCAGGAAATACAGAGAAGGACTCTCATTACGATAGACTTCTTGATGCTTTT
TTTAAAAAACATAATGAAAATATTCATAATAAAATACAAAGAATAAAAGACGAGTTCAAGGAATACTCCCGTGTAGCCAT
TCATAATATAGATAAAGTTATATTTAAAGGGCAAACACTGGATCGTCTTTATCATGAAGGATATGTATTTTCTGATATCA
ATACCTTGTCTCGTTATACACTACACGGACTAGGAATAACTGGTGTACATACTGAAGAAAACCTGCTACCAGCTCCTTCA
TCGTCCTTAATTAATATATTGAAAGAACATTATAATGAAGATGAAATTAGTGCGAAATTACCACTAGCATATGATTACAT
TTTAAATAAAAAAGAATCAAGCTCTATTCCTGTTGAAATTTTGAACAAACTTTCAGAGTTACCACCACATGAACTACTCA
CACCTGTTCTTGGCCAGAGTGTTAATCCTCTGGGCATGGGCTACTCATCCGATAATGGAAAAATCACAGAGCAAGTAATA
GTCAGTGGAGCTGATGGATTTGATAATCCCATATCTGGACTTATATATACCTATCTTGAAGATCTATATAACATCCATGT
AAGGATGCGAGAAGGTACACTAAATTCACAGAATCTTCGTCAGCTTCTGGAAAACTCTGTTTCTTCATGCTTTTTGACTG
AGCAAAGTATTAATAAATTACTTAGTGAGGCAGAAAAAAGGCCTTATCAGTCTTTAACAGAAATACATCAGCATCTCACA
GGATTACCAACTATTGCCGATGCAACCCTTTCATTACTTTCTGTTGGATTACCTGGTACGGGAAAACTATTGCGCAGGGA
GCAGGACTATGGGCGTCCACCAGTTACAGCAATTCAGGATTCCACATTTGTACTCCCTTATAATTTCAAAGGTATTGGTT
TTAACGATAACATTATATCCTCTGCACCTGTAGCCTCCTCATTACATTTTATCGCTGAACATGCGAAATATACTTTATTG
TCATGGCCTGAGTTTTATCGTCATCATGCACAGCGATGGTTCGAAATGGCTAAAGGATATGGAAGCCAGAATATTGATTT
TCACCCTCAGTCTCTATTGGTAACCCAAGAAGGACGCTGTATGGGATTAGCCTTACTTTATTTACAGACTGAAGATACTG
CTCATTATAGCATTCTCCAGGAAAACCTAATGACTGTGAGTGCACTTCATCAGACCAGTAATCGCGATAAGTTGCCACTG
TCCAAAGATGATAATTCCTTAATGACAAGAACTTATAGTCTGATTGAAATGCTACAGTATCAGGGAAACAAATATATTAC
CAACGAATCGCTACTACATAAGACCGCATGGAACCAAGAAAGAATAACTTTATTATTCAATGAAAAAGGAGTTAAGCGAG
CCCTGATAAGCACGCCTAATCATACTCTGGTTCTGCAACAACTGGAGGATATTTACCGGCTCACTGATCCAAATTTTGGG
CATGCAGATTTCCTTTCACCTATAGATGCTCTGAAGTTTATTGAGGCTATGATACAATTAACTCCAACACTTCAGGAATA
TTATGGCCTATTAAACAAAGACATTAATAAACATATACAAGTACATTATGCCGAATCAGATATGGTCTGGAATAAGCTTC
TGCCAGAAAATGATGCTGGACTGAGCACCAGAATTCAGCACACCACCACCGACCGTCTGGCGAATCTGGCTGAACCAGTC
GCTGTTGCAGGTATCTCCCTGCCAGTAAAAACACTTTATGATATCGGAGCCACCCTTGACGGTCGGCGCATCACCTCTCC
TCCAACATCGGAGCAAATCCCTTCTCTGCGTCTCAACGGTGATGTTCTGAATGATTATCTGTCCCGCACAGTTCTGACTC
CAGAACAGGCTGATAACATAAGAAAAATACTGCACACTCAGGGAATACGCAGCGGTACCCGTCCCATAGATCCGGAGATG
ATTCGTGGGACGCAGGATGACCTAGTTTCGTCACAGACTCGTCTGCAAAGGCAAGCAACACGGGTTAAACAGCAACTCGC
CGGTGTACTTGATACTCTGCAACAGCACTTCCAGAACATTCCACGTTCATCCGGTCGTCATCTTTCTGTAGAGAATATTG
AGCTGGCTGATATCGGAAGTGGGCGTTTCAACCTTCAAATTCGAGATGGAGAAACATTGCATACCACTTCTGTGGAAGTA
CCGGAAGTTGTGTCCCGTTTTCAGAAACTTTCTACCATGCTTTCAGCCCTGCCTGCCAGTGGAATCATGGATTTCGACCT
CGGCATGAGTGTGGTCGGAGTCGTCCAGTATGCCCGCCTGCTACAGCAGGGGCACGAAGACAGTACCCTAGCCAAGATAA
ATCTAGCTATGGATATCAAGCAACTTTCCGAAGCAACTCTCGGCAGCATGATTCAGATTGCCGGGAATAAGTTTCTCAAT
ACAGAAGGAATCCAGGGGTTCAGACTGGAAAGCGCCGTCGCTGAAGGTATGCGCTCAGTAGCAACCCGTACCGGAGGCAC
AATGGGGAAAGCCCTTTCTGCCAGTGCCCGTGTTCTTGAACTGCCTGTACTGGAAACAGTTCTGGGGACATGGAACCTGT
ACAACAGCGTCATTCAGCTCCAGCAAGCTACATCTTATTCTGAGACAATGGCTGCCCGGGTACAGATTGCGTTTGATTCC
ATTTCTCTGGGATTAACTGCCGCTTCGGTAGCTTTCCCGCCACTAATTATTGCCACTGGCCCCATTGCGGCTATTGGTAT
GGGAGCTTCCAGTATTGCACGTAATGTGGCACGGAAAGAAGAACGGCATACACAATGGCTGGAATATAAAAAATTCCTGA
CTGATGGCAGTAAACACATTGTTGTGGCCTCTCCGGAAAGAGGTCTGCTGGATTTCTCCGGAAACAAAGTTTTTGGAAAA
ATGGTGCTGGATCTGCGTCAGTCTCCTCCTCTCTTGCATGGAGAAAGCTCTTTTAACGCTGACCGCAAAATCGGTCATCG
TCCGGATCTGGGGGACTGGCAAATTCGTGAGAAGGTGGGGTATGCCAACAGTATCAGTCCCTACTCTTCTCTGGCGCACG
GTTACGCCAACAGTAAATGGCCACGAACAATACCGAAAATTCCCTCGGGAGAATATGACACCATAATTCTGGGCTACGGT
CACCAGTATCAGGCCAATACGGAAATAGAATATCTGTCTAACTGGATTGTATGGCGGGAAGCCGTACCAGACAGTACTTC
CCGCCACAAACGTCCTCCTCTGGAGGTTCTTAATAGTCAGTGTACTGTGATAGCTGGAGAGCGTAAAACCACAGTACTTC
CCCTGAGAGTGCTCAGCGATCTGACACCGGAATGCACAGAACAGGCTATATCGTTAAAAGATTATAAATTCATACTGAGA
GGGGGAAGCGGTGGGCTGGCTGTTCAGGTCGGTGGCGCGGGATATTATGATATTGATGCAAATCCTGTGGCAAAAGAAAA
TACGCTCTCTTTTCGCGGGCTACCGGAAGAGTTTCCGCTCACCTTTGATTTATCAAAACAAACACAGTCGGTCATGCTGA
AAACACCAGACGATGAGGTGCCGGTAATGACCATTACCCAGAAGGGAATAAACACCCTGGTAGGTACAGCCGCCGGTAAA
GACCGACTAATCGGTAACGATAAGGACAATACCTTCCATACAAGCTCTGGCGGCGGTACAGTCATCTCCGGAGGCGGGAA
TAACCGCTATATTATCCCCCGGGATTTAAAAACGCCGTTGACACTGACGCTGTCCAGTAACTCAGTCTCTCACGAAATCT
TTCTGCCAGAAACAACCCTAGCTGAATTAAAACCTGTCGCCTTTGAGCTGAGTTTGATTTACTGGGCCGGGAACAACATA
AATGTTCAACCAGAGGATGAAGCAAAACTGAACCACTTTGCCGGAAACTTCAGGGTGCATACCCGTGATGGCATGACTCT
GGAGGCGGTTTCCCGGGAAAATGGTATTCAACTGGCGATTTCATTATGTGATGTTCAACGCTGGCAGGCTGTTTATCCGG
AAGAAAATAACAGACCGGATGCCATACTGGACAGGCTGCATGATATGGGCTGGAGTCTGACACCAGAAGTCCGGTTCCAA
GGAGGAGAAACACAAGTCAGCTATGATCCCCTGACTCGTCAGCTCGTTTACCAGCTTCAGGCGCGTTACTCTGAATTCCA
GTTGGCCGGTAGTCGCCACCATACCACGGCTGTAACCGGAACTCCGGGAAGCCGATACATTATCATGAAGCCAGTTACAA
CACAGATATTACCGACACAAATCATACTGGCTGGTGATAATGACCATCCGGAAACGATTGATTTACTGGAAGCTAGTCCT
GTTCTGGTTGAAGGGAAAAAAGACAAAAACAGCGTGATATTAACGATTGCTACGATTCAGTATTCCCTTCAACTGACAAT
ATCCGGGATCGAAGAATCGCTGCCCGAGACAACCCGTGTGGCAATTCAGCCTCAGGATACCCGTTTACTGGGTGACGTAC
TCCGGATCTTACCAGATAATGGTAACTGGGTGGGGATTTTCCGGAGTGGTCATACACCAACGGTAAACCGGCTGGAAAAT
TTGATGGCACTGAATCAGGTAATGACGTTCCTGCCCCGGGTATCCGGAAGTGCAGAGCAGGTATTATGCCTCGAAAACCT
AGGTGGCGTAAGGAAAAAAGTGGAGGGGGAGTTACTGTCAGGGAAGCTGAAAGGTGCGTGGAAAGCCGAAGGTGAACCTA
CTGTTCCGGTAAATATCTCAGATCTAAGTATCCCACCCTATTCACGTCTGTATCTGATTTTTGAAGGGAAAAATAATGTG
TTGCTACGCAGTAAAGTACATGCAGCTCCGTTGAAAATAACATCCGCAGGAGAGATGCAGTTATCTGAAAGGCAGTGGCA
ACAGCAGGAACATATTATTGTCAAGCCCGACAACGAAGCCCCCTCATTAATACTCAGTGAATTTCGTCGTTTCACTATTT
CATCGGATAAAACATTTTCTTTAAAACTGATGTGCCATCAGGGTATGGTCCGCATCGACCGCAGATCATTATCGGTCAGA
TTGTTCTATCTGCGTGAACAGCCAGGTATCGGCAGTTTACGTCTGACGTTCAGAGATTTTTTCACAGAAGTGATGGATAC
AACGGACAGGGAAATTCTGGAGAAAGAGCTGAGACCAATTCTGATAGGAGATACACACCGCTTTATCAACGCTGCATACA
AAAATCATCTGAATATCCAGTTAGGAGATGGCGTTCTGAATCTGGCAGACATTGTTGCGGAATACGCCCGTATTCAAAAG
GAAGAAACATCAAAAATATTGTATCAATATCAAGGTGCCATGAAAAAAAAAACAGATGGACCATCTGTGGTAGAAGATGC
CATTATGACCACTACTGTCACAACAGATTCAGGTGAACTATTCCCTACCTTCCACCCGTGGTATACAGATGATTTATCAG
GGCGTTATAAGAGCGTACCTATGGCAAGAAAAGCAGATACTTTGTATCACCTGACACCAAAAGGTGATCTACAGATAATA
TATCAGGTAGCTACAAAAATGGTGAATCAGGCGATGATTGTATCCCTGCCAAACTACCGACACGAGTGGGAAAAATATAA
TTTAAGCATCTTATCCGAAATCCCTCAGAACAATAATACTGTTGTACATTCAATCCTCAGGGTTAATGGCCCCACAATGC
AGGTGCGCACAATTGACTACAGAGGAACGGATGAAAACAATCCCATAGTATCTTTTTCAGATACAACCTTCATCAATGGT
GAACAGATGTTGAGTTATGACTCGCATTCATCAGGGCGAGTCTATTCCAGAGAAGAATATATGATGTGGGAATTGCAGCA
ACGGGTATCAGAAGCTTCCAGTGCCCGGACACAGGATTACTGGCTGATGGATGCAGCGGTAAGAAACGGAGAATGGAAGA
TCACACCAGAATTATTACGTCACACACCGGGATATATCCGGAGTACGGTATCGAAATGGTCCAGAGGATGGCTGAAAACC
GGCACAATACTCCAGACTCCAGAAGACAGAAATACGGATGTATACCTGACTACCATACAGAACAATGTATTTAGTCGTCA
GGGGGGCGGCTACCAAGTGTATTATCGGATTGATGGAATGGCTGGTGCGGATATAGCGGATAATGCACCAGGGGAAACCC
GCTGCACCCTCAGGCCCGGAACATGTTTTGAAGTGACAAGTGTGGATGAAAGGCATTATGAGTGGAATATCATTTATGTC
ACGCTGAAAACCTGTGGCTGGAGCCGAAATGGCCAAAGCAAAACGCCGAATGGTGACAACCTTTTTAACTAA

Protein sequence :
MRLPEKVLFPPVTSGLSGQEKQKKPKSITGFQENYQRNIRPIKTASEARLRFFDKMVSKENSLEDVVSLGEMIQKEIYGH
KQRTFSPVHHTGNWKSSLLHNALLGLANVYNGLRETEYPNTFNRDGIKSTNSFRDNSLTKTRTPRDNFEEGIKHPEHATI
PYDNDNESNKLLKAGKIAGNNNELLMEIKKESQSDHQIPLSDKFLKRKKRSPVAEDKVQNSLTPENFVQKISLSDELKTK
YANEIIEIKRIMGEYNLLPDKNSRNGLKLLQKQADLLKIIMEDTSVTENTFKNIEMAIADIKREYYSHTVDIEKNIHAIW
VAGSPPESISDYIKTFLKTYKEFTYYLWVDEKAFGAAKFTSVLKQIAFDLACRTIQQNTPQKNIDFINLYNEIRKKYNNN
PSGQQEYLNKLRELYATYQKISTPLKHMFNSFFLENMIKLQDNFFNYCIVKGVTEINDELRINYLKNVIKLSDDDIGNYQ
KTINDNKDRVKKLILDLQKQFGENRISIKDVNSLTSLSKSENNHNYQTEMLLRWNYPAASDLLRMYILKEHGGIYTDTDM
MPAYSKQVIFKIMMQTNGDNRFLEDLKLRRAISDGVLRYVNNQNIDEVNYNEISDADKNIIKKILTEISKMPEDSIFTKI
NTRIPRDTMPILRRYHLWPDGWNIRGLNGFMLSHKGSEVIDAVIAGQNQAYRELRRIRDNIHSEIYFKQTDELSSLPDTD
KIGGILVKKYLSGSLFSKFRQDTIIPEALSTLQISGPDLIQRKMLQFFRSRGVLGEEFINERKLSDKAYIGVYKTTGTGK
YDWLTPESIGVNDVTPADESTWCIGKGRCVDDFLFKDVSTLKTENLPELFLTKIDTDTFFSQWSTKTKKDLQKKIQDLTV
RYNELIDSSTIDFKNLYEIDQMLHMIMLEMNDDIAKRSLFSLQVQIAEKIRRMTIPVDNIINIYPDLHKKNDNDLSMSIK
GFLASNPHTKINILYSNKTEHNIFIKDLFSFAVMENELRDIINNMSKDKTPENWEGRVMLQRYLELKMKDHLSLQSSQEA
NEFLEISTFIYENDFLREKIEAVKNKMNSHELYFEKIKKEQNTWQDLSTKEQKLQLIKALKEISGNTEKDSHYDRLLDAF
FKKHNENIHNKIQRIKDEFKEYSRVAIHNIDKVIFKGQTLDRLYHEGYVFSDINTLSRYTLHGLGITGVHTEENLLPAPS
SSLINILKEHYNEDEISAKLPLAYDYILNKKESSSIPVEILNKLSELPPHELLTPVLGQSVNPLGMGYSSDNGKITEQVI
VSGADGFDNPISGLIYTYLEDLYNIHVRMREGTLNSQNLRQLLENSVSSCFLTEQSINKLLSEAEKRPYQSLTEIHQHLT
GLPTIADATLSLLSVGLPGTGKLLRREQDYGRPPVTAIQDSTFVLPYNFKGIGFNDNIISSAPVASSLHFIAEHAKYTLL
SWPEFYRHHAQRWFEMAKGYGSQNIDFHPQSLLVTQEGRCMGLALLYLQTEDTAHYSILQENLMTVSALHQTSNRDKLPL
SKDDNSLMTRTYSLIEMLQYQGNKYITNESLLHKTAWNQERITLLFNEKGVKRALISTPNHTLVLQQLEDIYRLTDPNFG
HADFLSPIDALKFIEAMIQLTPTLQEYYGLLNKDINKHIQVHYAESDMVWNKLLPENDAGLSTRIQHTTTDRLANLAEPV
AVAGISLPVKTLYDIGATLDGRRITSPPTSEQIPSLRLNGDVLNDYLSRTVLTPEQADNIRKILHTQGIRSGTRPIDPEM
IRGTQDDLVSSQTRLQRQATRVKQQLAGVLDTLQQHFQNIPRSSGRHLSVENIELADIGSGRFNLQIRDGETLHTTSVEV
PEVVSRFQKLSTMLSALPASGIMDFDLGMSVVGVVQYARLLQQGHEDSTLAKINLAMDIKQLSEATLGSMIQIAGNKFLN
TEGIQGFRLESAVAEGMRSVATRTGGTMGKALSASARVLELPVLETVLGTWNLYNSVIQLQQATSYSETMAARVQIAFDS
ISLGLTAASVAFPPLIIATGPIAAIGMGASSIARNVARKEERHTQWLEYKKFLTDGSKHIVVASPERGLLDFSGNKVFGK
MVLDLRQSPPLLHGESSFNADRKIGHRPDLGDWQIREKVGYANSISPYSSLAHGYANSKWPRTIPKIPSGEYDTIILGYG
HQYQANTEIEYLSNWIVWREAVPDSTSRHKRPPLEVLNSQCTVIAGERKTTVLPLRVLSDLTPECTEQAISLKDYKFILR
GGSGGLAVQVGGAGYYDIDANPVAKENTLSFRGLPEEFPLTFDLSKQTQSVMLKTPDDEVPVMTITQKGINTLVGTAAGK
DRLIGNDKDNTFHTSSGGGTVISGGGNNRYIIPRDLKTPLTLTLSSNSVSHEIFLPETTLAELKPVAFELSLIYWAGNNI
NVQPEDEAKLNHFAGNFRVHTRDGMTLEAVSRENGIQLAISLCDVQRWQAVYPEENNRPDAILDRLHDMGWSLTPEVRFQ
GGETQVSYDPLTRQLVYQLQARYSEFQLAGSRHHTTAVTGTPGSRYIIMKPVTTQILPTQIILAGDNDHPETIDLLEASP
VLVEGKKDKNSVILTIATIQYSLQLTISGIEESLPETTRVAIQPQDTRLLGDVLRILPDNGNWVGIFRSGHTPTVNRLEN
LMALNQVMTFLPRVSGSAEQVLCLENLGGVRKKVEGELLSGKLKGAWKAEGEPTVPVNISDLSIPPYSRLYLIFEGKNNV
LLRSKVHAAPLKITSAGEMQLSERQWQQQEHIIVKPDNEAPSLILSEFRRFTISSDKTFSLKLMCHQGMVRIDRRSLSVR
LFYLREQPGIGSLRLTFRDFFTEVMDTTDREILEKELRPILIGDTHRFINAAYKNHLNIQLGDGVLNLADIVAEYARIQK
EETSKILYQYQGAMKKKTDGPSVVEDAIMTTTVTTDSGELFPTFHPWYTDDLSGRYKSVPMARKADTLYHLTPKGDLQII
YQVATKMVNQAMIVSLPNYRHEWEKYNLSILSEIPQNNNTVVHSILRVNGPTMQVRTIDYRGTDENNPIVSFSDTTFING
EQMLSYDSHSSGRVYSREEYMMWELQQRVSEASSARTQDYWLMDAAVRNGEWKITPELLRHTPGYIRSTVSKWSRGWLKT
GTILQTPEDRNTDVYLTTIQNNVFSRQGGGYQVYYRIDGMAGADIADNAPGETRCTLRPGTCFEVTSVDERHYEWNIIYV
TLKTCGWSRNGQSKTPNGDNLFN
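
The protein sequence corresponds to a codon-by-codon translation of the DNA sequence. The following is a minimal sketch of that translation, assuming the standard codon assignments and that the DNA is already in coding orientation (the sequence shown begins with ATG even though the Strand field is "-"). The names CODON_TABLE and translate are illustrative only; codons containing ambiguous bases such as N are not handled and would raise a KeyError.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                    # stop codon ends the product
            break
        protein.append(aa)
    return "".join(protein)


# First and last 21 nucleotides of the DNA sequence in this record.
print(translate("ATGAGACTGCCAGAGAAAGTT"))   # MRLPEKV
print(translate("GGTGACAACCTTTTTAACTAA"))   # GDNLFN
```

Passing the full DNA sequence (line breaks are ignored) should reproduce the protein sequence listed above, provided both were copied without loss.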

• Homologs from PAI DB

Gene           GenBank Accn    Product                 Virulence or Resistance  PAI or REI  Alignment Type  E-value  Identity (%)
efa1/lifA      YP_003223428.1  Efa1/LifA-like protein  Virulence                LEE         Protein         0.0      100
efa1-lifA-tox  CAC81883.1      Efa1-LifA-Tox protein   Virulence                LEE II      Protein         0.0      99
efa1           AAL57562.1      Efa1                    Virulence                LEE         Protein         0.0      99
efa1/lifA      YP_003232172.1  Efa1/LifA               Not tested               LEE         Protein         0.0      99
efa1/lifA      CAI43818.1      Efa1/LifA protein       Not tested               LEE         Protein         0.0      99

• Homologs from VFDB (virulence genes)

Gene         GenBank Accn    Product                 ID of source DB  Alignment Type  E-value  Identity (%)
ECO103_4914  YP_003224732.1  Efa1/LifA-like protein  VFG1534          Protein         0.0      99