Gene Information

Name : efa1/lifA (ECO111_5002)
Accession : YP_003237286.1
Strain : Escherichia coli 11128
Genome accession : NC_013364
Putative virulence/resistance : Virulence
Product : Efa1/LifA-like protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 5087662 - 5097333 bp
Length : 9672 bp
Strand : -
Note : Integrative element ECO111_IE06a
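
The Position and Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, not stated in the record) and that the gene is a complete coding sequence ending in one stop codon. The function names `span_length` and `protein_length` are illustrative only.

```python
# Values copied from the record above.
start, end = 5087662, 5097333   # assumed 1-based, inclusive genome coordinates
gene_length_bp = 9672           # the Length field


def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based inclusive interval."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start, end))         # 9672, matches the Length field
print(protein_length(gene_length_bp))  # 3223 residues expected
```

Under these assumptions, the span agrees with the stated length, and the expected product is 3223 residues long.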

DNA sequence :
ATGAGACTGCCAGAGAAAGTTCTTTTTCCTCCTGTCACTAGTGGCCTGTCAGGGCAGGAAAAACAAAAAAAACCGAAGAG
CATTACCGGATTTCAGGAAAATTATCAACGCAATATCAGGCCAATCAAAACAGCATCAGAAGCCCGACTACGCTTCTTTG
ATAAAATGGTTTCGAAAGAAAACTCTCTGGAAGATGTTGTTTCTTTAGGTGAAATGATTCAGAAGGAAATTTATGGGCAT
GAACAAAGAACATTTTCACCAGTTCATCATACAGGTAACTGGAAATCATCATTGTTACACAACGCGCTCCTTGGTCTGGC
AAATGTTTATAATGGCTTACGGGAAACAGAATACCCTAACACTTTCAACAGAGATGGTATAAAAAGTACTAACTCTTTTA
GAGATAACTTATTGACAAAAACAAGAACTCCCAGAGATAATTTTGAGGAAGGAATAAAACATCCTGAACATGCAACAATA
CCATATGACAACGACAATGAAAGTAATAAATTGCTAAAAGCAGGAAAGATAGCTGGTAACAATAACGAGCTGTTGATGGA
AATAAAAAAGGAATCCCAAAGCGACCATCAAATCCCCCTGTCAGATAAGTTCCTGAAAAGGAAAAAACGATCTCCTGTAG
CTGAAGATAAAGTTCAAAACTCGTTAACACCAGAAAATTTTGTTCAGAAAATTTCACTTAGTGATGAGCTTAAAACAAAA
TATGCAAATGAAATTATAGAGATAAAAAGAATAATGGGAGAATACAATCTTTTACCGGATAAAAACAGTCGTAATGGTCT
AAAACTTCTACAGAAGCAAGCTGATTTACTAAAAATAATCATGGAGGATACCTCTGTTACAGAAAATACCTTTAAAAACA
TAGAGATGGCTATAACAGATATAAAAAGGGAGTACTATTCGCATACAGTTGATATTGAGAAAAATATTCATGCCATATGG
GTTGCGGGTTCCCCACCTGAAAGCATTTCGGACTATATTAAGACTTTTCTCAAAACCTACAAAGAGTTTACTTACTATCT
TTGGGTTGATGAAAAAGCATTTGGAGCCGCGAAATTTACCAGTGTCTTAAAACAAATTGCATTTGATTTAGCATGCAGAA
CTATACAGCAGAATACTCCACAAAAAAATATTGATTTTATTAATCTATATAATGAAATAAGGAAGAAATACAACAACAAC
CCATCGGGACAACAAGAGTACTTAAACAAACTCAGAGAGCTTTATGCTACTTATCAAAAAATATCCACCCCTCTGAAACA
CATGTTTAATTCATTTTTTTTAGAAAACATGATTAAACTTCAGGATAATTTCTTCAACTATTGCATTGTCAAAGGTGTTA
CAGAGATTAATGATGAGTTACGAATAAACTACCTTAAAAATGTAATAAAACTGTCAGACGATGACATTGGTAATTATCAG
AAAACAATTAACGACAATAAAGATAGAGTAAAAAAACTAATTCTTGATTTACAGAAACAATTTGGTGAAAACCGCATTTC
AATTAAAGATGTTAACTCTTTAACCTCTCTTTCAAAATCAGAAAATAATCACAATTATCAAACTGAAATGTTGCTTCGAT
GGAACTATCCTGCCGCCTCAGACCTGCTCAGGATGTATATCCTTAAAGAGCATGGTGGTATTTATACAGACACAGATATG
ATGCCTGCATACTCTAAACAAGTAATTTTTAAAATTATGATGCAGACAAACGGAGATAATCGTTTCTTGGAAGATTTGAA
ACTACGTCGTGCAATATCTGATGGTGTATTAAGATATGTTAATAACCAAAATATTGATGAAGTTAACTATAATGAAATCA
GTGATGCAGATAAAAACATTATCAAGAAGATATTAACAGAAATATCTAAAATGCCAGAAGATAGTATTTTTACTAAGATC
AATACAAGGATTCCTCGAGACACAATGCCCATCCTTCGTCGTTATCACCTATGGCCTGATGGATGGAATATTCGTGGGCT
CAATGGATTCATGCTATCACATAAAGGTAGTGAAGTGATTGATGCTGTTATCGCAGGCCAGAATCAGGCTTACAGGGAAC
TAAGAAGAATAAGAGATAATATTCATAGTGAAATATACTTCAAACAAACTGATGAATTGTCCTCACTTCCAGATACAGAC
AAAATTGGAGGGATTCTGGTAAAAAAATACCTTTCAGGAAGTCTTTTTTCAAAATTCAGACAAGATACTATTATCCCGGA
GGCATTGAGCACCCTCCAGATATCAGGTCCTGACTTGATTCAAAGAAAGATGTTGCAATTTTTCAGGAGTAGAGGGGTGT
TAGGTGAAGAGTTCATTAATGAAAGAAAACTGAGTGATAAAGCTTATATTGGTGTCTACAAAACAACTGGCACAGGGAAA
TATGACTGGTTAACCCCTGAATCGATCGGCGTTAATGATGTCACGCCCGCAGATGAAAGTACCTGGTGTATAGGAAAAGG
CCGGTGTGTTGATGACTTCCTGTTCAAAGATGTTTCAACACTAAAAACAGAAAATCTTCCAGAATTATTCTTAACAAAAA
TAGATACTGATACGTTTTTCTCTCAGTGGTCAACCAAAACCAAGAAAGATCTGCAAAAAAAAATACAAGACCTTACTGTA
CGTTATAATGAGCTAATTGACTCGTCGACTATCGACTTTAAAAATCTATATGAAATAGATCAAATGCTCCATATGATTAT
GCTAGAGATGAATGATGATATAGCCAAAAGGTCATTGTTTTCATTGCAAGTTCAAATAGCCGAAAAAATTCGGAGGATGA
CCATTCCTGTAGACAATATAATTAACATCTATCCTGATCTACATAAAAAAAATGACAATGATCTGAGTATGTCCATAAAA
GGCTTTCTTGCGAGTAATCCACATACAAAAATAAATATTCTTTATAGCAATAAGACTGAGCATAATATTTTTATAAAGGA
TTTATTCTCCTTCGCAGTTATGGAAAATGAGTTAAGAGACATTATCAATAACATGAGCAAAGATAAGACTCCTGAAAACT
GGGAAGGGAGGGTAATGTTACAAAGATATCTCGAATTAAAAATGAAAGATCATCTTAGTTTGCAATCTTCTCAGGAAGCA
AATGAGTTTCTTGAAATATCTACTTTTATTTATGAGAATGATTTCTTGAGAGAAAAGATTGAAGCAGTAAAAAACAAAAT
GAATTCTCACGAACTTTATTTTGAAAAAATAAAAAAAGAACAAAACACATGGCAGGATCTGTCCACAAAAGAACAAAAAT
TACAGCTTATTAAAGCATTGAAAGAAATTTCAGGAAATACAGAGAAGGACTCTCATTACGATAGACTTCTTGATGCTTTT
TTTAAAAAACATAATGAAAATATTCATAATAAAATACAAAGAATAAAAGACGAGTTCAAGGAATACTCCCGTGTAGCCAT
TCATAATATAGATAAAGTTATATTTAAAGGGCAAACACTGGATCGTCTTTATCATGAAGGATATGTATTTTCTGATATCA
ATACCTTGTCTCGTTATACACTACACGGACTAGGAATAACTGGTGTACATACTGAAGAAAACCTGCTACCAGCTCCTTCA
TCGTCCTTAATTAATATATTGAAAGAACATTATAATGAAGATGAAATTAGTGCGAAATTACCACTAGCATATGATTACAT
TTTAAATAAAAAAGAATCAAGCTCTATTCCTGTTGAAATTTTGAACAAACTTTCAGAGTTACCACCACATGAACTACTCA
CACCTGTTCTTGGCCAGAGTGTTAATCCTCTGGGCATGGGCTACTCATCCGATAATGGAAAAATCACAGAGCAAGTAATA
GTCAGTGGAGCTGATGGATTTGATAATCCCATATCTGGACTTATATATACCTATCTTGAAGATCTATATAACATCCATGT
AAGGATGCGAGAAGGTACACTAAATTCACAGAATCTTCGTCAGCTTCTGGAAAACTCTGTTTCTTCATGCTTTTTGACTG
AGCAAAGTATTAATAAATTACTTAGTGAGGCAGAAAAAAGGCCTTATCAGTCTTTAACAGAAATACATCAGCATCTCACA
GGATTACCAACTATTGCCGATGCAACCCTTTCATTACTTTCTGTTGGATTACCTGGTACGGGAAAACTATTGCGCAGGGA
GCAGGACTATGGGCGTCCACCAGTTACAGCAATTCAGGATTCCACATTTGTACTCCCTTATAATTTCAAAGGTATTGGTT
TTAACGATAACATTATATCCTCTGCACCTGTAGCCTCCTCATTACATTTTATCGCTGAACATGCGAAATATACTTTATTG
TCATGGCCTGAGTTTTATCGTCATCATGCACAGCGATGGTTCGAAATGGCTAAAGGATATGGAAGCCAGAATATTGATTT
TCACCCTCAGTCTCTATTGGTAACCCAAGAAGGACGCTGTATGGGATTAGCCTTACTTTATTTACAGACTGAAGATACTG
CTCATTATAGCATTCTCCAGGAAAACCTAATGACTGTGAGTGCACTTCATCAGACCAGTAATCGCGATAAGTTGCCACTG
TCCAAAGATGATAATTCCTTAATGACAAGAACTTATAGTCTGATTGAAATGCTACAGTATCAGGGAAACAAATATATTAC
CAACGAATCGCTACTACATAAGACCGCATGGAACCAAGAAAGAATAACTTTATTATTCAATGAAAAAGGAGTTAAGCGAG
CCCTGATAAGCACGCCTAATCATACTCTGGTTCTGCAACAACTGGAGGATATTTACCGGCTCACTGATCCAAATTTTGGG
CATGCAGATTTCCTTTCACCTATAGATGCTCTGAAGTTTATTGAGGCTATGATACAATTAACTCCAACACTTCAGGAATA
TTATGGCCTATTAAACAAAGACATTAATAAACATATACAAGTACATTATGCCGAATCAGATATGGTCTGGAATAAGCTTC
TGCCAGAAAATGATGCTGGACTGAGCACCAGAATTCAGCACACCACCACCGACCGTCTGGCGAATCTGGCTGAACCAGTC
GCTGTTGCAGGTATCTCCCTGCCAGTAAAAACACTTTATGATATCGGAGCCACCCTTGACGGTCGGCGCATCACCTCTCC
TCCAACATCGGAGCAAATCCCTTCTCTGCGTCTCAACGGTGATGTTCTGAATGATTATCTGTCCCGCACAGTTCTGACTC
CAGAACAGGCTGATAACATAAGAAAAATACTGCACACTCAGGGAATACGCAGCGGTACCCGTCCCATAGATCCGGAGATG
ATTCGTGGGACGCAGGATGACCTAGTTTCGTCACAGACTCGTCTGCAAAGGCAAGCAACACGGGTTAAACAGCAACTCGC
CGGTGTACTTGATACTCTGCAACAGCACTTCCAGAACATTCCACGTTCATCCGGTCGTCATCTTTCTGTAGAGAATATTG
AGCTGGCTGATATCGGAAGTGGGCGTTTCAACCTTCAAATTCGAGATGGAGAAACATTGCATACCACTTCTGTGGAAGTA
CCGGAAGTTGTGTCCCGTTTTCAGAAACTTTCTACCATGCTTTCAGCCCTGCCTGCCAGTGGAATCATGGATTTCGACCT
CGGCATGAGTGTGGTCGGAGTCGTCCAGTATGCCCGCCTGCTACAGCAGGGGCACGAAGACAGTACCCTAGCCAAGATAA
ATCTAGCTATGGATATCAAGCAACTTTCCGAAGCAACTCTCGGCAGCATGATTCAGATTGCCGGGAATAAGTTTCTCAAT
ACAGAAGGAATCCAGGGGTTCAGACTGGAAAGCGCCGTCGCTGAAGGTATGCGCTCAGTAGCAACCCGTACCGGAGGCAC
AATGGGGAAAGCCCTTTCTGCCAGTGCCCGTGTTCTTGAACTGCCTGTACTGGAAACAGTTCTGGGGACATGGAACCTGT
ACAACAGCGTCATTCAGCTCCAGCAAGCTACATCTTATTCTGAGACAATGGCTGCCCGGGTACAGATTGCGTTTGATTCC
ATTTCTCTGGGATTAACTGCCGCTTCGGTAGCTTTCCCGCCACTAATTATTGCCACTGGCCCCATTGCGGCTATTGGTAT
GGGAGCTTCCAGTATTGCACGTAATGTGGCACGGAAAGAAGAACGGCATACACAATGGCTGGAATATAAAAAATTCCTGA
CTGATGGCAGTAAACACATTGTTGTGGCCTCTCCGGAAAGAGGTCTGCTGGATTTCTCCGGAAACAAAGTTTTTGGAAAA
ATGGTGCTGGATCTGCGTCAGTCTCCTCCTCTCTTGCATGGAGAAAGCTCTTTTAACGCTGACCGCAAAATCGGTCATCG
TCCGGATCTGGGGGACTGGCAAATTCGTGAGAAGGTGGGGTATGCCAACAGTATCAGTCCCTACTCTTCTCTGGCGCACG
GTTACGCCAACAGTAAATGGCCACGAACAATACCGAAAATTCCCTCGGGAGAATATGACACCATAATTCTGGGCTACGGT
CACCAGTATCAGGCCAATACGGAAATAGAATATCTGTCTAACTGGATTGTATGGCGGGAAGCCGTACCAGACAGTACTTC
CCGCCACAAACGTCCTCCTCTGGAGGTTCTTAATAGTCAGTGTACTGTGATAGCTGGAGAGCGTAAAACCACAGTACTTC
CCCTGAGAGTGCTCAGCGATCTGACACCGGAATGCACAGAACAGGCTATATCGTTAAAAGATTATAAATTCATACTGAGA
GGGGGAAGCGGTGGGCTGGCTGTTCAGGTCGGTGGCGCGGGATATTATGATATTGATGCAAATCCTGTGGCAAAAGAAAA
TACGCTCTCTTTTCGCGGGCTACCGGAAGAGTTTCCGCTCACCTTTGATTTATCAAAACAAACACAGTCGGTCATGCTGA
AAACGCCAGACGATGAGGTGCCGGTAATGACCATTACCCAGAAGGGAATAAACACCCTGGTAGGTACAGCCGCTGGTAAA
GACCGACTAATCGGTAACGATAAGGACAATACCTTCCATACAAGCTCTGGCGGCGGTACAGTCATCTCCGGAGGCGGGAA
TAACCGCTATATTATCCCCCGGGATTTAAAAACGCCGTTGACACTGACGCTGTCCAGTAACTCAGTCTCTCACGAAATCT
TTCTGCCAGAAACAACCCTAGCTGAATTAAAACCTGTCGCCTTTGAGCTGAGTTTGATTTACTGGGCCGGGAACAACATA
AATGTTCAACCAGAGGATGAAGCAAAACTGAACCACTTTGCCGGAAACTTCAGGGTGCATACCCGTGATGGCATGACTCT
GGAGGCGGTTTCCCGGGAAAATGGTATTCAACTGGCGATTTCATTATGTGATGTTCAACGCTGGCAGGCTGTTTATCCGG
AAGAAAATAACAGACCGGATGCCATACTGGACAGGCTGCATGATATGGGCTGGAGTCTGACACCAGAAGTCCGGTTCCAA
GGAGGAGAAACACAAGTCAGCTATGATCCCCTGACTCGTCAGCTCGTTTACCAGCTTCAGGCGCGTTACTCTGAATTCCA
GTTGGCCGGTAGTCGCCACCATACCACGGCTGTAACCGGAACTCCGGGAAGCCGATACATTATCATGAAGCCAGTTACAA
CACAGATATTACCGACACAAATCATACTGGCTGGTGATAATGACCATCCGGAAACGATTGATTTACTGGAAGCTAGTCCT
GTTCTGGTTGAAGGGAAAAAAGACAAAAACAGCGTGATATTAACGATTGCTACGATTCAGTATTCCCTTCAACTGACAAT
ATCCGGGATCGAAGAATCGCTGCCCGAGACAACCCGTGTGGCAATTCAGCCTCAGGATACCCGTTTACTGGGTGACGTAC
TCCGGATCTTACCAGATAATGGTAACTGGGTGGGGATTTTCCGGAGTGGTCATACACCAACGGTAAACCGGCTGGAAAAT
TTGATGGCACTGAATCAGGTAATGACGTTCCTGCCCCGGGTATCCGGAAGTGCAGAGCAGGTATTATGCCTCGAAAACCT
AGGTGGCGTAAGGAAAAAAGTGGAGGGGGAGTTACTGTCAGGGAAGCTGAAAGGTGCGTGGAAAGCCGAAGGTGAACCTA
CTGTTCCGGTAAATATCTCAGATCTAAGTATCCCACCCTATTCACGTCTGTATCTGATTTTTGAAGGGAAAAATAATGTG
TTGCTACGCAGTAAAGTACATGCAGCTCCGTTGAAAATAACATCCGCAGGAGAGATGCAGTTATCTGAAAGGCAGTGGCA
ACAGCAGGAACATATTATTGTCAAGCCCGACAACGAAGCCCCCTCATTAATACTCAGTGAATTTCGTCGTTTCACTATTT
CATCGGATAAAACATTTTCTTTAAAACTGATGTGCCATCAGGGTATGGTCCGCATCGACCGCAGATCATTATCGGTCAGA
TTGTTCTATCTGCGTGAACAGCCAGGTATCGGCAGTTTACGTCTGACGTTCAGAGATTTTTTCACAGAAGTGATGGATAC
AACGGACAGGGAAATTCTGGAGAAAGAGCTGAGACCAATTCTGATAGGAGATACACACCGCTTTATCAACGCTGCATACA
AAAATCATCTGAATATCCAGTTAGGAGATGGCGTTCTGAATCTGGCAGACATTGTTGCGGAATACGCCCGTATTCAAAAG
GAAGAAACATCAAAAATATTGTATCAATATCAAGGTGCCATGAAAAAAAAAACAGATGGACCATCTGTGGTAGAAGATGC
CATTATGACCACTACTGTCACAACAGATTCAGGTGAACTATTCCCTACCTTCCACCCGTGGTATACAGATGATTTATCAG
GGCGTTATAAGAGCGTACCTATGGCAAGAAAAGCAGATACTTTGTATCACCTGACACCAAAAGGTGATCTACAGATAATA
TATCAGGTAGCTACAAAAATGGTGAATCAGGCGATGATTGTATCCCTGCCAAACTACCGACACGAGTGGGAAAAATATAA
TTTAAGCATCTTATCCGAAATCCCTCAGAACAATAATACTGTTGTACATTCAATCCTCAGGGTTAATGGCCCCACAATGC
AGGTGCGCACAATTGACTACAGAGGAACGGATGAAAACAATCCCATAGTATCTTTTTCAGATACAACCTTCATCAATGGT
GAACAGATGTTGAGTTATGACTCGCATTCATCAGGGCGAGTCTATTCCAGAGAAGAATATATGATGTGGGAATTGCAGCA
ACGGGTATCAGAAGCTTCCAGTGCCCGGACACAGGATTACTGGCTGATGGATGCAGCGGTAAGAAACGGAGAATGGAAGA
TCACACCAGAATTATTACGTCACACACCGGGATATATCCGGAGTACGGTATCGAAATGGTCCAGAGGATGGCTGAAAACC
GGCACAATACTCCAGACTCCAGAAGACAGAAATACGGATGTATACCTGACTACCATACAGAACAATGTATTTAGTCGTCA
GGGGGGCGGCTACCAAGTGTATTATCGGATTGATGGAATGGCTGGTGCGGATATAGCGGATAATGCACCAGGGGAAACCC
GCTGCACCCTCAGGCCCGGAACATGTTTTGAAGTGACAAGTGTGGATGAAAGGCATTATGAGTGGAATATCATTTATGTC
ACGCTGAAAACCTGTGGCTGGAGCCGAAATGGCCAAAGCAAAACGCCGAATGGTGACAACCTTTTTAACTAA

Protein sequence :
MRLPEKVLFPPVTSGLSGQEKQKKPKSITGFQENYQRNIRPIKTASEARLRFFDKMVSKENSLEDVVSLGEMIQKEIYGH
EQRTFSPVHHTGNWKSSLLHNALLGLANVYNGLRETEYPNTFNRDGIKSTNSFRDNLLTKTRTPRDNFEEGIKHPEHATI
PYDNDNESNKLLKAGKIAGNNNELLMEIKKESQSDHQIPLSDKFLKRKKRSPVAEDKVQNSLTPENFVQKISLSDELKTK
YANEIIEIKRIMGEYNLLPDKNSRNGLKLLQKQADLLKIIMEDTSVTENTFKNIEMAITDIKREYYSHTVDIEKNIHAIW
VAGSPPESISDYIKTFLKTYKEFTYYLWVDEKAFGAAKFTSVLKQIAFDLACRTIQQNTPQKNIDFINLYNEIRKKYNNN
PSGQQEYLNKLRELYATYQKISTPLKHMFNSFFLENMIKLQDNFFNYCIVKGVTEINDELRINYLKNVIKLSDDDIGNYQ
KTINDNKDRVKKLILDLQKQFGENRISIKDVNSLTSLSKSENNHNYQTEMLLRWNYPAASDLLRMYILKEHGGIYTDTDM
MPAYSKQVIFKIMMQTNGDNRFLEDLKLRRAISDGVLRYVNNQNIDEVNYNEISDADKNIIKKILTEISKMPEDSIFTKI
NTRIPRDTMPILRRYHLWPDGWNIRGLNGFMLSHKGSEVIDAVIAGQNQAYRELRRIRDNIHSEIYFKQTDELSSLPDTD
KIGGILVKKYLSGSLFSKFRQDTIIPEALSTLQISGPDLIQRKMLQFFRSRGVLGEEFINERKLSDKAYIGVYKTTGTGK
YDWLTPESIGVNDVTPADESTWCIGKGRCVDDFLFKDVSTLKTENLPELFLTKIDTDTFFSQWSTKTKKDLQKKIQDLTV
RYNELIDSSTIDFKNLYEIDQMLHMIMLEMNDDIAKRSLFSLQVQIAEKIRRMTIPVDNIINIYPDLHKKNDNDLSMSIK
GFLASNPHTKINILYSNKTEHNIFIKDLFSFAVMENELRDIINNMSKDKTPENWEGRVMLQRYLELKMKDHLSLQSSQEA
NEFLEISTFIYENDFLREKIEAVKNKMNSHELYFEKIKKEQNTWQDLSTKEQKLQLIKALKEISGNTEKDSHYDRLLDAF
FKKHNENIHNKIQRIKDEFKEYSRVAIHNIDKVIFKGQTLDRLYHEGYVFSDINTLSRYTLHGLGITGVHTEENLLPAPS
SSLINILKEHYNEDEISAKLPLAYDYILNKKESSSIPVEILNKLSELPPHELLTPVLGQSVNPLGMGYSSDNGKITEQVI
VSGADGFDNPISGLIYTYLEDLYNIHVRMREGTLNSQNLRQLLENSVSSCFLTEQSINKLLSEAEKRPYQSLTEIHQHLT
GLPTIADATLSLLSVGLPGTGKLLRREQDYGRPPVTAIQDSTFVLPYNFKGIGFNDNIISSAPVASSLHFIAEHAKYTLL
SWPEFYRHHAQRWFEMAKGYGSQNIDFHPQSLLVTQEGRCMGLALLYLQTEDTAHYSILQENLMTVSALHQTSNRDKLPL
SKDDNSLMTRTYSLIEMLQYQGNKYITNESLLHKTAWNQERITLLFNEKGVKRALISTPNHTLVLQQLEDIYRLTDPNFG
HADFLSPIDALKFIEAMIQLTPTLQEYYGLLNKDINKHIQVHYAESDMVWNKLLPENDAGLSTRIQHTTTDRLANLAEPV
AVAGISLPVKTLYDIGATLDGRRITSPPTSEQIPSLRLNGDVLNDYLSRTVLTPEQADNIRKILHTQGIRSGTRPIDPEM
IRGTQDDLVSSQTRLQRQATRVKQQLAGVLDTLQQHFQNIPRSSGRHLSVENIELADIGSGRFNLQIRDGETLHTTSVEV
PEVVSRFQKLSTMLSALPASGIMDFDLGMSVVGVVQYARLLQQGHEDSTLAKINLAMDIKQLSEATLGSMIQIAGNKFLN
TEGIQGFRLESAVAEGMRSVATRTGGTMGKALSASARVLELPVLETVLGTWNLYNSVIQLQQATSYSETMAARVQIAFDS
ISLGLTAASVAFPPLIIATGPIAAIGMGASSIARNVARKEERHTQWLEYKKFLTDGSKHIVVASPERGLLDFSGNKVFGK
MVLDLRQSPPLLHGESSFNADRKIGHRPDLGDWQIREKVGYANSISPYSSLAHGYANSKWPRTIPKIPSGEYDTIILGYG
HQYQANTEIEYLSNWIVWREAVPDSTSRHKRPPLEVLNSQCTVIAGERKTTVLPLRVLSDLTPECTEQAISLKDYKFILR
GGSGGLAVQVGGAGYYDIDANPVAKENTLSFRGLPEEFPLTFDLSKQTQSVMLKTPDDEVPVMTITQKGINTLVGTAAGK
DRLIGNDKDNTFHTSSGGGTVISGGGNNRYIIPRDLKTPLTLTLSSNSVSHEIFLPETTLAELKPVAFELSLIYWAGNNI
NVQPEDEAKLNHFAGNFRVHTRDGMTLEAVSRENGIQLAISLCDVQRWQAVYPEENNRPDAILDRLHDMGWSLTPEVRFQ
GGETQVSYDPLTRQLVYQLQARYSEFQLAGSRHHTTAVTGTPGSRYIIMKPVTTQILPTQIILAGDNDHPETIDLLEASP
VLVEGKKDKNSVILTIATIQYSLQLTISGIEESLPETTRVAIQPQDTRLLGDVLRILPDNGNWVGIFRSGHTPTVNRLEN
LMALNQVMTFLPRVSGSAEQVLCLENLGGVRKKVEGELLSGKLKGAWKAEGEPTVPVNISDLSIPPYSRLYLIFEGKNNV
LLRSKVHAAPLKITSAGEMQLSERQWQQQEHIIVKPDNEAPSLILSEFRRFTISSDKTFSLKLMCHQGMVRIDRRSLSVR
LFYLREQPGIGSLRLTFRDFFTEVMDTTDREILEKELRPILIGDTHRFINAAYKNHLNIQLGDGVLNLADIVAEYARIQK
EETSKILYQYQGAMKKKTDGPSVVEDAIMTTTVTTDSGELFPTFHPWYTDDLSGRYKSVPMARKADTLYHLTPKGDLQII
YQVATKMVNQAMIVSLPNYRHEWEKYNLSILSEIPQNNNTVVHSILRVNGPTMQVRTIDYRGTDENNPIVSFSDTTFING
EQMLSYDSHSSGRVYSREEYMMWELQQRVSEASSARTQDYWLMDAAVRNGEWKITPELLRHTPGYIRSTVSKWSRGWLKT
GTILQTPEDRNTDVYLTTIQNNVFSRQGGGYQVYYRIDGMAGADIADNAPGETRCTLRPGTCFEVTSVDERHYEWNIIYV
TLKTCGWSRNGQSKTPNGDNLFN
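
The relationship between the two sequences above can be spot-checked by translating the DNA. The sketch below is a minimal, dependency-free translation using the standard codon assignments (which bacterial translation table 11 shares); `translate` and `CODON_TABLE` are hypothetical names, not part of any database tool. It assumes the DNA is listed in coding orientation, which appears to be the case even though the Strand field is "-", because the sequence starts with ATG and ends with TAA. Only two excerpts copied from the record are translated: the first 30 nucleotides and the final line, which begins in frame because 9600 nucleotides precede it.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the DNA sequence in this record.
first_30_nt = "ATGAGACTGCCAGAGAAAGTTCTTTTTCCT"
last_line = ("ACGCTGAAAACCTGTGGCTGGAGCCGAAATGGCCAAAGC"
             "AAAACGCCGAATGGTGACAACCTTTTTAACTAA")

print(translate(first_30_nt))  # MRLPEKVLFP
print(translate(last_line))    # TLKTCGWSRNGQSKTPNGDNLFN
```

Both outputs match the start and the final line of the listed protein sequence. Pasting the whole DNA block into `translate` should, under the same assumptions, reproduce the full protein.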

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
efa1/lifA | CAI43818.1 | Efa1/LifA protein | Not tested | LEE | Protein | 0.0 | 99
efa1-lifA-tox | CAC81883.1 | Efa1-LifA-Tox protein | Virulence | LEE II | Protein | 0.0 | 99
efa1 | AAL57562.1 | Efa1 | Virulence | LEE | Protein | 0.0 | 99
efa1/lifA | YP_003232172.1 | Efa1/LifA | Not tested | LEE | Protein | 0.0 | 99
efa1/lifA | YP_003223428.1 | Efa1/LifA-like protein | Virulence | LEE | Protein | 0.0 | 99

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-value | Identity (%)
efa1/lifA | YP_003237286.1 | Efa1/LifA-like protein | VFG1534 | Protein | 0.0 | 100