Gene Information

Name : EJP617_18420 (EJP617_18420)
Accession : YP_005818410.1
Strain : Erwinia sp. Ejp617
Genome accession: NC_017445
Putative virulence/resistance : Unknown
Product : Putative adhesin/hemagglutinin/hemolysin
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2022889 - 2031021 bp
Length : 8133 bp
Strand : -
Note : -
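
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed length but is not stated in the record. The helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Position field of this record.
length_bp = feature_length(2022889, 2031021)

# A complete coding sequence should be a whole number of codons;
# the last codon is the stop, so the protein is one residue shorter.
codon_count, remainder = divmod(length_bp, 3)
print(length_bp)        # 8133
print(remainder)        # 0
print(codon_count - 1)  # 2710 residues expected in the protein
```

Under that assumption the result agrees with the Length field (8133 bp) and with a 2710-residue protein.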

DNA sequence :
GTGAACAAACTTTGCTATCGAATTATTTTCAACCGCGCGCGCGGCCTGCTGATGGTCGTGGCAGATATTGCCCGCAGCCG
AACGGGTTCGTCACGTACGCGCCGGGATAAAATCCCCATGCGCTGCTGCCGGCTTAGCCCGTTAAGCTGCGCCATGCTTG
CCGCTTTCGCTTTTGTCACCTCGCCCGGTGAGGCTCAGGCAGGCATTGTCGCCGACGGCCAGGCGCCGAATAACCAGCAG
CCGAATATCGTATCGAGCGCCAACGGCACGCCGCAAATCAATATTCAGACGCCGAGTGAGGCAGGCGTTTCGCGCAACAC
TTACAGCCAGTTCGATGTCGATAACAAAGGCGCCATCCTGAATAACTCGCGCCTCCATGTCGCCACACAGTTGGGCGGCA
TGGTGGCAGGAAACCAGTGGCTGGCGGGCGGAGAAGCAAGAGTGATCCTCAATGAGGTTAACTCGCGCGACCCAAGCAGG
CTCCATGGCTACATCGAAGTTGCCGGGCGTCAGGCACAGGTGGTTATCGCTAACCCATCGGGCATTTCCTGTAATGGCTG
TGGTTTTATCAATGCTAACCGCGCCACGCTAACAACCGGCCAGGCGCAAATGAATAACGGCCAGCTAACCGGTTATGCAG
TAGAGCGCGGTGAGATTGTGGTACAGGGTGCAGGCATGGACAGCAGCCGCCAGGATTACACTGATGTCATTGCCCGTTCG
GTGAAAATCAATGCCGGCATAGTGGCGAAAGCGTTAAATGTGACGGCCGGTCGTAACCGGGTCGATGCGGCTCATCAACA
GATCGATACGCTCGCGGCCGACGGGTCCGCCCGGCTCGGCGGCATGTATGCCAACAAAATTCGATTAATCGGCACCGAAC
AGGGCGTCGGCGTGGCTAACGCCGGGAATATCGGCGCGCAGGCAGGTTCGTTGGTGGTAACGGCAGATGGCCGCATCGAA
AACAGCGGCAACCTTCACAGCAGCGCCGGAATGAACGTCGCCTCCAATACGTCCATCAACAACAGCGGTGCCTTACAGGC
TGGCGACAGTGCCAGCCTGACCAGCACGGGCGACCTGCATAACGGCGGGTCGATTGTGGCGCGTCATCATCTGCAAACGC
AAAGCGCCAGCCTCAACAGCGAACCGGGAAGCGTACTCGCCGCTGGGGTCAACCAGGACGGTAAGCTGGCCGGTAGCGGC
AAATTAACGCTGACAACGTCTGGCCAGCTACGCGCTCAGGGGCAAAATCTGGCCGCCGGGGATTTTACCGCCCGTGGACA
AGGCGTGGATGTGCGCCACGGTTCCAGCAGTGCGAACAACATCACGCTGGAAGCCGGGCAGAGCGACCTGCTCACCGGCA
GTGGTGCGCAGATTGCAGCGCAGCGCCAGCTGACCGCCAGCAGCGGAAAAATGCTGAATAACGACGGTGGCAGCCTGTCG
GCCGACAAACTGAGCATCAACGCGCACGATCTGTCGAACCAACAGGGCGAGATAACTCAGTTGGGCAGCGATTCACTTCA
GCTTAGCCATCAGGGCAACCTCAATAACCGTGGCGGTAGCCTCGCCAGTAACGGCAAAGATCTGACCATACAGTCCGCGA
TCATCCATAATCAGGATGGCAAAATTGTCCATGCGGCTGACGGCATTCTGTCCCTGAGCAGTGACCGCCTGAACGGCGCT
CAGGGACAAATCCTGTCTAACGGTCACCTGAGCATGACCACTGGCGATACCCTGCTTGACGGCGGCGTGACCTTGGCGCA
GCTCATCACGTTAACGGCGAACAGCCTGTCCAACCGTGACGGCAAAATTTTGCAAACCGGCAGCGGTGCCATGTCGCTTG
CGGTGCTGAATGGTATGGATAACCAGAACGGTATTCTGGCCGCAGGCGGCGATCTTCATCTGCAGGCCGCCAGCCTGAAC
AACCAGCAAGGTCAAATCAGCGCACAGGATAACGGCTCGTTAAACGTTAACTTGCTCAACCCGCTCAATAATCAGCAGGG
CGTGATACAGGCAGACGGAACCGTGGGGATTGACACCCAGGGCCAGCAGATTGATAACCGCTCCGGGTTATTGAGTGCGG
GCAAATCACTCACCTTGCTAAGCGGCGAGTTGGTCAATCAGGCCGGTCAGTTGCGCAGCGGCGGCGATCTGCAGCTCCAC
AGCCACGGACATAAGCTGGACAACGCCCAGGGCGGCATCATCAGCTCGTTCGGAAACGCACGGCTGGACGTTAGCGAACT
GGACAACCGTGGCGGGCAGTTACAAACCGCAGGCAATGCGCTACTGAATGCCTTTCAGGGTTTGGTCAATAACACGGCCG
GCCTGATCCGCAGCGGTGCCACCACCACTATTAACGCCGCTCAAATCGTCAACCGTGACAGCAACACGCAAAATACCGGG
CTGGAAGGCCAGAGTGTTCAGCTGAACAGTGATTCACTCGACAATACGCAAGGTACGCTGCGCGCTAACGATTTGCTGGC
TATCACCACCATGCAGTCGCTCAATAATTCGCAGGGCTTAGTCTCTTCTGCTGATGCTCTTTCTGTCAGCGGCGGTGATA
CGCTGAAGCTAAGTAACAGCGGAGGCACCCTGATCTCCGGTAAGGATCTGCAACTGCGCGCCGCTTCTCTGGCCGGTGAC
GGGCGCGTGTTATCGCAGGGCGCGATGACGCTCAATCTGCAACAGGGATTCGTCAATCAGGGCGAGGTGATTGCCAACGG
CGATCTGAACTTTAACGTCGGCAAGGAGCTTGAAAACGGCGCGCTGATTAAGTCCGGCGCTACGCTGAACCTGCATTCAG
CCAGCCTGAACAATACGGCTACGGGGGAAATCAGCGCCGGGCAAAACCATATTTTCACCGATGGCGAACTGACCAACCGT
GGCTTGATCGACGGCAGCCTGACGCATATTCAGTCGGGTACGCTAACCAACACGGGCAGCGGGCGCATCTATGGTGACCA
TATCGCCCTGCAAAGCGGCACGCTGAACAATCTGGCCGAAGGCGGCACGGCGGCGACCATCGCCGCCCGCGAGCGGCTGG
ATATCGGCGCACAAAACATCAATAATAGCGATCATGCACTGATTTTCAGCGGTGGGGACGCGGCGACAGGCGGGCAACTC
AACGACAGCTGGCAGGCATCCGGCCAGGCTTCCGTATTCAACAACCACAGCGCCACGCTGGAATCTGTCGGCAATATGAC
GCTCAACATCGGGCAAATCAACAATTTCAATGACCATCTGGTCACCCAGGAGGTGGTTAGCGAACCCTCGCAGCATCATG
AAGCGGTGTTACAGGGCGCAACAACCCGCCACGACTGGGATAAAGTGGATACCTCATACTCGAATAAGTACGGCGTACAC
GATGCCATTATGCCTGATGGCAGCGTCAACAATGAATTTTACGAGTACCAATACCAGCGCACCGTTACCGAAACCCAAAT
CACAGAGAGCGATCCCGGGCAGATCATCGCCGGGGGGAACCTGACGATCGACAGCGATCGGATTAATAACTACGACAGCC
GTATGATGGCGGGCGGTACGCTGGGCGTGAATCCTGGCGCGGTGCTGAATAACGTGGCCAGCGAAGGCACTAAAATCACG
GTCGATATCGGTAGGCAAACGCACTGGTACGCCAAAAAGTCCGGCGGTGGACTACTGGGCGGCACCAAGACCTCACAGGG
CAAGGATACCAACCGTTACCGGCCAGCCCCCATGATGCAAACCTTTGCTTTGGCCACGATGGTTTATCAGGGGCATACCC
AGGTAAACGGCAACGGCACCACGATTGGCGGGCGCGCTACCTCGGGGCTGAACCAGCAGGCTGACAATGCCGCTGTGGTG
ACACCACCGGCCGACCATATTGTCGAAATTGCCTTGCCGGACCAGGCAGCAAACAGCGTTATCCGCGTCACCAGCCCGAA
CACTCGCCTGCCCGACAACAGCCTGTTCCAGCTGCATGGCGAGGTCACCAGCCACTATCTGATCGAAACCGATCCCCGCT
TCACCCATCAAAAGCAGTGGCTGAGCAGCGACTATATGCAGAGTGCGCTTAGCCAGGATCCGGACCGTCTGGACAAGCGT
CTGGGCGATGGTTACTACGAACAGCGATTGATCCGCGAACAAATTGTCAATCTGACCGGGCAACGTTATCTGGCGGGCTA
CAATAATGACGGCGAGCAGTTCAAAGCGCTGATGGATGCAGGCGTGACTTTCGCTAAGGATTACCACCTGACGCTGGGCG
TGGCCTTGACGCCAGCGCAGATGGCGCTGCTAACCAGCGATATGGTTTGGCTGGTCCGGCAGGATGTTACCCTGCCGGAC
GGTTCGCAGCAAAGCGTGCTGGTGCCACAGGTTTATGCGCGAGTGAAAAAAGGCGACCTCGACGGTAGCGGAGCCTTGCT
GTCCGGTCACAACGTGGTGCTCAACACCGGCCGCGACCTGACTAACGGGGGCAAAATAGCCGGCCGGGAAGTGACACAGA
TCAATGCTGACACCCTCAACAACAGCGGCTTTATTGGTGCTAACCGCGTCAATCTCAACGCCCTGACCGATATCAACAAC
ATCGGCGGCACCCTGCAAGGCGGTGATGCCCTGGTGGCGATAGCCGGTCGCGATATCAACAGCAGCAGCACGCTGGCGGG
CGACGACAGCAATCGCTATCTTAACCGGCCGGCGGGCATCTATGTGCAAAATGACAATGGCGCGCTGGGCCTGCTGGCTG
CCAACAATATCAATCTGACCGCATCGCAGTTGGACAATAGCGGCAATAACAGTCAGACAGAAATTATCGCCGGCCGTGAC
CTCAACCTTAATACCCTGACCACCACCCACAGCGAAAAAAGCGACTGGGGCAGCGATAACTACCGTCATCTGACGCAAAC
CGGGGATATCGGCACGCAGCTGAACGGCGGCGGTAGCGTGGCTATGAGCGCGGGCCACGATCTGAATGCTCAGGCCGCCG
CCATTACGGCAAAAGACAGCGTGGCGATGCAGGCCGGTCATGATATCAACCTGACCGCCGGTGATTCTGCCTATCATCTG
ACCGAGCACAGTAAGCAAACCGTCAAGGGATTATTGTCCGGCAAATCGGCAGAAACGCACGACGAAGTGCAAAGCCAGAG
CGCGCTCGGCAGTAGCGTCAGCGGCAACAGCGTGACGATGCAGGCAGGACACGATCTGCAGGTGAGCGGCAGCAGCGTGG
CGTCCACGCAGGACGTCAGCCTGCTGGCCGGCAATGACCTCAATATCACCACCACCGGCCAGTCGCGCCAGGAAACTCAC
CTGCGCGAGGAGCAAAAAACCGGACTGACCGGCACCGGCGGTATCGGCTTCAGCTACGGTAAAAATGCGCTTAAAACCAC
CGATGAGGGGCAATCGCAAAGCAGCGCGGGGAGCATCGTTGGCAGCAGCCGGGGCAACGTCAGCCTCACTGCTGGCAATG
CGCTGACGGTGAAAGGCTCGGAGGTGCTGGGCGGTCAGGATCTGAACCTGAGCGGCAAACAGGTCAGTATTCTGGCGGCG
GACAACCAGAGCGTGCAAACCCATACCGTGGAGCAAAAACACAGCGGGCTGACGCTGGCGCTGTCAGGTGCGGTGGGCAG
TGCGGTTAACGGCGCGGTCACCAGCGCCAGTGAGGCCAGTAACGCCAGCAGCGGACGGCTGGCGGCGCTTGACGGCATCA
AAACGGCGCTCGGCGGCGTGCAGGCGTATCAGGGCTATCAGCTCGGTACGGCTCAGGGAGAGGATGCGAAAAACCTGTTT
GGCGTTAACCTGTCTTACGGCAGCCAGTCGTCAAAATCTCAGCAGACGCAGACCTCGAACCAGAGCCAGGGCAGCACCCT
CACCGCCGGCAATAACCTCAATATTCGCGCCACCGACACGGATATCACCGTTCAGGGCAGCCAGCTGCAGGCCGGCAGGG
ATCTCGGCCTGGCTGCCGCCCGTGACGTCAACCTGCTGTCGGCGCAGAACACCTCGCTTCTTGAGGGGAAAAACGAAAGC
CACGGCGCTTCCGTCGGCGTGGGCATCAATTTCGGCGGGGATAAAAACGGCCTGACCTTCAACGCCAGCGGCAATAAAGG
CACAGGTTCGGAGAACGGCAACGGCGTCACGCATACTGAAACCACGCTCAACGCCGGTCATCATCTGACCATCAGCAGCG
GGCGTGACACCACGCTGACCGGCGCACAGGTCAGCGGCGACAGCGTGACGCTGGACACGGCCCGCAACCTGACATTAACC
AGCGAGCAGGACAGCGATAACTACGACTCGAAGCAGCAGAATGCCAGTGCGGGCGGCAGCGCTGGCGCGGGCGGCCCCGG
CGGTTCGCTTAACCTCAGCCGGGATAAGATGCACAGCACATGGGATTCGGTGCAGGAGCAGACCGGCATCTTGGCCGGCA
AAGGCGGCTTTGACATCACCACCGGCGGGCATACGCAGCTCAACGGCGCGGTTATCGGCAGCACCGCCACGGCGGATAAA
AACCGCCTTGATACCGGCACGCTGGGCTTTGCTGATATGGATAACAGGGCCGACTATAAGGTCGAACACCTGAGCACCGG
CTTCAGCACCGGCGGCAGTACGGGTGGTCAGTTCCTGAGTAATCTGGGCAGCACCCTGCTGGTCGGTGTTAACGGTAACG
GCCACGACAGCTCCGCCACCCAAGCGGCGGTATCTGACGGCAGCATTATTATCCGTGATAAGAATGCGCAGCAGCAGGAT
GTCGCTGACCTGAGCCGCGACGTGGAGCATGCCAGCCAGACCCTTTCCCCGATATTCAACAAGGAGAAGGAGCAGCAGCG
GCTGCAGGAAGCGCAGAAGATCGGCGAAATAGGGGCGCAGGCGCTGGATATTGCGGCGACCCGGGGCAAAATCGAAGCCA
CGCATGCGGCAAATGATAAAATTGCCGGGGCCACGCAGCACGACCGCGATGAAGCCCTGTCCGATCTGAAAAAGCAGGAT
CCGTCGAAGCAGTACGACAGCGCGGATATAGAAAAGCAGGTCTACCAGAACTTCTATAATCAGGCGCTGACGGAAAGCCA
GTTCGGCACGGGGGGAAAAGTGCAACAGGCCATCCAGGCCGCCACGGCGGCGGTACAGGGCCTGGCGGGCGGCAATATCG
CGCAGGCGATGGCGGGCGGAGCGGCACCGTATCTGGCGGAGGTCATTCATAAAATGACCACCGACCCGGCAACCGGCAGT
GTGGACGTCCAGTCTAATCTGATGGCGCACGCGGTACTGGGTGCGGTGGTGGCACAGACGAGCGGTCACTCCGCTCTGGC
CGGAGCCGCAGGCGCGACGACCGGCGAATTTATCGCGCAGCAGCTGTACCCCGGAGTAGACAGGAACCAGCTTAACGAAG
AGCAAAAGCAGACAGTTAGCGCACTGGGTACGCTGGCGGCGGGTCTTGCGGGTGGCTTGACGGGAGGCAGCGCCGCCGAT
ACAGTGGCGGGCGCGCAGGCCGGGAAGAATTCGGTGGAGAATAATGCGCTGAGTCTGCCGTCCGGCTTGCAGAGTTACGT
TCAGGCGGTGGGATCGTGGGATCAGTATGCCGAGACAAACGGACTGACGCCGGAGCAGAAACAGGCCGGGCTGAATAAGC
TGGCGCAGGGTGATATGCCGGAAGGGGCAAATATTCCGAAGGTGATTGTTGAGGCATATAAAGATGGTGTACTGGCAGCG
GGAGCCGCTTATTTAGGGCCGGCAGCTTCGGCAGGTAAAGTCGTTGGTGGTACGCTTATTGGTGCTATCGCCAATGGTTC
TTACCAGTGGTTTGATATGAGTCAGCCTGGAAATGAAAATAAATCGTATGACTATCTGGGAACAGGGTCAGCAGCGGTGA
CGGGTGGCTTAGCACCGGGGCGTGGTATTTGGCAAAACGTGGGCATAGCTACAGGTGGAGCGATATTTACGGACGGTCCG
GATGCGGGGGCTGTAGGTGGCGCTGCCGCCGGGGCGTGGGCTGGCGGAATGTTTGGAGAATATGCTCCGGGGATCGTTAA
TTCAGTCATAGGGAAAGAACTGCCAGGTATTGTCTACGATGTTATTGGCTCGGGGGGATCAGAAGTCGTTGGTGGTTTCG
TCAAAGATATATCAAACCCACAAGCGGCTGATAGTAATAAGGGAGGGAAATAA

Protein sequence :
MNKLCYRIIFNRARGLLMVVADIARSRTGSSRTRRDKIPMRCCRLSPLSCAMLAAFAFVTSPGEAQAGIVADGQAPNNQQ
PNIVSSANGTPQINIQTPSEAGVSRNTYSQFDVDNKGAILNNSRLHVATQLGGMVAGNQWLAGGEARVILNEVNSRDPSR
LHGYIEVAGRQAQVVIANPSGISCNGCGFINANRATLTTGQAQMNNGQLTGYAVERGEIVVQGAGMDSSRQDYTDVIARS
VKINAGIVAKALNVTAGRNRVDAAHQQIDTLAADGSARLGGMYANKIRLIGTEQGVGVANAGNIGAQAGSLVVTADGRIE
NSGNLHSSAGMNVASNTSINNSGALQAGDSASLTSTGDLHNGGSIVARHHLQTQSASLNSEPGSVLAAGVNQDGKLAGSG
KLTLTTSGQLRAQGQNLAAGDFTARGQGVDVRHGSSSANNITLEAGQSDLLTGSGAQIAAQRQLTASSGKMLNNDGGSLS
ADKLSINAHDLSNQQGEITQLGSDSLQLSHQGNLNNRGGSLASNGKDLTIQSAIIHNQDGKIVHAADGILSLSSDRLNGA
QGQILSNGHLSMTTGDTLLDGGVTLAQLITLTANSLSNRDGKILQTGSGAMSLAVLNGMDNQNGILAAGGDLHLQAASLN
NQQGQISAQDNGSLNVNLLNPLNNQQGVIQADGTVGIDTQGQQIDNRSGLLSAGKSLTLLSGELVNQAGQLRSGGDLQLH
SHGHKLDNAQGGIISSFGNARLDVSELDNRGGQLQTAGNALLNAFQGLVNNTAGLIRSGATTTINAAQIVNRDSNTQNTG
LEGQSVQLNSDSLDNTQGTLRANDLLAITTMQSLNNSQGLVSSADALSVSGGDTLKLSNSGGTLISGKDLQLRAASLAGD
GRVLSQGAMTLNLQQGFVNQGEVIANGDLNFNVGKELENGALIKSGATLNLHSASLNNTATGEISAGQNHIFTDGELTNR
GLIDGSLTHIQSGTLTNTGSGRIYGDHIALQSGTLNNLAEGGTAATIAARERLDIGAQNINNSDHALIFSGGDAATGGQL
NDSWQASGQASVFNNHSATLESVGNMTLNIGQINNFNDHLVTQEVVSEPSQHHEAVLQGATTRHDWDKVDTSYSNKYGVH
DAIMPDGSVNNEFYEYQYQRTVTETQITESDPGQIIAGGNLTIDSDRINNYDSRMMAGGTLGVNPGAVLNNVASEGTKIT
VDIGRQTHWYAKKSGGGLLGGTKTSQGKDTNRYRPAPMMQTFALATMVYQGHTQVNGNGTTIGGRATSGLNQQADNAAVV
TPPADHIVEIALPDQAANSVIRVTSPNTRLPDNSLFQLHGEVTSHYLIETDPRFTHQKQWLSSDYMQSALSQDPDRLDKR
LGDGYYEQRLIREQIVNLTGQRYLAGYNNDGEQFKALMDAGVTFAKDYHLTLGVALTPAQMALLTSDMVWLVRQDVTLPD
GSQQSVLVPQVYARVKKGDLDGSGALLSGHNVVLNTGRDLTNGGKIAGREVTQINADTLNNSGFIGANRVNLNALTDINN
IGGTLQGGDALVAIAGRDINSSSTLAGDDSNRYLNRPAGIYVQNDNGALGLLAANNINLTASQLDNSGNNSQTEIIAGRD
LNLNTLTTTHSEKSDWGSDNYRHLTQTGDIGTQLNGGGSVAMSAGHDLNAQAAAITAKDSVAMQAGHDINLTAGDSAYHL
TEHSKQTVKGLLSGKSAETHDEVQSQSALGSSVSGNSVTMQAGHDLQVSGSSVASTQDVSLLAGNDLNITTTGQSRQETH
LREEQKTGLTGTGGIGFSYGKNALKTTDEGQSQSSAGSIVGSSRGNVSLTAGNALTVKGSEVLGGQDLNLSGKQVSILAA
DNQSVQTHTVEQKHSGLTLALSGAVGSAVNGAVTSASEASNASSGRLAALDGIKTALGGVQAYQGYQLGTAQGEDAKNLF
GVNLSYGSQSSKSQQTQTSNQSQGSTLTAGNNLNIRATDTDITVQGSQLQAGRDLGLAAARDVNLLSAQNTSLLEGKNES
HGASVGVGINFGGDKNGLTFNASGNKGTGSENGNGVTHTETTLNAGHHLTISSGRDTTLTGAQVSGDSVTLDTARNLTLT
SEQDSDNYDSKQQNASAGGSAGAGGPGGSLNLSRDKMHSTWDSVQEQTGILAGKGGFDITTGGHTQLNGAVIGSTATADK
NRLDTGTLGFADMDNRADYKVEHLSTGFSTGGSTGGQFLSNLGSTLLVGVNGNGHDSSATQAAVSDGSIIIRDKNAQQQD
VADLSRDVEHASQTLSPIFNKEKEQQRLQEAQKIGEIGAQALDIAATRGKIEATHAANDKIAGATQHDRDEALSDLKKQD
PSKQYDSADIEKQVYQNFYNQALTESQFGTGGKVQQAIQAATAAVQGLAGGNIAQAMAGGAAPYLAEVIHKMTTDPATGS
VDVQSNLMAHAVLGAVVAQTSGHSALAGAAGATTGEFIAQQLYPGVDRNQLNEEQKQTVSALGTLAAGLAGGLTGGSAAD
TVAGAQAGKNSVENNALSLPSGLQSYVQAVGSWDQYAETNGLTPEQKQAGLNKLAQGDMPEGANIPKVIVEAYKDGVLAA
GAAYLGPAASAGKVVGGTLIGAIANGSYQWFDMSQPGNENKSYDYLGTGSAAVTGGLAPGRGIWQNVGIATGGAIFTDGP
DAGAVGGAAAGAWAGGMFGEYAPGIVNSVIGKELPGIVYDVIGSGGSEVVGGFVKDISNPQAADSNKGGK
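
The relationship between the DNA and protein sequences above can be illustrated with a minimal translation sketch. It assumes the standard bacterial codon assignments and treats an alternative start codon (GTG or TTG) in the first position as methionine; the function name `translate` and its parameters are hypothetical, not part of the database. The DNA as listed appears to be in coding orientation already, since its first codons match the start of the protein, so no reverse complement is applied here.

```python
from itertools import product

# Codon table built in TCAG order (standard/bacterial amino acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, alt_starts=("GTG", "TTG")) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Assumption: an alternative start codon at position 1 is read as Met.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in alt_starts:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the DNA sequence in this record.
print(translate("GTGAACAAACTTTGCTATCGAATTATTTTC"))  # MNKLCYRIIF
print(translate("AAGGGAGGGAAATAA"))                 # KGGK
```

Both fragments reproduce the corresponding ends of the listed protein sequence (`MNKLCYRIIF...` and `...KGGK`).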

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
S4 | AAQ19127.1 | putative adhesin/hemagglutinin/hemolysin | Not tested | PAI I CL3 | Protein | 0.0 | 48