Gene Information

Name : tsh (APECO1_O1CoBM73)
Accession : YP_001481228.1
Strain :
Genome accession: NC_009837
Putative virulence/resistance : Virulence
Product : Tsh
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 58775 - 62908 bp
Length : 4134 bp
Strand : +
Note : temperature sensitive hemagglutinin; similar to AAA24698; identified by match to protein family HMM PF02395

DNA sequence :
ATGAACAGAATTTATTCTCTTCGCTACAGCGCTGTGGCCCGGGGCTTTATTGCCGTATCTGAGTTTGCTAGGAAATGTGT
TCATAAGTCTGTCAGACGTCTGTGTTTCCCGGTTTTATTACTGATCCCGGTACTATTCTCTGCAGGAAGTCTTGCGGGAA
CGGTCAATAATGAACTCGGGTATCAGTTATTTCGTGATTTTGCTGAAAATAAGGGGATGTTCCGCCCGGGGGCAACGAAT
ATCGCTATTTATAATAAGCAGGGAGAATTTGTCGGTACGCTGGATAAGGCAGCTATGCCTGATTTCAGTGCTGTGGATTC
GGAAATCGGTGTGGCGACACTGATAAACCCGCAGTATATCGCCAGCGTGAAACATAACGGGGGATATACAAACGTTAGCT
TTGGTGATGGTGAAAACCGTTACAATATCGTGGACCGGAATAATGCGCCGTCACTGGATTTTCATGCCCCCCGGCTGGGT
AAACTGGTGACAGAGGTTGCCCCTACTGCGGTGACGGCGCAGGGGGCAGTGGCTGGCGCATATCTGGATAAGGAGCGCTA
TCCTGTTTTTTATCGTCTGGGGTCTGGTACTCAGTATATTAAGGACAGTAACGGACAGCTGACAAAAATGGGAGGTGCAT
ATTCCTGGCTGACCGGCGGGACTGTCGGTAGCCTGTCATCCTATCAGAATGGAGAAATGATTAGCACCAGTTCAGGTCTG
GTTTTTGATTACAAACTTAATGGTGCAATGCCCATTTATGGCGAGGCCGGTGACAGCGGTTCGCCTTTATTTGCTTTTGA
TACTGTTCAGAATAAATGGGTGCTGGTCGGTGTTCTTACTGCGGGGAATGGCGCGGGGGGCAGGGGAAATAACTGGGCTG
TTATTCCACTGGATTTTATCGGGCAGAAATTTAATGAAGACAATGATGCCCCGGTCACGTTCAGAACATCGGAAGGTGGT
GCACTGGAGTGGAGCTTTAACAGCAGTACCGGAGCTGGTGCGCTGACACAGGGAACCACCACATATGCCATGCACGGGCA
GCAGGGAAATGACCTGAATGCTGGTAAGAACCTGATATTTCAGGGGCAGAATGGTCAGATTAACCTTAAGGATTCGGTTT
CTCAGGGGGCGGGTTCCCTGACGTTCCGTGATAATTACACAGTAACAACCTCTAACGGAAGTACCTGGACCGGTGCCGGT
ATTGTTGTGGACAACGGGGTGTCCGTAAACTGGCAGGTTAATGGTGTTAAGGGCGATAACCTGCATAAAATTGGTGAAGG
TACGCTGACGGTACAGGGTACAGGTATTAATGAAGGTGGCCTGAAGGTCGGGGACGGAAAGGTTGTACTGAACCAGCAGG
CGGACAATAAAGGACAGGTGCAGGCGTTCAGCAGTGTTAATATTGCCAGTGGCCGGCCGACCGTGGTACTGACTGATGAG
CGGCAGGTAAATCCGGATACCGTCTCATGGGGATATCGTGGGGGCACACTGGATGTTAATGGTAACAGTCTGACGTTTCA
TCAGTTGAAGGCGGCAGATTATGGTGCCGTGCTGGCGAATAACGTTGATAAACGGGCCACTATCACGCTGGACTATGCCC
TGCGGGCTGACAAAGTAGCACTGAATGGCTGGTCGGAATCAGGTAAAGGAACTGCCGGAAATTTATATAAATACAATAAC
CCGTACACAAATACGACGGATTACTTCATCCTGAAGCAGAGCACCTATGGTTATTTCCCCACGGACCAGAGCAGCAACGC
CACCTGGGAGTTTGTGGGGCACAGTCAGGGGGATGCACAGAAACTGGTAGCTGACCGTTTCAATACTGCAGGGTATCTGT
TTCACGGACAACTGAAAGGCAATCTGAATGTGGACAATCGCCTGCCTGAAGGCGTTACCGGTGCTCTGGTGATGGACGGA
GCTGCGGATATCTCCGGTACATTCACCCAGGAAAACGGGCGTCTGACGCTGCAGGGGCATCCGGTTATCCATGCATACAA
TACTCAGTCTGTGGCTGACAAACTGGCTGCCAGTGGAGACCATTCGGTTCTGACTCAGCCTACGTCATTCAGTCAGGAGG
ACTGGGAGAACCGCAGTTTTACCTTTGACAGGCTGTCACTGAAGAACACTGATTTTGGTCTTGGTCGCAATGCCACACTG
AACACAACCATCCAGGCAGATAACTCCAGCGTCACGCTGGGCGACAGCCGGGTATTTATCGACAAAAACGATGGCCAGGG
AACAGCCTTTACCCTTGAAGAAGGCACATCTGTTGCAACTAAAGATGCAGATAAAAGTGTCTTCAACGGCACCGTCAACC
TGGATAATCAGTCAGTGCTGAATATCAATGATATATTCAATGGCGGAATACAGGCGAACAACAGTACCGTGAATATCTCC
TCAGACAGTGCCGTTCTGGGGAACTCAACACTGACCAGTACCGCCCTGAATCTGAACAAGGGAGCAAATGCTCTGGCCAG
TCAGAGTTTTGTTTCTGACGGTCCAGTGAATATTTCTGATGCCACCCTGAGTCTGAACAGCCGTCCTGATGAGGTATCTC
ACACACTTTTACCTGTATACGATTATGCCGGTTCATGGAACCTGAAGGGAGACGATGCCCGCCTGAACGTGGGGCCGTAC
AGTATGTTGTCAGGTAATATCAATGTTCAGGATAAAGGGACTGTCACCCTCGGAGGGGAAGGGGAACTGAGTCCTGACCT
GACTCTTCAGAATCAGATGTTGTACAGCCTGTTTAACGGGTACCGCAATATCTGGAGCGGGAGCCTGAATGCACCGGATG
CCACCGTCAGCATGACAGACACCCAGTGGTCGATGAACGGAAACTCCACGGCAGGAAATATGAAACTTAACCGGACAATA
GTCGGTTTTAACGGGGGAACATCACCGTTCACGACACTGACAACAGATAATCTGGACGCGGTTCAGTCAGCATTTGTCAT
GCGTACAGACCTTAACAAGGCAGACAAACTGGTGATAAACAAGTCGGCAACAGGTCATGACAACAGCATCTGGGTTAACT
TCCTGAAAAAACCTTCTAACAAGGACACGCTTGATATTCCACTGGTCAGCGCACCTGAAGCGACAGCTGATAATCTGTTC
AGGGCATCAACACGGGTTGTGGGATTCAGTGATGTCACCCCCATCCTTAGTGTCAGAAAAGAGGACGGGAAAAAAGAGTG
GGTCCTCGATGGTTACCAGGTTGCACGTAACGACGGCCAGGGTAAGGCTGCCGCCACATTCATGCACATCAGCTATAACA
ACTTCATCACTGAAGTTAACAACCTGAACAAACGCATGGGCGATTTGAGGGATATTAATGGCGAAGCCGGTACGTGGGTG
CGTCTGCTGAACGGTTCCGGCTCTGCTGATGGCGGTTTCACTGACCACTATACCCTGCTGCAGATGGGGGCTGACCGTAA
GCACGAACTGGGAAGTATGGACCTGTTTACCGGCGTGATGGCCACCTACACTGACACAGATGCGTCAGCAGACCTGTACA
GCGGTAAAACAAAATCATGGGGTGGTGGTTTCTATGCCAGTGGTCTGTTCCGGTCCGGCGCTTACTTTGATGTGATTGCC
AAATATATTCACAATGAAAACAAATATGACCTGAACTTTGCCGGAGCTGGTAAACAGAACTTCCGCAGCCATTCACTGTA
TGCAGGTGCAGAAGTCGGATACCGTTATCATCTGACAGATACGACGTTTGTTGAACCTCAGGCGGAACTGGTCTGGGGAA
GACTGCAGGGCCAAACATTTAACTGGAACGACAGTGGAATGGATGTCTCAATGCGTCGTAACAGCGTTAATCCTCTGGTA
GGCAGAACCGGCGTTGTTTCCGGTAAAACCTTCAGTGGTAAGGACTGGAGTCTGACAGCCCGTGCCGGCCTGCATTATGA
GTTCGATCTGACGGACAGTGCTGACGTTCATCTGAAGGATGCAGCGGGAGAACATCAGATTAATGGCAGAAAAGACAGTC
GTATGCTTTACGGTGTGGGGTTAAATGCCCGGTTTGGCGACAATACGCGTCTGGGGCTGGAAGTTGAACGCTCTGCATTT
GGTAAATACAACACAGATGATGCGATAAACGCTAATATTCGTTATTCATTCTGA

Protein sequence :
MNRIYSLRYSAVARGFIAVSEFARKCVHKSVRRLCFPVLLLIPVLFSAGSLAGTVNNELGYQLFRDFAENKGMFRPGATN
IAIYNKQGEFVGTLDKAAMPDFSAVDSEIGVATLINPQYIASVKHNGGYTNVSFGDGENRYNIVDRNNAPSLDFHAPRLG
KLVTEVAPTAVTAQGAVAGAYLDKERYPVFYRLGSGTQYIKDSNGQLTKMGGAYSWLTGGTVGSLSSYQNGEMISTSSGL
VFDYKLNGAMPIYGEAGDSGSPLFAFDTVQNKWVLVGVLTAGNGAGGRGNNWAVIPLDFIGQKFNEDNDAPVTFRTSEGG
ALEWSFNSSTGAGALTQGTTTYAMHGQQGNDLNAGKNLIFQGQNGQINLKDSVSQGAGSLTFRDNYTVTTSNGSTWTGAG
IVVDNGVSVNWQVNGVKGDNLHKIGEGTLTVQGTGINEGGLKVGDGKVVLNQQADNKGQVQAFSSVNIASGRPTVVLTDE
RQVNPDTVSWGYRGGTLDVNGNSLTFHQLKAADYGAVLANNVDKRATITLDYALRADKVALNGWSESGKGTAGNLYKYNN
PYTNTTDYFILKQSTYGYFPTDQSSNATWEFVGHSQGDAQKLVADRFNTAGYLFHGQLKGNLNVDNRLPEGVTGALVMDG
AADISGTFTQENGRLTLQGHPVIHAYNTQSVADKLAASGDHSVLTQPTSFSQEDWENRSFTFDRLSLKNTDFGLGRNATL
NTTIQADNSSVTLGDSRVFIDKNDGQGTAFTLEEGTSVATKDADKSVFNGTVNLDNQSVLNINDIFNGGIQANNSTVNIS
SDSAVLGNSTLTSTALNLNKGANALASQSFVSDGPVNISDATLSLNSRPDEVSHTLLPVYDYAGSWNLKGDDARLNVGPY
SMLSGNINVQDKGTVTLGGEGELSPDLTLQNQMLYSLFNGYRNIWSGSLNAPDATVSMTDTQWSMNGNSTAGNMKLNRTI
VGFNGGTSPFTTLTTDNLDAVQSAFVMRTDLNKADKLVINKSATGHDNSIWVNFLKKPSNKDTLDIPLVSAPEATADNLF
RASTRVVGFSDVTPILSVRKEDGKKEWVLDGYQVARNDGQGKAAATFMHISYNNFITEVNNLNKRMGDLRDINGEAGTWV
RLLNGSGSADGGFTDHYTLLQMGADRKHELGSMDLFTGVMATYTDTDASADLYSGKTKSWGGGFYASGLFRSGAYFDVIA
KYIHNENKYDLNFAGAGKQNFRSHSLYAGAEVGYRYHLTDTTFVEPQAELVWGRLQGQTFNWNDSGMDVSMRRNSVNPLV
GRTGVVSGKTFSGKDWSLTARAGLHYEFDLTDSADVHLKDAAGEHQINGRKDSRMLYGVGLNARFGDNTRLGLEVERSAF
GKYNTDDAINANIRYSF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAD66214.1 putative hemoglobin protease Not tested PAI III 536 Protein 0.0 79
vat YP_851472.1 vacuolating autotransporter Not tested PAI III APEC-O1 Protein 0.0 79
vat AAO21903.1 vacuolating autotransporter toxin Virulence Not named Protein 0.0 78
pic NP_708747.3 serine protease Not tested SHI-1 Protein 0.0 50
she AAB58244.1 mucinase Virulence SHI-1 Protein 0.0 50
pic AAK00464.1 Pic Virulence SHI-1 Protein 0.0 50
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 0.0 49
unnamed CAC39286.1 hypothetical protein Not tested LPA Protein 0.0 44

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
tsh YP_001481228.1 Tsh VFG0904 Protein 0.0 79
tsh YP_001481228.1 Tsh VFG1689 Protein 0.0 79
tsh YP_001481228.1 Tsh VFG0635 Protein 0.0 50
tsh YP_001481228.1 Tsh VFG0861 Protein 0.0 50
tsh YP_001481228.1 Tsh VFG0903 Protein 0.0 50