Gene Information

Name : BTH_II1062 (BTH_II1062)
Accession : YP_439259.1
Strain :
Genome accession: NC_007650
Putative virulence/resistance : Unknown
Product : host specificity protein J
Function : -
COG functional category : S : Function unknown
COG ID : COG4733
EC number : -
Position : 1233884 - 1237189 bp
Length : 3306 bp
Strand : +
Note : identified by match to protein family HMM PF00041
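
The Position and Length fields above can be cross-checked with a short calculation. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention; the record does not state this) and that the final codon is a stop codon. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 1233884, 1237189

length_bp = feature_length(start, end)   # should agree with the Length field
codons = length_bp // 3                  # complete codons, including the stop
residues = codons - 1                    # assumes the last codon is a stop codon

print(length_bp, codons, residues)
```

Under these assumptions the span is 3306 bp, which matches the Length field, and corresponds to 1101 amino acid residues plus one stop codon.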

DNA sequence :
TTGAAGAAGCTCCATGCAGAAAGAGGGCTGAAGCGGATCTACGGCGCGAAGGGCGGCGGCGGTGGTGGTGGCAGCAGCGA
ATCGCCCGACAGTCTGCATTCGATTGCGCGCGCGAAGGTGCTCGATGTGATCTCGGCGGGGCCGATCGTGGGGCTGGTGA
ATGGCCTGCAGTCGGTCTATCTCGACGGCACACCGATCCAGAACGCGGACGGTTCGCTGAATTTCCAGAACTACACCGTC
GACGTGCGGACGGGCACGCAGGATCAGGACTACATCCCGGGCTTTCCGGCCGTCGAGCGTGAGGCCGGCGTCGGCGTGCC
GCTGACGTCCGACGCGCCGTGGGTGCGCCAGATCCAGAATACGCAACTGACCGCGGTGCGTGTGCGCTTCGGCGTGCCGG
CGCTACAGCGTCAGGACACGTCAAATGGCAACATCACAGGCTATCGCGTCGACTATGCGATCGACTTGTCGGTCGACGGC
GGGTCGTATACGCAGGTGCTGGCCGGTGCGTTCGACGGCAAGACGACGTCGCTCTACGAGCGCTCGCATCGGATCGAGCT
GCCGCGCGCGAAAAACGGCTGGCTGATCCGTGTGCGCCGTATCACGCCGAACGCGCACACGGCGACGATCGCCGACGCGA
TCAACATCGAAGCGATCACCGAAATCATCGATCGGAAGCTTCGCTATCCGATGACGGCGCTCGTCGGCATGACGTTCGAC
GCACGTTCGTTCTCGAGCGTGCCGGTGCGTTCGTATCACGTGCGAGGGATGATCTTCCGTGTTCCGACAAATTACGATCC
GGAGACCCGCACGTATTCGGGCACTTGGGATGGCACGTTCAAGGCAGCATGGACGAACAATCCAGCTTGGGTCTACTACG
GCCTACTGCTCGACAAGCTCAACGGATTGGGCGACCGTGTCGATGCGTCGATGGTCGACAAGTGGGCGCTGTACGCAATC
GCGCGTCACTGTGACGAGCTGGTATCGGACGGGAAGGGCGGCAAGGAGCCGCGCTTTACGTGCAACTGCGTGATTCAGAC
AAAGGCGGATGCGTTCAAGGTCGTGCAGGATATCGCAAGTGTCTTTCGCGGGATTTCGTATTGGGGGGCCGGCTCCGTCG
TCGCGTCGGCCGATATGCCGTCCGATCCGGTCTACCTGTACACGGCCGCGAACGTCGTCGGCGGTTCATTCAAGTACGTC
GGCAGCGAGCGCAAGACGCGCTACACGGTCGCGCTCGTCAGCTACAACGATCCGACGAACCAGTACAAGCAGGCTGTCGA
AGCCGTGCAGGACGACGACGGGATCGCGCGATACGGCGTCATCAAGACGGAGGTCACGGCGTTCGGCTGCACGTCGCAGG
CGCAGGCGCACCGTCTCGGTCGGTGGCTGCTGCTGACGTCGCGGTACGAGACCGGGACGGTCTCGTTTCAGGTCGGGCTC
GACGGGACGCTTTGTGCGCCGGGACAGGTGATCGCCGTTGCCGACCCTAAGAAGGCCGGCCGCCGGATCGGCGGCCGTAT
CCGCGCCGCAGCCGGCGAGACGATCACGCTCGACAAGGCGCCGACTATCGCCGCCGGCGATCGTTTCACGGCGATCTTGC
CGTCGGGCATCGCGCAGGCGCGGGTGGTGAAGGCCGTCAACGGTGACACGGTGACGCTTGCCGCGCGCTTCGACGCTGAT
CCGGTGCCGGGCGCTGTGTGGATGGTCGAGAGCAACGAACTCGCCGCGCAGCAGTATCGCGTGGTGAGTGTGCAGGAGAG
CGACGACAACGGCCAGATCGTCTACACGATCAACGCGACACAGTATGAGCCGGGCAAGTATGCGGCGATCGACGACGGCG
CGCAGATCCAGCAACGGCCGATCACGATTGTTCCGCCTTCGGTACAGCCACCGCCGTCGAACGTACGCCTGTCGACGTAC
TCGGTAGTCGATCAGGGCATTTCGAAAACGTCGATGGTGATCGCGTGGGATGCGGCGAACCACGCGACGAGCTACGTCGC
CGAATGGCGCAAGGACAACGGCGAGTGGGTGCGGGCGCCGTCGACGGGCGGTTTGCAGGTTGAGGTGCCGGGCATCTATC
AGGGCAAATACCTCGCGCGCGTGCGCGCCGAAAACGCGCTCGGCGTGACGTCGATTCCGGCGTATGGCGTCGATACGCAG
CTGACCGGGAAAACCACTCCGCCGCCGTCGGTCGTGTCGCTGACCGCGGCGGGCATCGTGTACGGGATCGACCTGAAATG
GGCGTTCCCGGGTGACGGCTCCGCCGGCGACACGCAGCGAACGGAGATCTGGTACAGCCGCACGCCGAATCGCGACGACG
CGACCAAGTTCTCCGACTTCGCGTATCCGCAGGCGTCGACGTCGTATCAGGGGCTCGCGGTCGGGCAGGTGTTTTATTTC
TGGGCGCGCCTGGTCGACACGTCCGGCAACGTCGGGCCGTGGTTCCCGGCGAAGGGGCCGGGCGTGCAGGGTCAGCCGAG
CACGGATCAAAGCGACTATGAGAAGTATTTCGCCGGCCAGATCGGGAAGTCGGCGCTTGGCACGGAGCTGCGCGCGCCGA
TCGACCTGATCACCCCGCCGATGGCCGGCGACGCAACGATCTACGCGGGCGACGAAAGACTCAATGCTGGCGTGTGGTCA
CTGCAAGCGGCGATCGCCGAGGGCGATATGGCGGTCGCGAAGAAGGTCGAAACAGTCGCGGCCCAGCTGCACTCGGGCTC
GAATCTGCTGAACGCCGCGGTGCAGAAGGAGACGATTGCGCGTGTCGAAGCTGATCGTGCGATGGCGCAGGACATCACGA
CGGTGCAGGCGCAGGTGGACGACAACGTGGCTGCGGTGCAAACCGTTGCGAAGTCCTACGCCGACCTGAACGGACGTGTC
GCGGCTTCGTATCAGATCAAGGTACAGACGACCGCCGACGGCCACAAATACATGGCGTCGATCGGTGTGGGCATCGACAA
CGAAAACGGCGTCGTCGAATCGCAGGTGCTCGTGTCGGCGAAGCGGTTCGCCGTGATCGACGAGGACGGCTCCGGTGTGA
TCGGTGCGCCGTTCGTCGTGCAGGGCGGGCAGGTGTTCTTGCGTCAGGCGCTGATCGGTGCGGGCTGGATTACGAACGCG
ATGATCGGCAGCTACATCCAGTCCGACAACTACATCGCGGGGCGGCAGGGATGGCGGTTGGATAAGACCGGTTGGTTCGA
AATCAACGCAGCGGACGGCAGCGGAAATCGGCTTGTGATGGATGGTAGCAGTGTCCGTGTCTACGACGGTAACGGCGTGC
TGCGGGTGCGCATGGGGATGTGGTAA

Protein sequence :
MKKLHAERGLKRIYGAKGGGGGGGSSESPDSLHSIARAKVLDVISAGPIVGLVNGLQSVYLDGTPIQNADGSLNFQNYTV
DVRTGTQDQDYIPGFPAVEREAGVGVPLTSDAPWVRQIQNTQLTAVRVRFGVPALQRQDTSNGNITGYRVDYAIDLSVDG
GSYTQVLAGAFDGKTTSLYERSHRIELPRAKNGWLIRVRRITPNAHTATIADAINIEAITEIIDRKLRYPMTALVGMTFD
ARSFSSVPVRSYHVRGMIFRVPTNYDPETRTYSGTWDGTFKAAWTNNPAWVYYGLLLDKLNGLGDRVDASMVDKWALYAI
ARHCDELVSDGKGGKEPRFTCNCVIQTKADAFKVVQDIASVFRGISYWGAGSVVASADMPSDPVYLYTAANVVGGSFKYV
GSERKTRYTVALVSYNDPTNQYKQAVEAVQDDDGIARYGVIKTEVTAFGCTSQAQAHRLGRWLLLTSRYETGTVSFQVGL
DGTLCAPGQVIAVADPKKAGRRIGGRIRAAAGETITLDKAPTIAAGDRFTAILPSGIAQARVVKAVNGDTVTLAARFDAD
PVPGAVWMVESNELAAQQYRVVSVQESDDNGQIVYTINATQYEPGKYAAIDDGAQIQQRPITIVPPSVQPPPSNVRLSTY
SVVDQGISKTSMVIAWDAANHATSYVAEWRKDNGEWVRAPSTGGLQVEVPGIYQGKYLARVRAENALGVTSIPAYGVDTQ
LTGKTTPPPSVVSLTAAGIVYGIDLKWAFPGDGSAGDTQRTEIWYSRTPNRDDATKFSDFAYPQASTSYQGLAVGQVFYF
WARLVDTSGNVGPWFPAKGPGVQGQPSTDQSDYEKYFAGQIGKSALGTELRAPIDLITPPMAGDATIYAGDERLNAGVWS
LQAAIAEGDMAVAKKVETVAAQLHSGSNLLNAAVQKETIARVEADRAMAQDITTVQAQVDDNVAAVQTVAKSYADLNGRV
AASYQIKVQTTADGHKYMASIGVGIDNENGVVESQVLVSAKRFAVIDEDGSGVIGAPFVVQGGQVFLRQALIGAGWITNA
MIGSYIQSDNYIAGRQGWRLDKTGWFEINAADGSGNRLVMDGSSVRVYDGNGVLRVRMGMW
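
The relationship between the DNA sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch, not the method the database used: it assumes the standard codon assignments, assumes the input is a complete coding sequence that begins at its start codon, and reads that first codon as methionine (in bacteria, alternative start codons such as TTG are decoded as Met at initiation). The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace and line breaks are ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()       # drop the terminal stop codon
    if residues:
        residues[0] = "M"    # assumption: the first codon is the start codon
    return "".join(residues)


# First 30 nt of the DNA sequence in this record.
print(translate_cds("TTGAAGAAGCTCCATGCAGAAAGAGGGCTG"))  # MKKLHAERGL
```

The printed peptide matches the first ten residues of the protein sequence listed above, even though the gene starts with TTG rather than ATG.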

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
ESA_01044 | YP_001437149.1 | hypothetical protein | Not tested | Not named | Protein | 0.0 | 45