Gene Information

Name : EcHS_A2123 (EcHS_A2123)
Accession : YP_001458801.1
Strain : Escherichia coli HS
Genome accession : NC_009800
Putative virulence/resistance : Unknown
Product : IS66 family transposase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 2107746 - 2109359 bp
Length : 1614 bp
Strand : +
Note : identified by match to protein family HMM PF03050
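
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so explicitly); the variable names are illustrative only.

```python
# Coordinates copied from the Position field above.
# Assumption: both ends are 1-based and inclusive.
start, end = 2107746, 2109359

gene_length = end - start + 1               # inclusive span in bp
codons, remainder = divmod(gene_length, 3)  # whole codons, leftover bases

print(gene_length)        # 1614
print(codons, remainder)  # 538 0
```

Under that assumption the span matches the stated length of 1614 bp, which corresponds to 538 codons: 537 amino acids plus one stop codon.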

DNA sequence :
ATGAGTCAGAAATACCTCATTCGCATCGCAGAGCTGGAAAGGTTGCTCTCTGAGCAGGCTGAAGCCCTCCGTCAGAAAGA
CCAGCAACTGAGTCTGGTTGAAGAGACGGAAGCCTTCCTGCGCTCTGCACTGACACGTGCCGAAGAAAAGATCGAAGAAG
ATGAACGGGAAATAGAACATCTGCGGGCTCAGATAGAAAAACTGCGCCGGATGCTGTTCGGTACCCGTTCTGAAAAACTG
CGTCGTGAAGTTGAACTGGCTGAGGCTCTGCTGAAACAACGTGAACAGGACAGCGATCGTTACAGTGGGCGGGAAGACGA
TCCTCAGGTTCCCCGCCAGTTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACACCTTCCCCGTGAAATACACCGCC
TGGAGCCAGAAGAAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCTGAACAGCTGGAA
CTGGTGAGCAGTGCCCTGAAAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGTATTGTTGAAGC
ACCGGCGCCGTCCCGCCCGATAGAGCGTGGTATCGCGGGCCCCGGATTACTTGCCCGCGTGTTAACGGGAAAATACTGCG
AACATCTGCCACTGTATCGTCAGAGTGAAATCTTTGCCCGCCAGGGTGTCGAACTGAGCCGGGCCTTACTCTCCAACTGG
GTTGACGCGTGCTGCCAGTTAATGACACCGGTGAATGATGCCCTGTACCGTTATGTAATGAACACCCGCAAGATTCACAC
TGATGACACACCGGTAAAGGTACTGGCACCGGGTCAGAAAAAGGCGAAAACAGGGCGTATCTGGACGTATGTCCGGGATG
ATCGCAATGTGGGTTCGTCATCTCCTCCAGCGGTCTGGTTCGCGTACTCGCCGAACCGGCAGGGGAAACACCCGGAGCAA
CACCTCCGCCCCTTCCGGGGTATCCTGCAGGCGGATGCGTTCACAGGTTACGACAGGTTGTTCAGTGCAGAACGTGAAGG
TGGTGCACTGACAGAAGTTGCGTGCTGGGCCCATGCCCGGCGAAAAATCCACGATGTATACATCAGCAGCAAAAGTGCGA
CGGCAGAAGAAGCACTGAAGCGAATCAGTGAACTGTACGCCATCGAGGATGAAATACGGGGATTACCGGAGTCAGAGCGT
CTTGCCGTCAGGCAGCAGCGAAGCAAAGTGTTACTGACGTCGCTGCATGAATGGATGGTGGAGAAGAATGGTACGCTGTC
GAAAAAATCCAGACTGGGCGAAGCGTTCAGCTATGTACTGAATCAGTGGGATGCCCTCTGTTATTACAGTGATGACGGTC
TGGCGGAGGCGGATAATAATGCTGCGGAAAGAGCGCTTCGTGCAGTCTGTCTCGGAAAGAAAAACTTTATGTTCTTTGGC
AGCGATCACGGCGGCGAGCGTGGAGCACTGTTGTACGGGCTGATCGGCACCTGCCGTCTGAACGGTATCGATCCGGAAGC
GTATCTGCGCCATATCCTGAGCGTACTGCCGGAATGGCCTTCCAACCGAGTTGATGAACTCCTGCCATGGAACGTAGTAC
TCACCAATAAATAA

Protein sequence :
MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALTRAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKL
RREVELAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGGELDYLGEVSAEQLE
LVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNW
VDACCQLMTPVNDALYRYVMNTRKIHTDDTPVKVLAPGQKKAKTGRIWTYVRDDRNVGSSSPPAVWFAYSPNRQGKHPEQ
HLRPFRGILQADAFTGYDRLFSAEREGGALTEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESER
LAVRQQRSKVLLTSLHEWMVEKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNAAERALRAVCLGKKNFMFFG
SDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK
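
The relationship between the DNA and protein sequences above can be spot-checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code; `translate` and `CODON_TABLE` are hypothetical helper names, not part of any database tool.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (an assumption;
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and whitespace
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 15 and last 12 bases of the DNA sequence listed above.
first_codons = translate("ATGAGTCAGAAATAC")
last_codons = translate("ACCAATAAATAA")
print(first_codons)  # MSQKY
print(last_codons)   # TNK*
```

The two fragments agree with the start (MSQKY) and end (TNK, followed by a stop codon) of the protein sequence listed above.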

• Homologs from PAI DB

Gene      GenBank Accn    Product                          Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity
Z4340     NP_289564.1     hypothetical protein             Not tested               OI-122      Protein         0.0     97
Z1161     NP_286696.1     hypothetical protein             Not tested               TAI         Protein         0.0     96
Z1600     NP_287104.1     hypothetical protein             Not tested               TAI         Protein         0.0     96
tnp       AEA34687.1      IS66 family transposase          Not tested               Not named   Protein         3e-140  88
unnamed   CAC39285.1      hypothetical protein             Not tested               LPA         Protein         3e-140  64
unnamed   AAC31494.1      L0015                            Not tested               LEE         Protein         3e-140  64
ECs4547   NP_312574.1     hypothetical protein             Not tested               LEE         Protein         2e-139  64
Z1131     NP_286666.1     hypothetical protein             Not tested               TAI         Protein         2e-139  64
tnp       AEA34686.1      transposase                      Not tested               Not named   Protein         2e-139  64
Z1570     NP_287074.1     hypothetical protein             Not tested               TAI         Protein         2e-139  64
unnamed   ACU09439.1      IS66 family element transposase  Not tested               LEE         Protein         2e-139  64
Z4337     NP_289562.1     hypothetical protein             Not tested               OI-122      Protein         3e-140  64
Z5098     NP_290249.1     prophage-associated protein      Not tested               LEE         Protein         3e-140  64
st57      CAC81895.1      ST57 protein                     Not tested               LEE II      Protein         2e-134  63
unnamed   AAL57570.1      unknown                          Not tested               LEE         Protein         2e-140  61
unnamed   CAI43806.1      hypothetical protein             Not tested               LEE         Protein         8e-140  61
aec53     AAW51736.1      Aec53                            Not tested               AGI-3       Protein         1e-103  54
BCAM0248  YP_002232880.1  putative transposase             Not tested               BcenGI11    Protein         1e-99   54

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn    Product                  ID of source DB  Alignment Type  E-val   Identity
EcHS_A2123  YP_001458801.1  IS66 family transposase  VFG1736          Protein         1e-169  91
EcHS_A2123  YP_001458801.1  IS66 family transposase  VFG0793          Protein         1e-140  64