Gene Information

Name : EC55989_3274 (EC55989_3274)
Accession : YP_002404245.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Unknown
Product : transposase ORF1, IS66 family
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 3357615 - 3359183 bp
Length : 1569 bp
Strand : -
Note : Evidence 4 : Homologs of previously reported genes of unknown function

DNA sequence :
ATGAGTCAGAAATACCTCATTCGCATCGCTGAGCTGGAAAGGCTGCTCTCTGAGCAGGCTGAAGCCCTCCGTCAGAAAGA
CCAGCAACTGAGTCTGGTTGAAGAGACGGAGGCCTTCCTGCGCTCTGCACTGGCACGTGCCGAAGAAAAGATCGAAGAAG
ATGAACGGGAAATAGAGCATCTGCGGGCTCAGATAGAAAAACTGCGCCGGATGCTGTTCGGAACCCGTTCTGAAAAACTG
CGTCGTGAGGTTGAACAGGCTGAAGCCCTGCTGAAACAACGCGAGCAGGAAAGCGATCGTTACAGTGGGCGTGAGGATGA
CCCGCTGGTTCCCCGCCAGTTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACATCTCCCCCGTGAAATATACCGCC
TGGAGCCTGAAGAAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCAGAACAACTGGAA
CTGGTGAGCAGCGCCCTGAAAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGCATCGTTGAAGC
ACCGGCACCATCCCGTCCGATAGAGCGTGGTATCGCGGGCCCGGGGTTACTTGCCCGCGTGTTAACGGGAAAATACTGCG
AACACCTGCCACTGTATCGTCAGAGTGAAATCTTTGCCCGCCAGGGTGTCGAACTGAGCCGTGCCTTACTCTCCAACTGG
GTTGATGCGTGCTGCCAGTTAATGACGCCGCTGAATGATGCCCTGTACAGTTATGTGATGAACACCCGCAAGGTTCACAC
TGATGACACACCAGTAAAAGTACTGGCACCGGGCAGGAAGAAGGCGAAAACAGGATATATCTGGACGTATGTCCGGGATG
ACCGAAATGCCGGTTCGCCAGAACCTCCGGCGGTCTGGTTCGCCTACTCACCGGACCATCAGGGTAAACATCCGGAGCAA
CACCTTCGTCCCTTCCGGGGTATCCTGCAGGCAGATGCGTTCGCAGGTTACGATCGGCTGTTCAGTGCCGAACGTGAAGG
CGGCGCGTTGACGGAAGCAGGATGCTGGGCTCATGCGCGGCGCAAAATCCACGATGTATATATCAGTACCAAAAGCGCGA
CGGCGGAAGAAGCACTGAAACTAATCGGCGAACTGTACGCCATTGAGCACGAAATACGCGGGTTGCCGGTGTCTGAACGC
CTGGCGGTCAGGCAAATGCAGAGTAAACCGCTACTGACTTCCCTGTATAAGCTGATGCAGGAGAAAGAACACACGTTATC
GAAAAAATGCCGTCTGAGAGATGCGTTCCGGTATATCAGGAAGCACTGGGTTGCGTTGTGCAACTTCTGTGATGACGGTC
TGGCGGAGGCGGACAATAACACAGCGGAAAGAGCGCTTCGTGCAGTCTGTCTCGGAAAGAAAAATTCTTATGACCTTTGT
CAAATATTAAGCCCAAAAAGACCTTACGCAGCTCCGGGAGCTTCGTTATATCGCGGATTAAAGAACCTGAAACAGTCAGA
CTGTATCTTGGATTTTACAAACTCTTATTCGGGGATTTTAGCCCCATGA

Protein sequence :
MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKL
RREVEQAEALLKQREQESDRYSGREDDPLVPRQLRQSRHRRPLPAHLPREIYRLEPEESCCPECGGELDYLGEVSAEQLE
LVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNW
VDACCQLMTPLNDALYSYVMNTRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWFAYSPDHQGKHPEQ
HLRPFRGILQADAFAGYDRLFSAEREGGALTEAGCWAHARRKIHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSER
LAVRQMQSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFCDDGLAEADNNTAERALRAVCLGKKNSYDLC
QILSPKRPYAAPGASLYRGLKNLKQSDCILDFTNSYSGILAP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 2e-170 90
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 2e-170 90
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 1e-150 90
tnp AEA34687.1 IS66 family transposase Not tested Not named Protein 3e-144 87
unnamed AAC31494.1 L0015 Not tested LEE Protein 3e-110 63
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 3e-110 63
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 5e-110 63
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 5e-110 63
tnp AEA34686.1 transposase Not tested Not named Protein 7e-110 63
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 4e-110 63
unnamed AAL08460.1 unknown Not tested SRL Protein 9e-98 63
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 2e-105 62
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 3e-109 62
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 3e-109 62
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 4e-110 62
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 2e-109 62
unnamed AAL57570.1 unknown Not tested LEE Protein 6e-110 62
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 2e-84 54
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 1e-76 46

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EC55989_3274 YP_002404245.1 transposase ORF1, IS66 family VFG1736 Protein 0.0 99
EC55989_3274 YP_002404245.1 transposase ORF1, IS66 family VFG0793 Protein 2e-110 63
EC55989_3274 YP_002404245.1 transposase ORF1, IS66 family VFG1051 Protein 5e-98 63