Gene Information

Name : EcE24377A_1757 (EcE24377A_1757)
Accession : YP_001462840.1
Strain : Escherichia coli E24377A
Genome accession: NC_009801
Putative virulence/resistance : Unknown
Product : IS66 family transposase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 1756610 - 1758181 bp
Length : 1572 bp
Strand : -
Note : identified by match to protein family HMM PF03050

DNA sequence :
ATGGACACCTCACTTGCTCATGAGAACGCCCGCCTGCGGGCACTGTTGCAGACGCAACAGGACACCATACGCCAGATGGC
TGAATACAACCGCCTGCTCTCACAGCGGGTGGCGGCTTATGCTTCCGAAATCAACCGGCTGAAGGCGCTGGTTGCGAAAC
TGCAACGTATGCAGTTCGGTAAAAGCTCAGAAAAACTTCGTGCAAAAACCGAACGGCAGATACAGGAAGCACAGGAGCGA
ATCAGCGCACTTCAGGAAGAAATGGCGGAAACGCTGGGTGAGCAATATGACCCGGTACTGCCATCCGCCCTGCGCCAGTC
TTCAGCCCGTAAACCGTTACCGGCCTCACTTCCCCGTGAAACCCGGGTTATCCGGCCGGAAGAGGAATGCTGTCCTGCCT
GTGGTGGTGAACTCAGTTCTCTGGGATGTGATGTGTCAGAGCAACTGGAGCTTATCAGCAGCGCCTTTAAGGTTATCGAA
ACACAACGTCCGAAACAGGCCTGTTGCCGGTGCGACCATATCGTGCAGGCACCAGTACCTTCAAAACCCATTGCACGCAG
TTATGCCGGAGCGGGGCTTCTGGCCCATGTTGTCACCGGGAAATATGCAGACCATCTGCCGTTATACCGCCAGTCAGAAA
TATACCGTCGTCAGGGAGTGGAGCTGAGCCGTGCCACACTGGGGCGCTGGACAGGTGCTGTTGCTGAACTGCTGGAGCCG
CTGTATGACGTCCTGCGCCAGTATGTGCTGATGCCCGGTAAAGTCCATGCTGATGATATCCCCGTCCCGGTCCAGGAGCC
GGGCAGCGGTAAAACCCGGACAGCCCGGCTGTGGGTCTACGTCCGTGATGACCGTAACGCCGGTTCACAGATGCCCCCGG
CGGTCTGGTTCGCGTACAGTCCGGACCGGAAAGGTATCCATCCACAAAATCACCTGGCCGGTTACAGCGGTGTGCTTCAG
GCCGATGCTTACGGTGGTTACCGGGCGTTATACGAATCCGGCAGAATAACGGAAGCCGCGTGTATGGCTCATGTCCGGAG
AAAAATCCACGATGTGCATGCAAGAGCGCCCACCTACATCACCACGGAAGCCCTGCAGCGTATCGGTGAACTGTATGCCA
TCGAGGCAGAGGTCCGGGGCTGTTCAGCAGAACAGCGTCTGGCGGCAAGAAAAGCCAGAGCCGCGCCACTGATGCAGTCA
CTGTATGACTGGATACAGCAACAGATGAAAACACTGTCGCGTCACTCAGATACGGCAAAAGCGTTCGCATACCTGCTGAA
ACAGTGGGATGCACTGAACGTGTACTGCAGTAATGGCTGGGTGGAAATCGACAACAACATCGCAGAGAACGCCTTACGGG
GAGTGGCCGTAGGCCGGAAAAACTGGATGTTCGCGGGTTCCGACAGCGGTGGTGAACATGCGGCGGTGTTGTACTCGCTG
ATCGGCACATGCCGTCTGAACAATGTGGAGTCAGAAAAGTGGCTGCGTTACGTCATTGAACATATCCAGGACTGGCCGGC
AAACCGGGTACGCGATCTGTTGCCCTGGAAAGTTGATCTGAGCTCTCAGTAA

Protein sequence :
MDTSLAHENARLRALLQTQQDTIRQMAEYNRLLSQRVAAYASEINRLKALVAKLQRMQFGKSSEKLRAKTERQIQEAQER
ISALQEEMAETLGEQYDPVLPSALRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLELISSAFKVIE
TQRPKQACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVTGKYADHLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEP
LYDVLRQYVLMPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSQMPPAVWFAYSPDRKGIHPQNHLAGYSGVLQ
ADAYGGYRALYESGRITEAACMAHVRRKIHDVHARAPTYITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQS
LYDWIQQQMKTLSRHSDTAKAFAYLLKQWDALNVYCSNGWVEIDNNIAENALRGVAVGRKNWMFAGSDSGGEHAAVLYSL
IGTCRLNNVESEKWLRYVIEHIQDWPANRVRDLLPWKVDLSSQ

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 0.0 99
c3563 NP_755438.1 hypothetical protein Not tested PAI I CFT073 Protein 8e-115 67
ECO103_3554 YP_003223421.1 hypothetical protein Not tested LEE Protein 2e-138 64
Z4317 NP_289543.1 hypothetical protein Not tested OI-122 Protein 8e-139 64
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 8e-127 61
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 2e-131 60
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 2e-131 60
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 1e-131 60
unnamed AAC31494.1 L0015 Not tested LEE Protein 1e-131 60
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 1e-131 60
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 1e-131 60
tnp AEA34686.1 transposase Not tested Not named Protein 1e-132 60
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 7e-131 59
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 7e-131 59
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 2e-131 59
unnamed AAL57570.1 unknown Not tested LEE Protein 8e-132 59
unnamed AAL08460.1 unknown Not tested SRL Protein 3e-92 58
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 1e-110 54
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 2e-110 54
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 2e-110 54
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 2e-119 53
tnp AEA34687.1 IS66 family transposase Not tested Not named Protein 1e-72 51

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcE24377A_1757 YP_001462840.1 IS66 family transposase VFG1700 Protein 4e-115 67
EcE24377A_1757 YP_001462840.1 IS66 family transposase VFG0793 Protein 6e-132 60
EcE24377A_1757 YP_001462840.1 IS66 family transposase VFG1051 Protein 2e-92 58
EcE24377A_1757 YP_001462840.1 IS66 family transposase VFG1736 Protein 1e-90 54