Gene Information

Name : Caul_5388 (Caul_5388)
Accession : YP_001672164.1
Strain :
Genome accession: NC_010333
Putative virulence/resistance : Unknown
Product : transposase IS66
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 98285 - 99973 bp
Length : 1689 bp
Strand : +
Note : PFAM: transposase IS66; KEGG: mag:amb3325 transposase and inactivated derivative

DNA sequence :
ATGGACGCTGACCTCGCAGCCCTGCCGGACGATATCGAAGCGCTGAAGGCGGCGCTTCTGGTCGCCCGGGCCGAGGTCGC
GCAAGCGCAAGACGTGGCCGCGAGAGCTCAGGCCGAAGCCTCCAGAGCCCAGGCCGAAGCCGCTGAAGCCAAGGCGCGCG
TGTCTGACGACCAAGCGCTGATCGCCCACCTGAAGCTCCAGATCCAGAAGCTCAATCGTGAGCGCTTCGGCCCTAGCTCG
GAACGCACGGCCCGTCTGCTTGATCAGCTGGAACTGCAGTTGGAGGAGCTGGAGGCTTCGGCGACGGAAGACGAGCTGGC
CGCCGAGATGGCGGCGGCTCGGACCACGACGGTGGCCGCCTTCAGCCGCAAGCGGCCTTCGCGCCAGCCCTTCCCGGAAC
ACCTGCCGCGTGAGCGGGTGATCGTGCCAGGTCCGACCGCCTGCGCCTGCTGTGGCGGGCTGCGCCTCTCGAAGCTGGGC
GAAGACGTTACCGAAACGCTGGAGGTCGTGCCCCGGTCCTGGAAGGTCATCGCGCACGTCCGCGAGAAGTTTAGCTGCCG
CGACTGTGAGGCCATCGGCCAGGCGCCGGCTCCGTTCCATGTGATCGCCAGGGGCTGGGCGGGTCCCAGCCTGCTGGCCA
TGATCCTGTTCGAGAAGTTTGGTCAGCATCAGCCGCTCAATCGCCAGGCCGACCGCTATGCTCGCGAGGGCGTGCCGCTC
AGTCTGTCGACCTTGGCCGATCAGGTCGGGGCCTGCACGGCGGTGCTGGCGCCGCTGTTCCAGCGGCTGGAGGCTCACGT
GCTTGCCGCCGAACGATTGCACGGCGACGACACCACGGTTCCGGTATTGGCCAAGGGCAAGACCGACACCGCCAGGCTCT
GGGTCTATGTGCGCGACGACAAGCCGTTCGCGGGATCGGCGCCGCCGGGCGCGGTCTTCTACTACTCGCGTGATCGGGGT
GGCGAGCATCCGCAAGCGCACTTGTCAGGTTATGCCGGCCTGTTCCAGGCCGACGCCTATGGCGGTTACGGCAAGCTCTA
TGAGCCAGGGCGAAACCCAGGTCCCATTCTTGAAGCAGCCTGCTGGGCACACGCGCGTCGGCCGTTCTTCGTGCTGGCCG
ACCTGGAGCAGAATGCGCGCCGCAAGGCTCGCGGCGCGGCGCCGGCGGTGATCTCGCCGATCGCCCTGGAGATGGTCCAG
CGGATCGACGCGCTGTTCGAGATCGAGCGGGGGATCAGCGGCCAGGACGCAGATAGGCGCCTAGCGGTGCGACAGGCGCT
CAGCGCCCCGCTGGTCGCCGAGATGGAGATCTGGATGCGCGAGCAGCGCGCCAAGCTCTCACGCGGTCATGACTTGGCCC
GGGCCTTCGACTACATGCTCAAGCGCTGGGCCGCGTTCACGCGCTTCCTCGACGACGGCCGCGTCTGTCTGAGCAACAAT
GCCGCCGAGCGGGCGCTGCGCGGCGTGGCCATGGGGCGTAAGTCCTGGCTGTTCTGTGGTTCTGATCGCGGCGGTCAACG
CGCGGCGGTGATGTACAGCCTGATCGTCACCGCCAAGCTGAACGACATCGACCCTCAAGCCTGGCTGGCCGACGTCCTGG
CCCGCATCGCCGAGCATCCCAGCCAGCAGCTCGATGAACTACTGCCCTGGAACTGGCAGCCCCTCGCTACCGCTGACCGC
GCCGCTTAG

Protein sequence :
MDADLAALPDDIEALKAALLVARAEVAQAQDVAARAQAEASRAQAEAAEAKARVSDDQALIAHLKLQIQKLNRERFGPSS
ERTARLLDQLELQLEELEASATEDELAAEMAAARTTTVAAFSRKRPSRQPFPEHLPRERVIVPGPTACACCGGLRLSKLG
EDVTETLEVVPRSWKVIAHVREKFSCRDCEAIGQAPAPFHVIARGWAGPSLLAMILFEKFGQHQPLNRQADRYAREGVPL
SLSTLADQVGACTAVLAPLFQRLEAHVLAAERLHGDDTTVPVLAKGKTDTARLWVYVRDDKPFAGSAPPGAVFYYSRDRG
GEHPQAHLSGYAGLFQADAYGGYGKLYEPGRNPGPILEAACWAHARRPFFVLADLEQNARRKARGAAPAVISPIALEMVQ
RIDALFEIERGISGQDADRRLAVRQALSAPLVAEMEIWMREQRAKLSRGHDLARAFDYMLKRWAAFTRFLDDGRVCLSNN
AAERALRGVAMGRKSWLFCGSDRGGQRAAVMYSLIVTAKLNDIDPQAWLADVLARIAEHPSQQLDELLPWNWQPLATADR
AA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 7e-87 47
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 1e-83 46
unnamed AAC31494.1 L0015 Not tested LEE Protein 4e-90 45
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 5e-90 45
tnp AEA34686.1 transposase Not tested Not named Protein 2e-90 45
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 5e-90 45
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 8e-90 45
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 3e-89 45
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 3e-89 45
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 4e-90 45
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 1e-89 45
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 5e-87 45
unnamed AAL57570.1 unknown Not tested LEE Protein 2e-90 45
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 8e-90 45
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 2e-78 44
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 2e-78 42
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 2e-78 42
unnamed AAL08460.1 unknown Not tested SRL Protein 4e-61 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Caul_5388 YP_001672164.1 transposase IS66 VFG0793 Protein 2e-90 45
Caul_5388 YP_001672164.1 transposase IS66 VFG1051 Protein 2e-61 41