Gene Information

Name : ECP_0226 (ECP_0226)
Accession : YP_668162.1
Strain : Escherichia coli 536
Genome accession: NC_008253
Putative virulence/resistance : Unknown
Product : IcmF-like protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3523
EC number : -
Position : 244617 - 248144 bp
Length : 3528 bp
Strand : -
Note : -

DNA sequence :
GTGTTCAAATTTCCCACATCCCGACTGTTCAGCACGTTGAAATCTGCGCTCAGGCCAGCGATGCCGCGGTTTAAAGTTTC
TGCCACCTGGCTACTGACGCTGGCATGGATTTTTCTGCTGGTGTGGATCTGGTGGCAGGGTCCAAAATGGACGCTCTATG
AGCAGCACTGGTTGGCTCCGCTGGCAAACCGCTGGCTGGCGACCGCCGTCTGGGGACTTATCGCTCTGGTCTGGCTCACC
TGGCGGGTGATGAAGCGTCTGCAAAAGCTGGAAAAACAGCAGAAACAGCAGCGGGAGGAAGAAAAAGATCCGTTGACCGT
GGAACTCCACCGCCAGCAGCAATATCTGGATCACTGGCTGCTGCGCCTGCGCCGCCATCTGGATAACCGCCGTTATCTGT
GGCAGTTGCCGTGGTATATGGTCATTGGTCCTGCGGGTAGCGGCAAAAGCACGCTGCTGCGCGAGGGCTTTCCGTCTGAC
ATTGTTTACACGCCGGAAAGCATCCGGGGTGTGGAATACCACCCGCTGATCACACCGCGAGTGGGCAACCAGGCGGTAAT
TTTCGATGTTGACGGCGTACTGACCACTCCCGGCGGGGATGATCTGCTCCGCCGCCGCCTGCGCGAACACTGGCTGGGCT
GGCTGATGCAAACGCGCGCTCGCCAGCCGCTCAACGGTCTTATCCTGACGCTCGATCTTCCCGATCTGCTGACGGCGGAT
AAATCCCGCCGTGAGACACTGGTACAAAATTTGCGCCAGCAACTTCAGGAGATCCGTCAGAGCCTGCACTGCCGTCTGCC
CGTTTACGTGGTGCTGACACGGCTGGATCTGCTGAACGGCTTTGCCGCGCTGTTCCATTCACTGGATAAAAAAGACCGCG
ATGCGATCCTCGGCGTCACATTTACCCGCCGCGCCCATGAAAGTGACGGCTGGCGCAGCGAACTGGGGGCTTTCTGGCAG
ACGTGGGTACAACAGGTGAACCTGGCGCTGTCGGATCTGGTGCTCGCACAAACCGGTGCTGCTCCCCGCAGCGCTGTGTT
CAGCTTCTCCCGTCAGATGCAGGGAACAGGAGAAATCGTCACCGCACTGCTCGCCGCATTGCTGGACGGTGAGAACATGG
ATGTAATGCTGCGTGGCGTCTGGCTCACATCCTCGCTACAGCGTGGCCAGGTGGATGATATTTTCACGCAGTCCGCCGCC
CGCCAGTACGGACTGGGTAACAGCTCGCTGGCAACCTGGCCTCTGGTGGAGACGACGCCGTATTTTACTCGCCGCCTCTT
CCCGGAAGTCCTGCTGGCTGAGCCGAACCTGGCGGGTGAAAACAGCGTCTGGCTGAACAGCTCCCGGCGCAGGCTGACCG
CCTTTTCCACCTGTGGCGCGGCACTGGCGGCATTGATGGTCGGAAGCTGGCACCATTATTACAATCAGAACTGGCAGTCT
GGCGTTAACGTACTGGCACAAGCTAAAGCCTTTATGGACGTACCACTACCGCAGGGAACGGATGAATTCGGCAATCTGCA
ATTGCCATTGCTTAACCCGGTACGCGATGCCACCCTGGCCTATGGTGATTATCGCGATCACGGTTTTCTGGCGGATATGG
GATTGTACCAGGGCGCCCGCGTAGGGCCGTATGTGGAGCAAACCTACATTCAGCTTCTTGAGCAGCGTTATCTCCCCTCG
TTAATGAACGGCCTGATCCGGGATCTAAACATTGCCCCGCCAGAGAGCGAAGAAAAGCTCGCTGTGCTGCGCGTAGTGCG
CATGATGGAAGACAAAAGTGGGCGCAACAACGAGGCGGTAAAACAGTACATGGCGCGGCGCTGGAGCAATGAATTTCACG
GCCAGCGCGATATTCAGGCGCAACTGATGGTGCATCTGGACTATGCGCTGGAGCACACCGACTGGCACGCGCAGCGCCAA
AGCAGCGACAGCGATGCTGTAAGCCGCTGGACCCCCTATAATAAACCGGTCATTAATGCGCAACATGAACTGAGCAAGCT
ACCCATATACCAGCGTGTCTACCAGACCCTTCGCACCAAAGCATTAAGCGTGTTACCCGCCGATTTGAATTTGCGCGACC
AGGTTGGTCCCACCTTCGACAACGTGTTCGTCGCCGGTAATGATGAAAAACTGGTGATCCCGCAGTTTCTCACCCGCTAT
GGCCTGCAAAGCTATTTTATCAAACAACACGACGGTCTCGTTGAGTTGACCGCGCTGGATTCGTGGGTGCTGAACCTGAC
GCAAAGCGTCGCCTACAGCGAGGCCGACCGTGAAGAGATCCAGCGCCATATCACTGAACAGTACCTCAGTGACTATACCG
CTACATGGCGTGCCGGAATGGATAATCTCAACGTCCGTGACTATGAGACCATGCCGGCGCTGACCGACGCGCTGGAGCAG
ATTATCAGCGGTGATCAGCCATTCCTGCGTGCTCTGACGGCGCTGCGCGATAATACACACGCGCTGACGCTCTCCGGCAA
ACTGGACGATAAGGCGAAGGAAGCGGCGATAAATGAGATGGATTACCGCCTGTTATCCCGGCTGGGGCATGAGTTCGCGC
CGGAAAATAGCGCACTGGAGGAGCAAAAGGACAAGGCGAGTACTCTACAGGCCGTGTATCAGCAACTGACCGAGCTGCAC
CGTTACCTGCTGGCGATCCAGAACTCGCCAGTGTCTGGGAAATCGGCGCTGAAAGCAGTACAGCTACGGCTGGATCAAAA
CAGCAGCGATCCAATCTTCGCCACCCGTCAGATGGCAAAAACCCTGCCTGCGCCTCTTAACCGCTGGGTAGGTAAGCTGG
CGGATCAGGCCTGGCATGTAGTGATGGTGGAAGCCGTTCGTTACATGGAAGTGGACTGGCGCGACAATGTAGTGAAACCC
TTCAACGAGCAGCTTGCCGATAACTATCCGTTTAATCCGCACGCCACACAGGATGCCTCACTGGATTCGTTTGAACGTTT
CTTTAAACCGGATGGCATTCTGGACAATTTCTACAAGAACAACCTGCGCCTGTTCCTTGAAAACGATCTGACCTTTGGCG
ACGACGGCAGAATGTTAATCCGTGAAGATATCCGGCAGCAACTGGATACTGCGCAGAAAATCCGCAACATCTTCTTCAGC
CAGCAGAACGGGCTGGGCGCACAGTTTGCCGTGGAAACCGTATCGCTTTCCGGCAATAAGCGGCGCAGCGTACTTAACCT
GGACGGCCAGTTAGTGGACTACAGCCAGGGACGCAACTACACCGCCCATCTGGTCTGGCCGAACAACATGCGTGAAGGCA
ATGAAAGCCAGCTGACGCTGATTGGCACCAGCGGCAGAGCACCGCGCAGTATCGCGTTCAGCGGACCGTGGGCGCAGTTC
CGCCTGTTCGGCGCGGGCCAGTTGACCAATGTGACCAGTGACACCTTTAACGTGCGCTTTAACGTGGACGGCGGCGCGAT
GGTTTACCGGGTGCATGTGGATACCGAAGATAACCCGTTCACCGGCGGTCTGTTCAGCCAGTTCCGTTTACCGGATACGT
TGTATTAA

Protein sequence :
MFKFPTSRLFSTLKSALRPAMPRFKVSATWLLTLAWIFLLVWIWWQGPKWTLYEQHWLAPLANRWLATAVWGLIALVWLT
WRVMKRLQKLEKQQKQQREEEKDPLTVELHRQQQYLDHWLLRLRRHLDNRRYLWQLPWYMVIGPAGSGKSTLLREGFPSD
IVYTPESIRGVEYHPLITPRVGNQAVIFDVDGVLTTPGGDDLLRRRLREHWLGWLMQTRARQPLNGLILTLDLPDLLTAD
KSRRETLVQNLRQQLQEIRQSLHCRLPVYVVLTRLDLLNGFAALFHSLDKKDRDAILGVTFTRRAHESDGWRSELGAFWQ
TWVQQVNLALSDLVLAQTGAAPRSAVFSFSRQMQGTGEIVTALLAALLDGENMDVMLRGVWLTSSLQRGQVDDIFTQSAA
RQYGLGNSSLATWPLVETTPYFTRRLFPEVLLAEPNLAGENSVWLNSSRRRLTAFSTCGAALAALMVGSWHHYYNQNWQS
GVNVLAQAKAFMDVPLPQGTDEFGNLQLPLLNPVRDATLAYGDYRDHGFLADMGLYQGARVGPYVEQTYIQLLEQRYLPS
LMNGLIRDLNIAPPESEEKLAVLRVVRMMEDKSGRNNEAVKQYMARRWSNEFHGQRDIQAQLMVHLDYALEHTDWHAQRQ
SSDSDAVSRWTPYNKPVINAQHELSKLPIYQRVYQTLRTKALSVLPADLNLRDQVGPTFDNVFVAGNDEKLVIPQFLTRY
GLQSYFIKQHDGLVELTALDSWVLNLTQSVAYSEADREEIQRHITEQYLSDYTATWRAGMDNLNVRDYETMPALTDALEQ
IISGDQPFLRALTALRDNTHALTLSGKLDDKAKEAAINEMDYRLLSRLGHEFAPENSALEEQKDKASTLQAVYQQLTELH
RYLLAIQNSPVSGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGKLADQAWHVVMVEAVRYMEVDWRDNVVKP
FNEQLADNYPFNPHATQDASLDSFERFFKPDGILDNFYKNNLRLFLENDLTFGDDGRMLIREDIRQQLDTAQKIRNIFFS
QQNGLGAQFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESQLTLIGTSGRAPRSIAFSGPWAQF
RLFGAGQLTNVTSDTFNVRFNVDGGAMVYRVHVDTEDNPFTGGLFSQFRLPDTLY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec30 YP_851415.1 hypothetical protein Not tested PAI II APEC-O1 Protein 0.0 99
aec30 AAQ96724.1 Aec30 Not tested AGI-1 Protein 0.0 99
pmt1 AAN64194.1 Pmt1 Not tested macrophage toxin pathogenicity island Protein 0.0 56