Gene Information

Name : ECS88_0234 (ECS88_0234)
Accession : YP_002390068.1
Strain : Escherichia coli S88
Genome accession: NC_011742
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3523
EC number : -
Position : 245294 - 248821 bp
Length : 3528 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pm : membrane component

DNA sequence :
GTGTTCAAATTTCCCACATCCCGACTGTTCAGCACGTTGAAATCTGCGCTCAGGCCAGCGATGCCGCGGTTTAAAGTTTC
CGCCACCTGGCTACTGACGCTGGCATGGATTTTTCTGCTGGTGTGGATCTGGTGGCAGGGTCCAAAATGGACGCTCTATG
AGCAGCACTGGCTGGCTCCGCTGGCAAACCGCTGGCTGGCGACCGCCGTCTGGGGACTTATCGCTCTGGTCTGGCTCACC
TGGCGGGTGATGAAGCGTCTGCAAAAGCTGGAAAAACAGCAGAAACAGCAGCGGGAGGAAGAAAAAGATCCGTTGACCGT
GGAACTCCACCGCCAGCAGCAATATCTGGATCACTGGCTGCTGCGCCTGCGCCGCCATCTGGATAACCGCCGTTATCTGT
GGCAGTTGCCGTGGTATATGGTCATTGGTCCTGCGGGTAGCGGCAAAAGCACGCTGCTGCGCGAGGGCTTTCCGTCTGAC
ATTGTTTACACGCCGGAAAGCATCCGGGGTGTGGAATACCACCCGCTGATCACACCGCGAGTGGGCAACCAGGCGGTAAT
TTTCGATGTTGACGGCGTACTGACCACTCCCGGCGGGGATGATCTGCTCCGCCGCCGCCTGCGCGAACACTGGCTGGGCT
GGCTGATGCAAACGCGCGCTCGCCAGCCGCTCAACGGTCTTATCCTGACGCTCGATCTTCCCGATCTGCTGACGGCGGAT
AAATCCCGCCGTGAGACACTGGTACAAAATTTGCGCCAGCAACTTCAGGAGATCCGTCAGAGCCTGCACTGCCGTCTGCC
CGTTTACGTGGTGCTGACACGGCTGGATCTGCTGAACGGCTTTGCCGCGCTGTTCCATTCACTGGATAAAAAAGACCGCG
ATGCGATCCTCGGCGTCACATTTACCCGCCGCGCCCATGAAAGTGACGGCTGGCGCAGCGAACTGGGGGCTTTCTGGCAG
ACGTGGGTACAACAGGTGAACCTGGCGCTGTCGGATCTGGTGCTCGCACAAACCGGTGCTGCTCCCCGCAGCGCTGTGTT
CAGCTTCTCCCGTCAGATGCAGGGAACAGGAGAAATCGTCACCGCACTGCTCGCCGCATTGCTGGACGGTGAGAACATGG
ATGTAATGCTGCGTGGCGTCTGGCTCACATCCTCGCTACAGCGTGGCCAGGTGGATGATATTTTCACGCAGTCCGCCGCC
CGCCAGTACGGACTGGGTAACAGCTCGCTGGCAACCTGGCCTCTGGTGGAGACGACGCCGTATTTTACTCGCCGCCTCTT
CCCGGAAGTCCTGCTGGCTGAGCCGAACCTGGCGGGTGAAAACAGCGTCTGGCTGAACAGCTCCCGGCGCAGGCTGACCG
CCTTTTCCACCTGTGGCGCGGCACTGGCGGCATTGATGGTCGGAAGCTGGCACCATTATTACAATCAGAACTGGCAGTCT
GGCGTTAACGTACTGGCACAAGCTAAAGCCTTTATGGACGTACCACCACCGCAGGGAACGGATGAATTCGGCAATCTGCA
ATTGCCATTGCTTAACCCGGTACGCGATGCCACCCTGGCCTATGGTGATTATCGCGATCACGGTTTTCTGGCGGATATGG
GATTGTACCAGGGCGCCCGCGTAGGGCCGTATGTGGAGCAAACCTACATTCAGCTTCTTGAGCAGCGTTATCTCCCCTCG
TTAATGAACGGCCTGATCCGGGATCTAAACATTGCCCCGCCAGAGAGCGAAGAAAAGCTCGCTGTGCTGCGCGTAGTGCG
CATGATGGAAGACAAAAGTGGGCGCAACAACGAGGCGGTAAAACAGTACATGGCACGGCGCTGGAGCAATGAATTTCACG
GCCAGCGCGATATTCAGGCGCAACTGATGGTGCATCTGGACTATGCGCTGGAGCACACCGACTGGCACGCGCAGCGCCAA
AGCAGCGACAGCGATGCTGTCAGCCGCTGGACCCCCTATGATAAACCGATCATTAATGCGCAGCAGGAACTGAGCAAGCT
GCCCATATACCAGCGTGTCTACCAGACCCTGCGCACCAAAGCATTAAGCGTGTTGCCCGCCGATTTGAATTTGCGCGACC
AGGTTGGTCCCACCTTCGACAACGTGTTCGTCGCCGGTAATGATGAAAAACTGGTGATCCCGCAGTTCCTCACCCGCTAT
GGACTGCAAAGCTATTTTGTCAAACAGCGTGAGGGCCTCGTTGAGCTGACCGCGCTGGATTCGTGGGTACTGAACCTGAC
GCAAAGCGTCGCCTACAGCGAGGCCGACCGTGAAGAGATCCAGCGCCATATCACCGAACAGTACATCAGTGACTATACCG
CCACCTGGCGTGCCGGAATGGATAACCTCAACGTCCGTGACTATGAGGCCATGTCGGCGCTGACCGACGCGCTGGAGCAG
ATTATCAGCGGCGATCAGCCATTCCAGCGTGCGCTGACGGCGCTGCGCGATAATACCCACGCGCTGACGCTCTCCGGCAA
ACTGGATGATAAGGCGAGGGAAGCGGCGATAAATGAGATGGATTACCGCCTGTTATCCCGGCTGGGGCATGAGTTCGCAC
CGGAAAACAGCGCACTGGAGGAGCAAAAGGACAAGGCGAGTACGCTACAGGCCGTGTACCAGCAACTGACCGAGCTGCAC
CGTTACCTGCTGGCGATCCAGAACTCGCCAGTGCCGGGGAAATCGGCGCTGAAAGCAGTACAGCTACGGCTGGATCAAAA
CAGCAGCGATCCAATCTTCGCCACCCGTCAGATGGCAAAAACCCTGCCTGCGCCTCTTAACCGCTGGGTAGGTAAGCTCG
CGGATCAGGCCTGGCATGTGGTGATGGTGGAAGCCGTTCGTTACATGGAAGTGGACTGGCGCGACAATGTAGTGAAACCC
TTCAACGAGCAGCTTGCCGATAACTATCCGTTTAATCCGCGCGCCACACAGGATGCCTCACTGGATTCGTTTGAACGTTT
CTTTAAACCGGATGGCATTCTGGACAATTTCTACAAGAACAACCTGCGCCTGTTCCTTGAAAACGATCTGACCTTTGGCG
ACGACGGCAGAGTGTTAATCCGTGAAGATATCCGGCAGCAACTGGATACCGCGCAGAAAATCCGCGACATCTTCTTCAGC
CAGCAGAACGGGCTGGGCGCACAGTTTGCCGTGGAAACCGTATCGCTTTCCGGCAATAAGCGGCGCAGCGTACTTAACCT
GGACGGCCAGTTAGTGGACTACAGCCAGGGACGCAACTACACCGCCCATCTGGTCTGGCCGAACAACATGCGTGAAGGCA
ATGAAAGCAAGCTGACGCTGATTGGCACCAGCGGCAGAGCACCGCGCAGTATCGCGTTCAGTGGACCGTGGGCGCAGTTC
CGCCTGTTCGGCGCGGGCCAGTTGACCAATGTGACCAGTGACACCTTTAACGTGCGCTTTAACGTGGACGGCGGCGCAAT
GGTTTACCAGGTGCATGTGGATACCGAAGATAACCCGTTCACCGGCGGTCTGTTCAGCCTGTTCCGTTTACCGGATACGT
TGTATTAA

Protein sequence :
MFKFPTSRLFSTLKSALRPAMPRFKVSATWLLTLAWIFLLVWIWWQGPKWTLYEQHWLAPLANRWLATAVWGLIALVWLT
WRVMKRLQKLEKQQKQQREEEKDPLTVELHRQQQYLDHWLLRLRRHLDNRRYLWQLPWYMVIGPAGSGKSTLLREGFPSD
IVYTPESIRGVEYHPLITPRVGNQAVIFDVDGVLTTPGGDDLLRRRLREHWLGWLMQTRARQPLNGLILTLDLPDLLTAD
KSRRETLVQNLRQQLQEIRQSLHCRLPVYVVLTRLDLLNGFAALFHSLDKKDRDAILGVTFTRRAHESDGWRSELGAFWQ
TWVQQVNLALSDLVLAQTGAAPRSAVFSFSRQMQGTGEIVTALLAALLDGENMDVMLRGVWLTSSLQRGQVDDIFTQSAA
RQYGLGNSSLATWPLVETTPYFTRRLFPEVLLAEPNLAGENSVWLNSSRRRLTAFSTCGAALAALMVGSWHHYYNQNWQS
GVNVLAQAKAFMDVPPPQGTDEFGNLQLPLLNPVRDATLAYGDYRDHGFLADMGLYQGARVGPYVEQTYIQLLEQRYLPS
LMNGLIRDLNIAPPESEEKLAVLRVVRMMEDKSGRNNEAVKQYMARRWSNEFHGQRDIQAQLMVHLDYALEHTDWHAQRQ
SSDSDAVSRWTPYDKPIINAQQELSKLPIYQRVYQTLRTKALSVLPADLNLRDQVGPTFDNVFVAGNDEKLVIPQFLTRY
GLQSYFVKQREGLVELTALDSWVLNLTQSVAYSEADREEIQRHITEQYISDYTATWRAGMDNLNVRDYEAMSALTDALEQ
IISGDQPFQRALTALRDNTHALTLSGKLDDKAREAAINEMDYRLLSRLGHEFAPENSALEEQKDKASTLQAVYQQLTELH
RYLLAIQNSPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGKLADQAWHVVMVEAVRYMEVDWRDNVVKP
FNEQLADNYPFNPRATQDASLDSFERFFKPDGILDNFYKNNLRLFLENDLTFGDDGRVLIREDIRQQLDTAQKIRDIFFS
QQNGLGAQFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGRAPRSIAFSGPWAQF
RLFGAGQLTNVTSDTFNVRFNVDGGAMVYQVHVDTEDNPFTGGLFSLFRLPDTLY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec30 AAQ96724.1 Aec30 Not tested AGI-1 Protein 0.0 100
aec30 YP_851415.1 hypothetical protein Not tested PAI II APEC-O1 Protein 0.0 99
pmt1 AAN64194.1 Pmt1 Not tested macrophage toxin pathogenicity island Protein 0.0 56