Gene Information

Name : ECOK1_0223 (ECOK1_0223)
Accession : YP_006099470.1
Strain : Escherichia coli IHE3034
Genome accession: NC_017628
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 245211 - 248678 bp
Length : 3468 bp
Strand : -
Note : identified by similarity to GB:AAQ96724.1; match to protein family HMM PF06744; match to protein family HMM PF06761; match to protein family HMM TIGR03348

DNA sequence :
ATGCCGCGGTTTAAAGTTTCCGCCACCTGGCTACTGACGCTGGCATGGATTTTTCTGCTGGTGTGGATCTGGTGGCAGGG
TCCAAAATGGACGCTCTATGAGCAGCACTGGCTGGCTCCGCTGGCAAACCGCTGGCTGGCGACCGCCGTCTGGGGACTTA
TCGCTCTGGTCTGGCTCACCTGGCGGGTGATGAAGCGTCTGCAAAAGCTGGAAAAACAGCAGAAACAGCAGCGGGAGGAA
GAAAAAGATCCGTTGACCGTGGAACTCCACCGCCAGCAGCAATATCTGGATCACTGGCTGCTGCGCCTGCGCCGCCATCT
GGATAACCGCCGTTATCTGTGGCAGTTGCCGTGGTATATGGTCATTGGTCCTGCGGGTAGCGGCAAAAGCACGCTGCTGC
GCGAGGGCTTTCCGTCTGACATTGTTTACACGCCGGAAAGCATCCGGGGTGTGGAATACCACCCGCTGATCACACCGCGA
GTGGGCAACCAGGCGGTAATTTTCGATGTTGACGGCGTACTGACCACTCCCGGCGGGGATGATCTGCTCCGCCGCCGCCT
GCGCGAACACTGGCTGGGCTGGCTGATGCAAACGCGCGCTCGCCAGCCGCTCAACGGTCTTATCCTGACGCTCGATCTTC
CCGATCTGCTGACGGCGGATAAATCCCGCCGTGAGACACTGGTACAAAATTTGCGCCAGCAACTTCAGGAGATCCGTCAG
AGCCTGCACTGCCGTCTGCCCGTTTACGTGGTGCTGACACGGCTGGATCTGCTGAACGGCTTTGCCGCGCTGTTCCATTC
ACTGGATAAAAAAGACCGCGATGCGATCCTCGGCGTCACATTTACCCGCCGCGCCCATGAAAGTGACGGCTGGCGCAGCG
AACTGGGGGCTTTCTGGCAGACGTGGGTACAACAGGTGAACCTGGCGCTGTCGGATCTGGTGCTCGCACAAACCGGTGCT
GCTCCCCGCAGCGCTGTGTTCAGCTTCTCCCGTCAGATGCAGGGAACAGGAGAAATCGTCACCGCACTGCTCGCCGCATT
GCTGGACGGTGAGAACATGGATGTAATGCTGCGTGGCGTCTGGCTCACATCCTCGCTACAGCGTGGCCAGGTGGATGATA
TTTTCACGCAGTCCGCCGCCCGCCAGTACGGACTGGGTAACAGCTCGCTGGCAACCTGGCCTCTGGTGGAGACGACGCCG
TATTTTACTCGCCGCCTCTTCCCGGAAGTCCTGCTGGCTGAGCCGAACCTGGCGGGTGAAAACAGCGTCTGGCTGAACAG
CTCCCGGCGCAGGCTGACCGCCTTTTCCACCTGTGGCGCGGCACTGGCGGCATTGATGGTCGGAAGCTGGCACCATTATT
ACAATCAGAACTGGCAGTCTGGCGTTAACGTACTGGCACAAGCTAAAGCCTTTATGGACGTACCACCACCGCAGGGAACG
GATGAATTCGGCAATCTGCAATTGCCATTGCTTAACCCGGTACGCGATGCCACCCTGGCCTATGGTGATTATCGCGATCA
CGGTTTTCTGGCGGATATGGGATTGTACCAGGGCGCCCGCGTAGGGCCGTATGTGGAGCAAACCTACATTCAGCTTCTTG
AGCAGCGTTATCTCCCCTCGTTAATGAACGGCCTGATCCGGGATCTAAACATTGCCCCGCCAGAGAGCGAAGAAAAGCTC
GCTGTGCTGCGCGTAGTGCGCATGATGGAAGACAAAAGTGGGCGCAACAACGAGGCGGTAAAACAGTACATGGCACGGCG
CTGGAGCAATGAATTTCACGGCCAGCGCGATATTCAGGCGCAACTGATGGTGCATCTGGACTATGCGCTGGAGCACACCG
ACTGGCACGCGCAGCGCCAAAGCAGCGACAGCGATGCTGTCAGCCGCTGGACCCCCTATGATAAACCGATCATTAATGCG
CAGCAGGAACTGAGCAAGCTGCCCATATACCAGCGTGTCTACCAGACCCTGCGCACCAAAGCATTAAGCGTGTTGCCCGC
CGATTTGAATTTGCGCGACCAGGTTGGTCCCACCTTCGACAACGTGTTCGTCGCCGGTAATGATGAAAAACTGGTGATCC
CGCAGTTCCTCACCCGCTATGGACTGCAAAGCTATTTTGTCAAACAGCGTGAGGGCCTCGTTGAGCTGACCGCGCTGGAT
TCGTGGGTACTGAACCTGACGCAAAGCGTCGCCTACAGCGAGGCCGACCGTGAAGAGATCCAGCGCCATATCACCGAACA
GTACATCAGTGACTATACCGCCACCTGGCGTGCCGGAATGGATAACCTCAACGTCCGTGACTATGAGGCCATGTCGGCGC
TGACCGACGCGCTGGAGCAGATTATCAGCGGCGATCAGCCATTCCAGCGTGCGCTGACGGCGCTGCGCGATAATACCCAC
GCGCTGACGCTCTCCGGCAAACTGGATGATAAGGCGAGGGAAGCGGCGATAAATGAGATGGATTACCGCCTGTTATCCCG
GCTGGGGCATGAGTTCGCACCGGAAAACAGCGCACTGGAGGAGCAAAAGGACAAGGCGAGTACGCTACAGGCCGTGTACC
AGCAACTGACCGAGCTGCACCGTTACCTGCTGGCGATCCAGAACTCGCCAGTGCCGGGGAAATCGGCGCTGAAAGCAGTA
CAGCTACGGCTGGATCAAAACAGCAGCGATCCAATCTTCGCCACCCGTCAGATGGCAAAAACCCTGCCTGCGCCTCTTAA
CCGCTGGGTAGGTAAGCTCGCGGATCAGGCCTGGCATGTGGTGATGGTGGAAGCCGTTCGTTACATGGAAGTGGACTGGC
GCGACAATGTAGTGAAACCCTTCAACGAGCAGCTTGCCGATAACTATCCGTTTAATCCGCGCGCCACACAGGATGCCTCA
CTGGATTCGTTTGAACGTTTCTTTAAACCGGATGGCATTCTGGACAATTTCTACAAGAACAACCTGCGCCTGTTCCTTGA
AAACGATCTGACCTTTGGCGACGACGGCAGAGTGTTAATCCGTGAAGATATCCGGCAGCAACTGGATACCGCGCAGAAAA
TCCGCGACATCTTCTTCAGCCAGCAGAACGGGCTGGGCGCACAGTTTGCCGTGGAAACCGTATCGCTTTCCGGCAATAAG
CGGCGCAGCGTACTTAACCTGGACGGCCAGTTAGTGGACTACAGCCAGGGACGCAACTACACCGCCCATCTGGTCTGGCC
GAACAACATGCGTGAAGGCAATGAAAGCAAGCTGACGCTGATTGGCACCAGCGGCAGAGCACCGCGCAGTATCGCGTTCA
GTGGACCGTGGGCGCAGTTCCGCCTGTTCGGCGCGGGCCAGTTGACCAATGTGACCAGTGACACCTTTAACGTGCGCTTT
AACGTGGACGGCGGCGCAATGGTTTACCAGGTGCATGTGGATACCGAAGATAACCCGTTCACCGGCGGTCTGTTCAGCCT
GTTCCGTTTACCGGATACGTTGTATTAA

Protein sequence :
MPRFKVSATWLLTLAWIFLLVWIWWQGPKWTLYEQHWLAPLANRWLATAVWGLIALVWLTWRVMKRLQKLEKQQKQQREE
EKDPLTVELHRQQQYLDHWLLRLRRHLDNRRYLWQLPWYMVIGPAGSGKSTLLREGFPSDIVYTPESIRGVEYHPLITPR
VGNQAVIFDVDGVLTTPGGDDLLRRRLREHWLGWLMQTRARQPLNGLILTLDLPDLLTADKSRRETLVQNLRQQLQEIRQ
SLHCRLPVYVVLTRLDLLNGFAALFHSLDKKDRDAILGVTFTRRAHESDGWRSELGAFWQTWVQQVNLALSDLVLAQTGA
APRSAVFSFSRQMQGTGEIVTALLAALLDGENMDVMLRGVWLTSSLQRGQVDDIFTQSAARQYGLGNSSLATWPLVETTP
YFTRRLFPEVLLAEPNLAGENSVWLNSSRRRLTAFSTCGAALAALMVGSWHHYYNQNWQSGVNVLAQAKAFMDVPPPQGT
DEFGNLQLPLLNPVRDATLAYGDYRDHGFLADMGLYQGARVGPYVEQTYIQLLEQRYLPSLMNGLIRDLNIAPPESEEKL
AVLRVVRMMEDKSGRNNEAVKQYMARRWSNEFHGQRDIQAQLMVHLDYALEHTDWHAQRQSSDSDAVSRWTPYDKPIINA
QQELSKLPIYQRVYQTLRTKALSVLPADLNLRDQVGPTFDNVFVAGNDEKLVIPQFLTRYGLQSYFVKQREGLVELTALD
SWVLNLTQSVAYSEADREEIQRHITEQYISDYTATWRAGMDNLNVRDYEAMSALTDALEQIISGDQPFQRALTALRDNTH
ALTLSGKLDDKAREAAINEMDYRLLSRLGHEFAPENSALEEQKDKASTLQAVYQQLTELHRYLLAIQNSPVPGKSALKAV
QLRLDQNSSDPIFATRQMAKTLPAPLNRWVGKLADQAWHVVMVEAVRYMEVDWRDNVVKPFNEQLADNYPFNPRATQDAS
LDSFERFFKPDGILDNFYKNNLRLFLENDLTFGDDGRVLIREDIRQQLDTAQKIRDIFFSQQNGLGAQFAVETVSLSGNK
RRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGRAPRSIAFSGPWAQFRLFGAGQLTNVTSDTFNVRF
NVDGGAMVYQVHVDTEDNPFTGGLFSLFRLPDTLY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec30 AAQ96724.1 Aec30 Not tested AGI-1 Protein 0.0 100
aec30 YP_851415.1 hypothetical protein Not tested PAI II APEC-O1 Protein 0.0 99
pmt1 AAN64194.1 Pmt1 Not tested macrophage toxin pathogenicity island Protein 0.0 56