Gene Information

Name : UTI89_C0239 (UTI89_C0239)
Accession : YP_539273.1
Strain : Escherichia coli UTI89
Genome accession: NC_007946
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3523
EC number : -
Position : 245295 - 248762 bp
Length : 3468 bp
Strand : -
Note : -
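
The Position and Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the stated length); the helper name `feature_length` is hypothetical and not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 245295, 248762

length_bp = feature_length(start, end)

# A complete coding sequence should be a whole number of codons;
# the final codon is the stop codon and is not part of the protein.
codons, remainder = divmod(length_bp, 3)
protein_length = codons - 1

print(length_bp)       # 3468
print(remainder)       # 0
print(protein_length)  # 1155
```

The result, 3468 bp, matches the Length field, and 1155 residues is the expected protein length if the sequence ends in a single stop codon.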

DNA sequence :
ATGCCGCGGTTTAAAGTTTCCGCCACCTGGCTACTGACGCTGGCATGGATTTTTCTGCTGGTGTGGATCTGGTGGCAGGG
TCCAAAATGGACGCTCTATGAGCAGCACTGGCTGGCTCCGCTGGCAAACCGCTGGCTGGCGACCGCCGTCTGGGGACTTA
TCGCTCTGGTCTGGCTCACCTGGCGGGTGATGAAGCGTCTGCAAAAGCTGGAAAAACAGCAGAAACAGCAGCGGGAGGAA
GAAAAAGATCCGTTGACCGTGGAACTCCACCGCCAGCAGCAATATCTGGATCACTGGCTGCTGCGCCTGCGCCGCCATCT
GGATAACCGCCGTTATCTGTGGCAGTTGCCGTGGTATATGGTCATTGGTCCTGCGGGTAGCGGCAAAAGCACGCTGCTGC
GCGAGGGCTTTCCGTCTGACATTGTTTACACGCCGGAAAGCATCCGGGGTGTGGAATACCACCCGCTGATCACACCGCGA
GTGGGCAACCAGGCGGTAATTTTCGATGTTGACGGCGTACTGACCACTCCCGGCGGGGATGATCTGCTCCGCCGCCGCCT
GCGCGAACACTGGCTGGGCTGGCTGATGCAAACGCGCGCTCGCCAGCCGCTCAACGGTCTTATCCTGACGCTCGATCTTC
CCGATCTGCTGACGGCGGATAAATCCCGCCGTGAGACACTGGTACAAAATTTGCGCCAGCAACTTCAGGAGATCCGTCAG
AGCCTGCACTGCCGTCTGCCCGTTTACGTGGTGCTGACACGGCTGGATCTGCTGAACGGCTTTGCCGCGCTGTTCCATTC
ACTGGATAAAAAAGACCGCGATGCGATCCTCGGCGTCACATTTACCCGCCGCGCCCATGAAAGTGACGGCTGGCGCAGCG
AACTGGGGGCTTTCTGGCAGACGTGGGTACAACAGGTGAACCTGGCGCTGTCGGATCTGGTGCTCGCACAAACCGGTGCT
GCTCCCCGCAGCGCTGTGTTCAGCTTCTCCCGTCAGATGCAGGGAACAGGAGAAATCGTCACCGCACTGCTCGCCGCATT
GCTGGACGGTGAGAACATGGATGTAATGCTGCGTGGCGTCTGGCTCACATCCTCGCTACAGCGTGGCCAGGTGGATGATA
TTTTCACGCAGTCCGCCGCCCGCCAGTACGGACTGGGTAACAGCTCGCTGGCAACCTGGCCTCTGGTGGAGACGACGCCG
TATTTTACTCGCCGCCTCTTCCCGGAAGTCCTGCTGGCTGAGCCGAACCTGGCGGGTGAAAACAGCGTCTGGCTGAACAG
CTCCCGGCGCAGGCTGACCGCCTTTTCCACCTGTGGCGCGGCACTGGCGGCATTGATGGTCGGAAGCTGGCACCATTATT
ACAATCAGAACTGGCAGTCTGGCGTTAACGTACTGGCACAAGCTAAAGCCTTTATGGACGTACCACCACCGCAGGGAACG
GATGAATTCGGCAATCTGCAATTGCCATTGCTTAACCCGGTACGCGATGCCACCCTGGCCTATGGTGATTATCGCGATCA
CGGTTTTCTGGCGGATATGGGATTGTACCAGGGCGCCCGCGTAGGGCCGTATGTGGAGCAAACCTACATTCAGCTTCTTG
AGCAGCGTTATCTCCCCTCGTTAATGAACGGCCTGATCCGGGATCTAAACATTGCCCCGCCAGAGAGCGAAGAAAAGCTC
GCTGTGCTGCGCGTAGTGCGCATGATGGAAGACAAAAGTGGGCGCAACAACGAGGCGGTAAAACAGTACATGGCACGGCG
CTGGAGCAATGAATTTCACGGCCAGCGCGATATTCAGGCGCAACTGATGGTGCATCTGGACTATGCGCTGGAGCACACCG
ACTGGCACGCGCAGCGCCAAAGCAGCGACAGCGATGCTGTCAGCCGCTGGACCCCCTATGATAAACCGATCATTAATGCG
CAGCAGGAACTGAGCAAGCTGCCCATATACCAGCGTGTCTACCAGACCCTGCGCACCAAAGCATTAAGCGTGTTGCCCGC
CGATTTGAATTTGCGCGACCAGGTTGGTCCCACCTTCGACAACGTGTTCGTCGCCGGTAATGATGAAAAACTGGTGATCC
CGCAGTTCCTCACCCGCTATGGACTGCAAAGCTATTTTGTCAAACAGCGTGAGGGCCTCGTTGAGCTGACCGCGCTGGAT
TCGTGGGTACTGAACCTGACGCAAAGCGTCGCCTACAGCGAGGCCGACCGTGAAGAGATCCAGCGCCATATCACCGAACA
GTACATCAGTGACTATACCGCCACCTGGCGTGCCGGAATGGATAACCTCAACGTCCGTGACTATGAGGCCATGTCGGCGC
TGACCGACGCGCTGGAGCAGATTATCAGCGGCGATCAGCCATTCCAGCGTGCGCTGACGGCGCTGCGCGATAATACCCAC
GCGCTGACGCTCTCCGGCAAACTGGATGATAAGGCGAGGGAAGCGGCGATAAATGAGATGGATTACCGCCTGTTATCCCG
GCTGGGGCATGAGTTCGCACCGGAAAACAGCGCACTGGAGGAGCAAAAGGACAAGGCGAGTACGCTACAGGCCGTGTACC
AGCAACTGACCGAGCTGCACCGTTACCTGCTGGCGATCCAGAACTCGCCAGTGCCGGGGAAATCGGCGCTGAAAGCAGTA
CAGCTACGGCTGGATCAAAACAGCAGCGATCCAATCTTCGCCACCCGTCAGATGGCAAAAACCCTGCCTGCGCCTCTTAA
CCGCTGGGTAGGTAAGCTCGCGGATCAGGCCTGGCATGTGGTGATGGTGGAAGCCGTTCGTTACATGGAAGTGGACTGGC
GCGACAATGTAGTGAAACCCTTCAACGAGCAGCTTGCCGATAACTATCCGTTTAATCCGCGCGCCACACAGGATGCCTCA
CTGGATTCGTTTGAACGTTTCTTTAAACCGGATGGCATTCTGGACAATTTCTACAAGAACAACCTGCGCCTGTTCCTTGA
AAACGATCTGACCTTTGGCGACGACGGCAGAGTGTTAATCCGTGAAGATATCCGGCAGCAACTGGATACCGCGCAGAAAA
TCCGCGACATCTTCTTCAGCCAGCAGAACGGGCTGGGCGCACAGTTTGCCGTGGAAACCGTATCGCTTTCCGGCAATAAG
CGGCGCAGCGTACTTAACCTGGACGGCCAGTTAGTGGACTACAGCCAGGGACGCAACTACACCGCCCATCTGGTCTGGCC
GAACAACATGCGTGAAGGCAATGAAAGCAAGCTGACGCTGATTGGCACCAGCGGCAGAGCACCGCGCAGTATCGCGTTCA
GTGGACCGTGGGCGCAGTTCCGCCTGTTCGGCGCGGGCCAGTTGACCAATGTGACCAGTGACACCTTTAACGTGCGCTTT
AACGTGGACGGCGGCGCAATGGTTTACCAGGTGCATGTGGATACCGAAGATAACCCGTTCACCGGCGGTCTGTTCAGCCT
GTTCCGTTTACCGGATACGTTGTATTAA

Protein sequence :
MPRFKVSATWLLTLAWIFLLVWIWWQGPKWTLYEQHWLAPLANRWLATAVWGLIALVWLTWRVMKRLQKLEKQQKQQREE
EKDPLTVELHRQQQYLDHWLLRLRRHLDNRRYLWQLPWYMVIGPAGSGKSTLLREGFPSDIVYTPESIRGVEYHPLITPR
VGNQAVIFDVDGVLTTPGGDDLLRRRLREHWLGWLMQTRARQPLNGLILTLDLPDLLTADKSRRETLVQNLRQQLQEIRQ
SLHCRLPVYVVLTRLDLLNGFAALFHSLDKKDRDAILGVTFTRRAHESDGWRSELGAFWQTWVQQVNLALSDLVLAQTGA
APRSAVFSFSRQMQGTGEIVTALLAALLDGENMDVMLRGVWLTSSLQRGQVDDIFTQSAARQYGLGNSSLATWPLVETTP
YFTRRLFPEVLLAEPNLAGENSVWLNSSRRRLTAFSTCGAALAALMVGSWHHYYNQNWQSGVNVLAQAKAFMDVPPPQGT
DEFGNLQLPLLNPVRDATLAYGDYRDHGFLADMGLYQGARVGPYVEQTYIQLLEQRYLPSLMNGLIRDLNIAPPESEEKL
AVLRVVRMMEDKSGRNNEAVKQYMARRWSNEFHGQRDIQAQLMVHLDYALEHTDWHAQRQSSDSDAVSRWTPYDKPIINA
QQELSKLPIYQRVYQTLRTKALSVLPADLNLRDQVGPTFDNVFVAGNDEKLVIPQFLTRYGLQSYFVKQREGLVELTALD
SWVLNLTQSVAYSEADREEIQRHITEQYISDYTATWRAGMDNLNVRDYEAMSALTDALEQIISGDQPFQRALTALRDNTH
ALTLSGKLDDKAREAAINEMDYRLLSRLGHEFAPENSALEEQKDKASTLQAVYQQLTELHRYLLAIQNSPVPGKSALKAV
QLRLDQNSSDPIFATRQMAKTLPAPLNRWVGKLADQAWHVVMVEAVRYMEVDWRDNVVKPFNEQLADNYPFNPRATQDAS
LDSFERFFKPDGILDNFYKNNLRLFLENDLTFGDDGRVLIREDIRQQLDTAQKIRDIFFSQQNGLGAQFAVETVSLSGNK
RRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGRAPRSIAFSGPWAQFRLFGAGQLTNVTSDTFNVRF
NVDGGAMVYQVHVDTEDNPFTGGLFSLFRLPDTLY
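
The protein sequence above corresponds to a codon-by-codon translation of the DNA sequence, which is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand). The following is a minimal sketch of that translation, assuming the standard codon assignments, which bacterial translation table 11 shares with table 1; the function name `translate` is hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-TCAG characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last six codons of the DNA sequence in this record.
print(translate("ATGCCGCGGTTTAAAGTTTCC"))  # MPRFKVS
print(translate("CCGGATACGTTGTATTAA"))     # PDTLY
```

Both fragments agree with the start (MPRFKVS) and end (PDTLY) of the listed protein sequence. Passing the full 3468 bp sequence as one string should reproduce the whole protein under the same assumptions.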

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
aec30 | AAQ96724.1 | Aec30 | Not tested | AGI-1 | Protein | 0.0 | 100
aec30 | YP_851415.1 | hypothetical protein | Not tested | PAI II APEC-O1 | Protein | 0.0 | 99
pmt1 | AAN64194.1 | Pmt1 | Not tested | macrophage toxin pathogenicity island | Protein | 0.0 | 56