Gene Information

Name : UTI89_C0949 (UTI89_C0949)
Accession : YP_539965.1
Strain : Escherichia coli UTI89
Genome accession: NC_007946
Putative virulence/resistance : Unknown
Product : prophage CP-933T replication protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 938384 - 941206 bp
Length : 2823 bp
Strand : -
Note : -
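
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function name `span_length` is illustrative and not part of the database.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 938384, 941206

def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based inclusive interval (hypothetical helper)."""
    return end - start + 1

length_bp = span_length(start, end)   # should match the Length field
codons = length_bp // 3               # number of complete codons
residues = codons - 1                 # assumes the last codon is a stop codon

print(length_bp, codons, residues)    # 2823 941 940
```

Under these assumptions the gene spans 2823 bp, which agrees with the Length field and corresponds to a 940-residue protein.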

DNA sequence :
ATGACGGCAGAGTACATCAGGGACTGGCAACAACCGCGCCACGCAGTGGGGCGTGAAGGAACGGGGATCCCTGCTCCTGA
ATCCGCGCTTTCCTCCTGGCTGGATGCCTACCGGGTAGAGAACGAGCGCCGCCAGGAAATGGCTGATGCGGCGTTCTCCG
CCACGCCGCTGGGCAACCTGATTAATAAAAGCCTGGACGCACAGGAAAAACAGGACAAAACCATCACACTGGCAGGAGAC
GCCAGAAAACAGGCACGCGGCGCGGTGGATGAAGCCATGGCCTCGCTGCGCCTGCTGCCGTCCTATCTGCGCGATCCGCT
TATTCGCCACCTCTCTTTCCTGCGCAAAAAACAGGAAGCCGATCGTCAGAAAGGCAAAAAGAGCTGGCAGGCGGAACGCT
ACGCGCGCGGAACCCTGCGCAAAATATTCGAACGTCTGGACCGCACCGACAGCCGCTGGCTGACACCGGGTTATCGCTCC
ATTGCCGGACGCGAACGCCTGGACGATTTGCTTTACCTGCCGCAGCTCAACAAACACCAGATACAGACGCTGGCCACCAT
GACGGCGGCGATGTTCAGCAGCACCTTCGAAAAGCTCTGCGATGGCTTTGGTGCGACTGATGGCGAGCTGACCATGGATG
TAACGCTGAAGGCGTATCAGATGCTGGCCCGCATGGCGTTACACCTGCACGCCATGCCTCCGCATTATGACGCACTGACA
ACAGACAAAGACCGGAGGAACGAACCGGACACGGAACTGCTGCCGGGTGCAATCCTTCGCCTGACCTGTGCGGAATGGTG
GAAACGCAAACTGTGGCTTTTACGTTGCGAGTGGAGAGAAGAACAACTTCGCGCCGCCTGTCTGGTTTCCAGAAAAACAT
CACCCTATCTGAGCCAGGACGCATTAAGTGAATTTCGCGCACAGCGCGAGAAAACACGCGATTTCCTGAAAAGTTTCATG
CTGGAAAATGAAGACGGGTTCACGATTGATCTCGAGACAGTGTATTACGCGGGAGTAAGTAACCCGGTTCACCGTAAGGC
AGAAATGATGGCCACCATGAAGGGGCTGGAACTTCTGGCCGAAGCCCGTGGTGACAAAGCGGTGTTTCTGACTGTCACCT
GCCCGTCAAAATACCACGCAACAACGGAGAACGGTCATCCGAATCCCAAATGGAACGGGGCCACCATGCGCGACTCCAGC
GATTACCTGGTTAACACGTTTTTTGCGGCGGTCCGCAAAAAACTGAACCGCGACGGCCTGCGCTGGTATGGCATCCGCAC
GGTGGAGCCTCACCATGACGGCACCGTGCACTGGCATATGATGGTCTTTGCTCATCCGGAAGAAATCGACAGCATCGTGG
CCATCACCCGCGATATTGCCATTCAGGAAGATCGTCACGAACTGGGCGATGATATTACTCCGCGCTTTAAGGCGGAGTAT
GTCGACGGCTCAAAAGGCACACCAACCAGCTATATCGCGACCTACATCGGAAAAAACCTGGACAGCCGCGCCGTGGATGG
CATCGACCCGAAAACGGGCAAGCCACGCGTTGACCACGAAACCGGAAAATCAATGGCCGAGAGCGTGGAGCGCGCCATCG
GCTGGGCGCGCCTTCACCGGGTCCGCCAGTTCCAGTTCTTTGGCATCCCCTCCCGTCAGGTGTGGCGTGAACTGCGCCGC
CTTGCCAGCCAGATGGCACGCAACCCGGAAGGCCCGCAACAGCTGAAGGATGACGCAATGGATGCGGTACTCGCTGCCGC
TGATGCCGGGTGTTTTGCCACCTACATTGAAAAACAGGGTGGCGTGCTTGTTCCACGCAAAGACTACCTGATTCGCACCG
CCTACGACTTCGCAGAAGAGCTGAACGATTACGGCGAACAGAGCGTACAGATTTATGGGATCTGGTCACCACTCATCGGG
GAATCCTCCCGTGTGTGCACGCATCCGGATAACTGGAAGCTGGTAAGACGCAAACCGGAAGCGGAAGACAGCGCCCGCGA
AAATGGTTTTGACCTTCAGGGCGGCCCTGCCGCCCCTTGGACTCGTGGCAATAACTGTCCCCGTGTACAGGAAACGGACA
ACAACGGGACAGAACAGCCGGAAGAACGGCCAGCACCGTGGCCGCAGCTCCCTGACGGCGTTGAAGTGAACGAATGGATG
CGCTCACTGAAACGGCACGAACGCCGGGCGCTGATGCGTTCGCTTCGTGACAAACAGGCAAAAAACAGCAGTGATGAAAT
GCAGAGCTGGACACAGAGCCGCAAACAGCAGCGGCCTTTGCCTGATAACCACGAATTACTCGCTAAAGAATGGCGGGAGT
CTGCTGAATCTCTCGGCCTGCATATCGGTGAACAACAGATGCAGCACCTGTTACGGGGCGGCAGTCTGTACGTTGACGGC
AGCATCATTGCACCGCAGGGGTTTGAAATTGTACGCAAACCGGATACCCGCCCGGACAGCCGAATCACGCAGCTCTGGCA
GCGCCTGAGCCGTAATCACGGCGTAAGCAGCACGGAGATCCGCCATAACCCGGTCGCCAGCTATCTGGCGCAACTGGGGG
CATCAGACCCTGAAGCCGCCGCACGCCTGGCATCCACACTTCAGCAGGACCAGAACACCATGAAAACACCCGTTACCGTG
CTTTCTGACATGCTGCGCGCCATCCGCGACACAGAGCACGCACAGAGAATCAGTGAAACCACTGAACGCGCCCGCCGCAA
AGCAGACCTGCTGCGGGGTGGCCTGACCATTGGAAACAAAAAACAGACAGAAACGGGATTCACAAATCCCGTAAATGAGC
AAAAAACGCGCCGCGATATATGA

Protein sequence :
MTAEYIRDWQQPRHAVGREGTGIPAPESALSSWLDAYRVENERRQEMADAAFSATPLGNLINKSLDAQEKQDKTITLAGD
ARKQARGAVDEAMASLRLLPSYLRDPLIRHLSFLRKKQEADRQKGKKSWQAERYARGTLRKIFERLDRTDSRWLTPGYRS
IAGRERLDDLLYLPQLNKHQIQTLATMTAAMFSSTFEKLCDGFGATDGELTMDVTLKAYQMLARMALHLHAMPPHYDALT
TDKDRRNEPDTELLPGAILRLTCAEWWKRKLWLLRCEWREEQLRAACLVSRKTSPYLSQDALSEFRAQREKTRDFLKSFM
LENEDGFTIDLETVYYAGVSNPVHRKAEMMATMKGLELLAEARGDKAVFLTVTCPSKYHATTENGHPNPKWNGATMRDSS
DYLVNTFFAAVRKKLNRDGLRWYGIRTVEPHHDGTVHWHMMVFAHPEEIDSIVAITRDIAIQEDRHELGDDITPRFKAEY
VDGSKGTPTSYIATYIGKNLDSRAVDGIDPKTGKPRVDHETGKSMAESVERAIGWARLHRVRQFQFFGIPSRQVWRELRR
LASQMARNPEGPQQLKDDAMDAVLAAADAGCFATYIEKQGGVLVPRKDYLIRTAYDFAEELNDYGEQSVQIYGIWSPLIG
ESSRVCTHPDNWKLVRRKPEAEDSARENGFDLQGGPAAPWTRGNNCPRVQETDNNGTEQPEERPAPWPQLPDGVEVNEWM
RSLKRHERRALMRSLRDKQAKNSSDEMQSWTQSRKQQRPLPDNHELLAKEWRESAESLGLHIGEQQMQHLLRGGSLYVDG
SIIAPQGFEIVRKPDTRPDSRITQLWQRLSRNHGVSSTEIRHNPVASYLAQLGASDPEAAARLASTLQQDQNTMKTPVTV
LSDMLRAIRDTEHAQRISETTERARRKADLLRGGLTIGNKKQTETGFTNPVNEQKTRRDI
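
The protein sequence should correspond to the translation of the DNA sequence listed above it. A minimal sketch of that check follows, assuming the standard genetic code for codon-to-residue assignments and that the DNA is given in the coding orientation; `translate` and `CODON_TABLE` are illustrative names, and only the first 24 and last 21 bases of the gene are used here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate in-frame DNA, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":          # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)

# First 24 bases of the gene (start of the DNA sequence).
print(translate("ATGACGGCAGAGTACATCAGGGAC"))   # MTAEYIRD
# Last 21 bases of the gene, which are in frame and end with TGA.
print(translate("AAAACGCGCCGCGATATATGA"))      # KTRRDI
```

Both fragments agree with the start (MTAEYIRD) and end (KTRRDI) of the listed protein sequence.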

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
STY4635 | NP_458715.1 | conserved hypothetical protein | Not tested | SPI-7 | Protein | 2e-137 | 42
t4328 | NP_807922.1 | hypothetical protein | Not tested | SPI-7 | Protein | 1e-137 | 42