PAI Gene Information


Name : sulI (EC042_4095)
Accession : YP_006098378.1
PAI name : Tn2411
PAI accession : NC_017626_R1
Strain : Escherichia coli 042
Virulence or Resistance : Not determined
Product : dihydropteroate synthase type-1
Function : -
Note : -
Homologs in the searched genomes : 97 hits (97 protein-level)
Publication :
    -Aslett,M.A., "Direct Submission", Submitted (25-SEP-2009) Aslett M.A., Wellcome Trust Sanger Institute, Pathogen Sequencing Unit, Wellcome Trust Genome Campus, Hinxton, Cambridge, Cambridgeshire. CB10 1SA, UNITED KINGDOM.

    -Chaudhuri,R.R., Sebaihia,M., Hobman,J.L., Webber,M.A., Leyton,D.L., Goldberg,M.D., Cunningham,A.F., Scott-Tucker,A., Ferguson,P.R., Thomas,C.M., Frankel,G., Tang,C.M., Dudley,E.G., Roberts,I.S., Rasko,D.A., Pallen,M.J., Parkhill,J., Nataro,J.P., Thomson,N., "Complete genome sequence and comparative metabolic profiling of the prototypical enteroaggregative Escherichia coli strain 042", PLoS ONE 5 (1), E8801 (2010) PUBMED 20098708 REMARK Publication Status: Online-Only.

    -Chaudhuri,R.R., Sebaihia,M., Hobman,J.L., Webber,M.A., Leyton,D.L., Goldberg,M.D., Cunningham,A.F., Scott-Tucker,A., Ferguson,P.R., Thomas,C.M., Frankel,G., Tang,C.M., Dudley,E.G., Roberts,I.S., Rasko,D.A., Pallen,M.J., Parkhill,J., Nataro,J.P., Thomson,N., "Direct Submission", Submitted (11-APR-2012) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGGTGACGGTGTTCGGCATTCTGAATCTCACCGAGGACTCCTTCTTCGATGAGAGCCGGCGGCTAGACCCCGCCGGCGC
TGTCACCGCGGCGATCGAAATGCTGCGAGTCGGATCAGACGTCGTGGATGTCGGACCGGCCGCCAGCCATCCGGACGCGA
GGCCTGTATCGCCGGCCGATGAGATCAGACGTATTGCGCCGCTCTTAGACGCCCTGTCCGATCAGATGCACCGTGTTTCA
ATCGACAGCTTCCAACCGGAAACCCAGCGCTATGCGCTCAAGCGCGGCGTGGGCTACCTGAACGATATCCAAGGATTTCC
TGACCCTGCGCTCTATCCCGATATTGCTGAGGCGGACTGCAGGCTGGTGGTTATGCACTCAGCGCAGCGGGATGGCATCG
CCACCCGCACCGGTCACCTTCGACCCGAAGACGCGCTCGACGAGATTGTGCGGTTCTTCGAGGCGCGGGTTTCCGCCTTG
CGACGGAGCGGGGTCGCTGCCGACCGGCTCATCCTCGATCCGGGGATGGGATTTTTCTTGAGCCCCGCACCGGAAACATC
GCTGCACGTGCTGTCGAACCTTCAAAAGCTGAAGTCGGCGTTGGGGCTTCCGCTATTGGTCTCGGTGTCGCGGAAATCCT
TCTTGGGCGCCACCGTTGGCCTTCCTGTAAAGGATCTGGGTCCAGCGAGCCTTGCGGCGGAACTTCACGCGATCGGCAAT
GGCGCTGACTACGTCCGCACCCACGCGCCTGGAGATCTGCGAAGCGCAATCACCTTCTCGGAAACCCTCGCGAAATTTCG
CAGTCGCGACGCCAGAGACCGAGGGTTAGATCATGCCTAG

Protein sequence :
MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVS
IDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSAL
RRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGN
GADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA
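
Consistency check (illustrative sketch, not part of the original record) :
The protein sequence above should be the translation of the DNA sequence above. One way to check this is sketched below. It assumes the standard genetic code, whose amino-acid assignments are shared by the bacterial translation table 11. The names CODON_TABLE and translate are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
# Assumption: the record uses these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence block
    can be pasted in as-is. Codons with ambiguous bases become 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the DNA sequence in this record.
print(translate("ATGGTGACGGTGTTCGGC"))  # MVTVFG
print(translate("GATCATGCCTAG"))        # DHA
```

Passing the full DNA sequence block to translate and comparing the result with the protein sequence block (after removing line breaks) gives a quick check that the two fields agree.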