Name : ORF_2 (ORF_2)
Accession : AAZ04413.1
PAI name : PAI I APEC-O1
PAI accession : DQ095216
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : superfamily I DNA helicase
Function : -
Note : similar to GenBank Accession Number CAD42018
Homologs in the searched genomes : 42 hits ( 41 protein-level, 1 DNA-level )
Publication :
-Kariyawasam,S., Johnson,T.J. and Nolan,L.K., "The pap Operon of Avian Pathogenic Escherichia coli Strain O1:K1 Is Located on a Novel Pathogenicity Island", Infect. Immun. 74 (1), 744-749 (2006) PUBMED 16369033.
-Kariyawasam,S., Johnson,T.J. and Nolan,L.K., "Direct Submission", Submitted (15-JUN-2005) Veterinary Microbiology and Preventive Medicine, Iowa State University, 1802 Elwood Drive, VMRI 2, Ames, IA 50011, USA.
DNA sequence : | |
ATGGGACTGGTAATGGATGAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAA
GGGCAGTTTTGAACGGAAAGACGCCAAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATTG
TCAGTAAATTTTTTGAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCGTCTTGCGCCCAAAAGTTTATTTCCGGTTA
CTGCAGCATGGTAAGGACCGTTCCGCAGGCGCACCTGATATTGTTACCCCGCTAGTGACGCCAGCCTTGCTAAGCCGTGA
GGGTTTTTTATATCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTG
GTGAGATTGGGCAGTATGACAAATACAAGACAATCCATACCTCGTTCTCTATCAACTTTGATGACAGCATTGATAAGACT
GCCGAAACGGATGAAGAACGGGAAGCACGATATGCAGCCTTGCAGCAGGAGTGGCGTCAATATCTGGATGATTCAGAGAG
GCTGCTGAAGAACGTTGCCGGCGACTGGATTAAAAATCCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAA
CGGCGCAATCTGGTGGTGCCAGTTTCCATATCCTTTCGCTTTATGATCACCTGCTTGTTTGCAAGAAGGATGTGCCGCTC
TTCAATCGCTTCGCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCAGGAGCAAAATTCAGCGACAGGCTTGG
ACACTCCGGAGATAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCAAGACATGGCGATA
TCCTTGCTGTTAATGGACCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCG
GCTCTCGAAAAATCTGAGCCTCCGGTTATTATCGCGACTTCAACGAACAACCAGGCTGTAACGAACATTATCGAAGCGTT
CGGGAAAGATTTTTCACAGGGCACTGGTGCAATGGCCGGACGATGGTTACCTGAGCTGAAAAGCTTCGGCGCTTATTTTC
CCTCAAGCACTCGTAAAGCTGAGGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTAT
GTAGAGGATGCACTGCTGTTTTATCTCGAGAAAGCTAAGGCAGCTTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGT
CATTGAACTCCTGCATGGTCAGTTGGTAGCAAAATCCGAGCAATTGAAAAGACTGAACGCAACATGGCAAACGTTAAGCC
AGGTACGGGCTGCGCGTGAGCTTATTGCTAATGATATTGAGCAATATCTCGATAATTTAAATAAATTACTTTCCGGACAA
GAACAAAAAGTCACTCTACTGAAGAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATT
ATTTTCCTGGCTCCCAGCGGTTCGTAGTAAGCGACAGTACCAAATACAGCTGTTTCTCGAAGATAAATTAGGCGCGCTGA
TTGCAGGAAATCAGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAA
ACAACATACCGGCAGCAGATTGACTCCGCCCATGAAATCGTTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGGCT
GGCATTTGATTTAGGGTATGAGGGCGACGAGGAACTGAGCTTCTCACAGGCCGATGAACTGGCTGATACGCAGATTCGCT
TCCCTGCATTTTTACTGACGACTCACTACTGGGAAGGTCGTTGGCTGATGGATATGGCCAGAATTGATGATCTGCAGGAA
GAGAAGAAGAAAAAAGGCGCTAAAGGGGTAACCGCCCGTTGGCAACGTCGAATGAAACTCACTCCATGTGTGGTGATGAC
ATGCTATATGCTGCCCGGCAATATGCAGATAAGTGAGCACAAAGGACAACGTAAATTCGAGAAAAGTTATTTGTATGATT
TTGCCGATTTACTCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCATTAGCTAAGAAG
GCATTAGTGATTGGCGATACGGAGCAGATCCCGCCAATATGGAGTATTGCTCCCGCGATTGATGTCGGTAACATGCTTGC
GGAAAAAATTCTGTCTGGCAGTACGCAAGAAGAGATTACCGAGAAATATACGGCAATCGCAGACCTTGGTAAAAGTGCCG
CATCTGGCAGCGTTATGAAAATAGCGCAGTTTGCTTCGCGCTATCAATATGATCCCGAACTGGCTCGTGGTATGTACCTA
TATGAACACCGCCGGTGCTTCGATAATATTATTGGATACTGTAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAG
AGGGCGTGAAGAGAGCAATTTAATGCCCGCAATGGGTTATCTCCATATTGATGGTAAAGGAGAGCAGGCAAGTAGTGGAA
GTAGATATAATTTGCTTGAGGCTGAAACGATAGCGGCCTGGTTGGCAGAGAACCAGCAAAATATTGAAGCGCATTACGGC
AAATCGCTTCATGAAGTTGTCGGTATCGTGACGCCTTTTAGCGCGCAGGTATCCACTATCAAACAGGCGCTGGGTAAACA
AGGTATCAGTACGGGCGCGAATGAAAAGTCGCTCACAGTGGGCACCGTGCACTCTCTTCAGGGAGCGGAAAGAGCGATTG
TTATATTCTCGCCAGTCTATTCAAAACATGAAGACGGCGGGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCAGTC
TCCCGTGCGAAGGATAGCTTCCTGGTCTTCGGCGATATGGATCTGTTTGAGGTCCAGCCAGCCTCATCTCCACGGGGATT
ACTGGCAAAATACCTCTTTGAGTCAGAGAAGAATGCGCTCTCTTTTGATTATAAAGAGCGTAAGGATTTAAAAACTTCCG
AGACCAAAATCTACACACTTCATGGTGTGGAGCAGCATGATAACTTCCTGAATCAGACGTTCGAAAATACCGATAAACAC
ATCACGATAGTTTCTCCATGGCTGACCTGGCAAAAGCTGGAGCAAACCGGTTTTCTTGATTCTATGATTGCGGCGTGTTC
ACGTGGTATTAACGTCACGATAGTCACTGACAGAAGCTACAACACTGAACATAAAGATTTTGAGAAGCGAAAAGAGAAGC
AGCAGAACCTTAAAGCGGCGCTGGAGAAACTGAATGCGCTGGGTATTGCTACAAAGCTGGTAAACCGTGTTCATAGCAAA
ATTGTTATTGGTGATGATGGTTTGCTATGCGTGGGATCGTTCAACTGGTTTAGTGCGACACGGGAAGCGCGATATGAACG
ATACGATACCTCAATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGAGAGGCGTC
AGGTTTAG
|
Protein sequence : | |
MGLVMDENALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDEAIVSKFFEGEKDDVETVDVVLRPKVYFRL
LQHGKDRSAGAPDIVTPLVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTIHTSFSINFDDSIDKT
AETDEEREARYAALQQEWRQYLDDSERLLKNVAGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPL
FNRFASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARA
ALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEY
VEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQLVAKSEQLKRLNATWQTLSQVRAARELIANDIEQYLDNLNKLLSGQ
EQKVTLLKSAKTEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQ
TTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMARIDDLQE
EKKKKGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKK
ALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGSTQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYL
YEHRRCFDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGEQASSGSRYNLLEAETIAAWLAENQQNIEAHYG
KSLHEVVGIVTPFSAQVSTIKQALGKQGISTGANEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNSMLNVAV
SRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTSETKIYTLHGVEQHDNFLNQTFENTDKH
ITIVSPWLTWQKLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHKDFEKRKEKQQNLKAALEKLNALGIATKLVNRVHSK
IVIGDDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV
|
|