Gene Information

Name : Rahaq_4506 (Rahaq_4506)
Accession : YP_004215216.1
Strain :
Genome accession: NC_015062
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG4733
EC number : -
Position : 73523 - 76708 bp
Length : 3186 bp
Strand : -
Note : PFAM: protein of unknown function DUF1983; Fibronectin type III domain protein; KEGG: ent:Ent638_2219 fibronectin, type III domain-containing protein
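The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the listed length covers the stop codon; both assumptions are consistent with the values in this record, but the record does not state them. The function names are hypothetical helpers, not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = feature_length(73523, 76708)
print(length_bp)                  # 3186, matching the Length field
print(protein_length(length_bp))  # 1061 residues expected in the protein
```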

DNA sequence :
ATGGCAACCGCAACCAAAATAAAAGGCCGCAAAGGTGGCAGTTCTTCATCACGCACGCCCGTAGAACAGCCAGATGATCT
TCAATCCATCGCGAAGGCAAAACTGCTTATCGCTTTGGGCGAGGGGGAATTTGGCGGCGGCCTGACTGGGCAATCAATTT
ATCTGGATGGAACGCCGCTGCTTAACAGTGACGGTTCAAGCAATTTCAGCGGGGTGGCGTGGGAGTTCCGCGCCGGGACG
CAGGCGCAATCCTACATTCAGGGATTGCCGGGTACTGAGAACGAAATTAGCGTCGGCACTGAGGTGAAAAGCACAGTTGC
CTGGACGCATACTTTTACTAATACCCAACTCTCAGCTATTCGCCTGCGGCTAAAATGGCCGTCTTTATTTAAGCAAGAAG
ATGACGGTGATCTGGTTGGGTATTCGATCAATTACACCATCGAATTGCAAACCGACGGCGGAGCATTTCAGACGGTAATC
AACACAGCAGTTACTGGCAAAACCACGTCAGGATACGAGCGCAGCCATCGCGTTGACCTTCCACCGGCTGGCACCACCTG
GACAATTCGTCTGCGCAAGATTACTGCAGATGCGAACAGCGCAAAGATTGGCGATGCAATGACGATCCAGAGCTACACGG
AAGTCATCGATGCAAAACTGAGGTATCCGAACACCGCACTGCTGTATATCGAATTCGATTCCAGTCAATTCAATGGCTCC
ATCCCGCAAATTTCATGCGAGCCACAGGGCCGCGTTATTCGCGTGCCTGATACTTATGATCCGGTAACACGAACTTATAG
CGGTACATGGACCGGTGCTTTCAAGTGGGCATGGTCAGATAACCCGGCGTGGGTTTTCTATGACCTTGTGGTCACTGACC
GTTTTGGCCTAGGTAACCGGCTTACCGCGGAGAACATCGACAAATGGGAACTTTACCAGGTCGCACAATATTGCGATCAG
ATGGTTCCAGATGGAAAAGGCGGTAACGGCACAGAGCCGCGTTATATCTGCAACGTTTACGTACAAAGTCGAAACGACGC
ATATACGGTTTTGAGAGACTTTGCGGCGATTTTCCGCGGCATGACGTACTGGGGCGGGGATCAGATAGTTGCCTTGGCTG
ACATGCCGCGTGATATCGATTACAGCTATACCCGCGCGAACGTCATCGATGGCCAGTTCAGTTACTCGAGCAGCACGACC
AAGACCCGTTACACAACGGCCCTTGTGTCATGGTCTGATCCGGACAATGCCTATGCTGATGCTATGGAGCCAGTTTTCGA
GCAGGATCTGGTTACGCGCTACGGGTTCAACCAACTTGAACTGACGGCTATTGGCTGTACCCGTCAGTCAGAAGCGAACA
GGAAAGGGCGCTGGGGTATCCTTACGAATAACAAAGACCGGGTGATAACTTTTGGTGTCGGGCTGGATGGCATGATCCCG
CAGCCGGGTTACATCATTGCGGTTGCCGATGAAATGCTGTCGGGAAAAGTGACCGGTGGCCGCATAAGTTCTGTGAGCGG
TCGCGCGATTACCTTGGACCGTGTTCCTGATGCCGCCGCCGGTGGCCGGCTGATTTTAAACCTTCCATCAGGTGCAGCTC
AGTCACGCACAATTCAGTCGGTGTCGGGGAAGGTTGTCACCGTCACCACGGCTTACAGCGAAACACCAGAGTCCGAAAGC
GTCTGGGTAGTTGAGTCAGACGAGCTGTATGCGCAGCAATACCGCGTGCTTAGCGTGGCTGACAACAACGACAATACGTT
CACCATTTCCGCGGCGTATCACGACCCGGATAAATACGCGCGCATCGATACCGGCGCCATCATCGACGAACGCCCGATCA
GTGTTATCCCGCCGGGCAGCCAGTCTGCCCCTGCCAATATTCAGATCGGGTCTTATTCTGTGGTCAATCAGGGGATCAGC
GTTCAGACCATGCGGGCTACGTGGGACGCAACGACAAACGCTATCGCTTATGAGGCGCAGTGGCGCCGTAACGATGGGAA
CTGGGTAAACGTTCCGCGCAGCTCTACCACGTCGTTTGAAGTGCCTGGCATTTATGCTGGTCGTTACCTGGTTCGTGTGC
GAGCGATTAATGTGGCAGAGATATCGAGTGGCTGGGGCTACTCGGTTGAAGTTACCCTGACCGGCAAAGAGGGGAACCCA
CCGAAGCCGGTAGGTTTCACGGCCACCGGCATCAACTGGGGTATTCAGCTGAACTGGGGCTTCCCGGAAAACACCTCGGA
CACGCTGAAAACAGAGATTCAGTACACGCCGAATTCTGATCAGTCTAATCCTCTTCTGTTGTCTGATGCTCCCTATCCAC
AGGCAATTTATACGCAACTGGGGTTAAGGGCCGGTCAGGTATTCTGGTACCGCGCTCAACTGGTGGATAAAACCGGAAAT
GAGTCAGGGTATACCGACTGGATCAGAGGGATGGTGAACGATAATGCAGATGATTACCTGGGCGATATCGCTGATGACTT
CCTGAGTTCTGCCGATGGTGACCGGCTGACGGGTGACATTGAAACCAACATCGATGCCATCCTGCAGAACGCCTTAAACC
TCAATTCGACCATTGATCACCAGTTCGCTCAGAACGGTGAGGTTAGGGCTGATATTCTGACCGTAAAAACTACTGTCGCC
GAGGTTGATCAGGCGATGGCTGATTTGACTACTCAAGTGCAAGCGCAAATCGGAGACGTGACGGCAGCGCTTGAGGACAA
ACTAACGGCCGTTGTTGATGCCAGTGGTGCTTCAGCAATTTACACCCTGAAAACCGGTGTGCGGATCGGTGGAGTGATGT
ACAACGCCGGGATGTCTATCGCTGTGTTGGCGCAGGCAGGCCAGCCGGTGGTGACGCGGGTGGGCTTTAACGCAAACCAG
TTCGTGCTGATGTCGGGATCAGGTGACACGCAGTATTCACCATTTGCCGTGGTGAACGGCCAGGTGTTTATCAGTGATGC
TTTTGTGCAGGACGGGACGATTACCAATGCGAAGATCGGCAACTTCATTCAGTCTAACAACTACGTAGCGGGTGTATCCG
GCTGGCGTTTGGATAAGGGCGGTACTTTCGTGAACTACGGTTCTGGTTCCGGCGGAAAGATGAAGACCACTAACACGACG
ATCAGTGTCGCTGACGCCAGCGGCGTACTGCGAGTCCAAATTGGTGAGCTGACAGGGGTATTCTAA

Protein sequence :
MATATKIKGRKGGSSSSRTPVEQPDDLQSIAKAKLLIALGEGEFGGGLTGQSIYLDGTPLLNSDGSSNFSGVAWEFRAGT
QAQSYIQGLPGTENEISVGTEVKSTVAWTHTFTNTQLSAIRLRLKWPSLFKQEDDGDLVGYSINYTIELQTDGGAFQTVI
NTAVTGKTTSGYERSHRVDLPPAGTTWTIRLRKITADANSAKIGDAMTIQSYTEVIDAKLRYPNTALLYIEFDSSQFNGS
IPQISCEPQGRVIRVPDTYDPVTRTYSGTWTGAFKWAWSDNPAWVFYDLVVTDRFGLGNRLTAENIDKWELYQVAQYCDQ
MVPDGKGGNGTEPRYICNVYVQSRNDAYTVLRDFAAIFRGMTYWGGDQIVALADMPRDIDYSYTRANVIDGQFSYSSSTT
KTRYTTALVSWSDPDNAYADAMEPVFEQDLVTRYGFNQLELTAIGCTRQSEANRKGRWGILTNNKDRVITFGVGLDGMIP
QPGYIIAVADEMLSGKVTGGRISSVSGRAITLDRVPDAAAGGRLILNLPSGAAQSRTIQSVSGKVVTVTTAYSETPESES
VWVVESDELYAQQYRVLSVADNNDNTFTISAAYHDPDKYARIDTGAIIDERPISVIPPGSQSAPANIQIGSYSVVNQGIS
VQTMRATWDATTNAIAYEAQWRRNDGNWVNVPRSSTTSFEVPGIYAGRYLVRVRAINVAEISSGWGYSVEVTLTGKEGNP
PKPVGFTATGINWGIQLNWGFPENTSDTLKTEIQYTPNSDQSNPLLLSDAPYPQAIYTQLGLRAGQVFWYRAQLVDKTGN
ESGYTDWIRGMVNDNADDYLGDIADDFLSSADGDRLTGDIETNIDAILQNALNLNSTIDHQFAQNGEVRADILTVKTTVA
EVDQAMADLTTQVQAQIGDVTAALEDKLTAVVDASGASAIYTLKTGVRIGGVMYNAGMSIAVLAQAGQPVVTRVGFNANQ
FVLMSGSGDTQYSPFAVVNGQVFISDAFVQDGTITNAKIGNFIQSNNYVAGVSGWRLDKGGTFVNYGSGSGGKMKTTNTT
ISVADASGVLRVQIGELTGVF
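
The protein sequence above can be reproduced from the DNA sequence by codon-by-codon translation. The following is a minimal sketch, assuming the DNA is given on the coding strand (it starts with ATG even though the Strand field is "-") and that the standard amino acid assignments apply; `translate` is a hypothetical helper written for illustration, not a function from any library.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the DNA sequence in this record.
first_bases = "ATGGCAACCGCAACCAAAATAAAAGGCCGC"
last_line = "ATCAGTGTCGCTGACGCCAGCGGCGTACTGCGAGTCCAAATTGGTGAGCTGACAGGGGTATTCTAA"

print(translate(first_bases))  # MATATKIKGR
print(translate(last_line))    # ISVADASGVLRVQIGELTGVF
```

Both fragments agree with the start and end of the protein sequence listed in this record.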

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
ESA_01044 | YP_001437149.1 | hypothetical protein | Not tested | Not named | Protein | 0.0 | 65