Gene Information

Name : YpAngola_A2374 (YpAngola_A2374)
Accession : YP_001606800.1
Strain : Yersinia pestis Angola
Genome accession: NC_010159
Putative virulence/resistance : Unknown
Product : fibronectin type III domain-containing protein
Function : -
COG functional category : S : Function unknown
COG ID : COG4733
EC number : -
Position : 2491614 - 2494817 bp
Length : 3204 bp
Strand : -
Note : identified by match to protein family HMM PF00041; match to protein family HMM PF09327

DNA sequence :
ATGGCACGTAAACCAATTAAAGGCCGCAAAGGTGGGGGCAGCAATGCCACAACGCCAGTTGAGTCACCGGACAGTATTCA
ATCGACGGCAAGAGCTAAAATACTCATTGCTTTGGGTGAGGGGGAGTTCGCCGGAGGTTTGGATGGAACCAATATCTATC
TGGACGGCACACCTATAAAGAACTCTGACGGTACTAGTAATTTCACTGGGGTTACTTGGGAGTATCGTCCCGGCACGCAG
GCTCAGGACTACATTCAAGGAATGCCAAATGTCGAGAATGAGATAACGGTTAACACAGAGCTTAAATCAGATACGCCATG
GGTGCGCTCCATCACAAATACCCAACTCTCGGCTACACGTGTTCGTCTTGGATGGCCCTCATTACAGCGTCAGGCGGACA
ATGGTGATGTTGGCGGTTATCGCATTGAGTACGCCGTCGATGTGGCAACGGATGGTGGCGCATATCAAACACTGCTTGAT
ACAGCCATTGATGGGAAAACAACAACTTTATATGAACGCTCGCACAGAATAAACCTACCCAAGGCCACAGCTGGTTGGCA
GGTTCGTACAAGGCGAAAAACAGCCAATGCCAACTCTGGCCGCATTGCCGACAAGATGAATGTCGAAGCTATTTCTGAAG
TCATCGATGCCAAGTTACGTTACCCAAATACCGCGCTTCTCTATATAGAATTCGACGCAACTCAATTTCAGAATATCCCT
ATTATCTCATGTGAGCCTAAAGGCCGGATTATCCGCGTACCTACTACATATGATCCAGTAACGCGTACCTACTCTGGTGT
GTGGGATGGTTCATTTAAATGGGCTCATACCAACAACCCAGCCTGGGTATTCTACAACATTGTATTAGCAGATCGCTTTG
GCCTTGGTCATCGGATTGAGGTCAGCCAGGTAGATAAGTGGGAGCTGTACCGAATTGGTCAATACTGCGATCAGCTTATT
CCTGATGGTCGGGGCGGTAGTGGTACTGAGCCTCGTTTTACCTGCGATGTGTATATTCAGTCTCAGGCCGAGGCATTTAC
TGTATTGCGTGATTTGGCCGCCATTTTTCGGGGCATGACCTATTGGGGAAATAATCAGCTTTGCACCCTGGCAGATATGC
CACGAGATGTGGACTATATATTTACCCGTGCCAGTGTGATTGACGGACGATTCACTTACGGTGGTGGTTCCGAGAAAAAG
CGCTATACAACCGCAATGGTGAGCTGGAGTGACCCCGCAAATAACTGTCAGGATGCAATCGAGGCAGTGTCAGATAACGA
CTTGGTTCGTCGCTACGGTGTCAATCAGCTTGATATGACGGCTATCGGCTGTATCCGGCAAACTGAGGCGAATAGGCGTG
GACGTTGGGCGCTACTGACAAACAGTAAAGACCGGACTGTTAATTTTAATGTAGGGTTAGACGGGGCCATTCCGTTGCCC
GGTCATATCATTGGTGTTGCGGATGATATGCTCTCTGGTCGGAAGATGGGCGGTCGCATTAGCTCAGTATCGGGCCGGAA
TATCACTCTTGACCGTGTTGCTGATGTGAAAGCAGGTGACCGGCTACTTGTTAACTTACCAAACGGTGTAGCTCAGGGCA
GAACGGTGCAAGTGGTCAACGGGAAAGTAATCACTGTCACAACGGCTTACAGTGAAGTGCCAGCAGCGGAAAGCGGTTGG
TCTGTTGATGCGGATGATTTAGCTATCCAGCAATATCGGGTTACTGGTATTTCTGACAATGACGACAATACATACAGTAT
CTCATCTGTTCAGCATGATCCGGACAAATATGAGCGAATTGATACGGGCGCTCGGATTGATGAAAGACCCATCAGCGTAA
TCCCGCCCGGCGTCCAGCCACCTCCGACAAATGTTGTTATTGATAGCTTCTCAGCACTTTCACAAGGGCTCGCAATAACC
ACCCTACGTGTTACGTGGGAACCAGCAGCCAGCGCGATAGCATACGAGGCTGAATGGCGACGTGATAACGGAAACTGGAT
ATCAGCACCGCGCACATCTGCTCAGGGATTTCAGGTTGAAGGTATTTATGCTGGACAATATCAGGCTCGCGTTCGTGCTA
TTAACCCCTCAGAAATATCCAGTATTTGGGCTAATGCTCAGGAAACCACATTAAACGGTAAAGAGGGAAATCCTCCAATG
CCAGTTGGATTTACAGCTACAGGCATTCTCTTTGGCATCACTCTCAATTGGGGATACCCTGAAGGAGCCGAAGATGCGTT
AAAAACAGAGATTGAATATAGCCTGTCTGCTGACGGCACCGATGCCATGCTGTTGAGTGATGTGCCGCATCCGCAACGGA
ACTACACTATGCAGGGGTTGAGAGCAGGGCAGGTGTTCTGGTTCCGTGCTCGGATAGTTGATAAATCCGGTAATCAGTCG
CCATGGATTGATTGGGTTCGTGGCATGTCCAGCACAGACACAAGCGCTATTCTCGAAGCGATTGGCGACGACTTTATCAA
TAACACAGTTGCGGGTCAGCAACTGATTAATGATGACTTCATGAATGCAGAGGGCATTCTCGAAACAGCGAAGGCCAATA
ACGCCAGCATCTGGCAGCAATGGGCTCAACACGGAGAGAATAAAGCCGGTGTTATCCACTTAACGACCACTGTTGCCGAT
GCTGAAAGAGCATTTGCTGAGTTTGAAACCCTTGTTACAGCAACATTTGAAGACCAGACAGCAGCGATAGACCAAAAAAT
GACAGCAGTTGTTGATGCCAACGGGGCTAGTGCTACTTATAGTTTAAGGGCCGGACTGAATTATAACGGCCAGTTTGTCA
GCGCAGGCATGGTAATTGGTGCAGAGTTTATTAATGGTGTAGCTAAATCCTCAATTGGTTTTACTGCCGATCAATTTATA
TTGCTCTCCGGTCCAACTGGTAATTTATTTTCGCCTTTTGCAGTGGTAAATGGTCAAGTGTTTATGAATGATGCATTTAT
TGCAAAGGCATCAATTGGGCGAGGAAAAATAACAGATACCCTTGACTCAGATAATTACGTGCAAGGAATATCCGGTCTAA
AACTGGATTTTAAAAATGGTAATGCTGAATTTAACAATGTAAATCTCAGGGGGAATATAACTATGGATAACACGATTAAT
GGTATTCGCACCATAGTAGATTATCGTGGGCAGAGGACATATCACGCAAATGGTCAGCCAGCGATAATATGCGGGTACTT
CTAA

Protein sequence :
MARKPIKGRKGGGSNATTPVESPDSIQSTARAKILIALGEGEFAGGLDGTNIYLDGTPIKNSDGTSNFTGVTWEYRPGTQ
AQDYIQGMPNVENEITVNTELKSDTPWVRSITNTQLSATRVRLGWPSLQRQADNGDVGGYRIEYAVDVATDGGAYQTLLD
TAIDGKTTTLYERSHRINLPKATAGWQVRTRRKTANANSGRIADKMNVEAISEVIDAKLRYPNTALLYIEFDATQFQNIP
IISCEPKGRIIRVPTTYDPVTRTYSGVWDGSFKWAHTNNPAWVFYNIVLADRFGLGHRIEVSQVDKWELYRIGQYCDQLI
PDGRGGSGTEPRFTCDVYIQSQAEAFTVLRDLAAIFRGMTYWGNNQLCTLADMPRDVDYIFTRASVIDGRFTYGGGSEKK
RYTTAMVSWSDPANNCQDAIEAVSDNDLVRRYGVNQLDMTAIGCIRQTEANRRGRWALLTNSKDRTVNFNVGLDGAIPLP
GHIIGVADDMLSGRKMGGRISSVSGRNITLDRVADVKAGDRLLVNLPNGVAQGRTVQVVNGKVITVTTAYSEVPAAESGW
SVDADDLAIQQYRVTGISDNDDNTYSISSVQHDPDKYERIDTGARIDERPISVIPPGVQPPPTNVVIDSFSALSQGLAIT
TLRVTWEPAASAIAYEAEWRRDNGNWISAPRTSAQGFQVEGIYAGQYQARVRAINPSEISSIWANAQETTLNGKEGNPPM
PVGFTATGILFGITLNWGYPEGAEDALKTEIEYSLSADGTDAMLLSDVPHPQRNYTMQGLRAGQVFWFRARIVDKSGNQS
PWIDWVRGMSSTDTSAILEAIGDDFINNTVAGQQLINDDFMNAEGILETAKANNASIWQQWAQHGENKAGVIHLTTTVAD
AERAFAEFETLVTATFEDQTAAIDQKMTAVVDANGASATYSLRAGLNYNGQFVSAGMVIGAEFINGVAKSSIGFTADQFI
LLSGPTGNLFSPFAVVNGQVFMNDAFIAKASIGRGKITDTLDSDNYVQGISGLKLDFKNGNAEFNNVNLRGNITMDNTIN
GIRTIVDYRGQRTYHANGQPAIICGYF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
ESA_01044 YP_001437149.1 hypothetical protein Not tested Not named Protein 0.0 61