PAI Gene Information


Name : STY2875
Accession : NP_457158.1
PAI name : SPI-9
PAI accession : NC_003198_P4
Strain : Salmonella enterica RSK2980
Virulence or Resistance: Not determined
Product : large repetitive protein
Function : -
Note : Similar to several including: Salmonella typhi proline/threonine-rich protein TR:Q9X6M3 (EMBL:AF139831) (1605 aa) fasta scores: E(): 0, 99.8% id in 1569 aa, Pseudomonas aeruginosa hypothetical protein PA1874 TR:AAG05263 (EMBL:AE004613) (2468 aa) fasta sco
Homologs in the searched genomes :   47 hits    ( 40 protein-level,   7 DNA-level )  
Publication :
    -Parkhill,J., "Direct Submission", Submitted (25-OCT-2001) Submitted on behalf of the Salmonalla sequencing team, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.

    -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., , "Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18", Nature 413 (6858), 848-852 (2001) PUBMED 11677608.

    -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., , "Direct Submission", Submitted (10-SEP-2013) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGCGTCTACTCGCCGTGGTTTCGAAATTGACTGGCGTCTCCACCACTGTGGAATCCTCAGCGGTCACTCTTAACGCGCC
TTCAATTGTTAAATTATCAGTGGCCCGGGATGAAATTAGTCAACTTACGCGCATTAATCAGGATCTGGTGGTGAGGCTCC
ATTCCGGCGAAACGATCACGATTAAAAACTTTTACGTTACCAACGATCTGGGCGCAAGCCAGCTGGTACTGGCGGAAAAC
GATGGCACGTTATGGTGGGTGGAAAATCCGCAAGCCGGGCTACATTTTGAACAAATCGCTGATATTAATGAGCTGCTGGT
CACTTCCGGCGCTTCCCATGAAGCAGGCGGCGCCGTTTGGCCGTGGGTACTGGCTGGCGCGGTGGCGGCTGGCGGCATTG
CCGCTATCGCGTCTTCCGGCGGCGGCGATTCCCACCATCATTCGGATGGCGATAATCCGCCCCCCGATAACACCAATCCT
GACGGTAATCCCCCTGATAACAGCAATCCCGGCGGCAGTAACCCTAACGGCAATACTCCAGGTAGCAGTAATCCTGTAGA
TACTACCCCGCCTCTCGCTCCCGGCGAATTATTGATTTCAGCGGACGGAAAAACGGTGAGCGGCGAGGCCGAAGCGGGCA
GTCTCATTACCATCAAAGATCCCTCAGGCAACGTCGTTGGCGAGGGCAAAGCGGATAGCGACGGTAAATTTAGTATTGAT
CTGACAGCGCCACAGATTAGCGGCGAACAACTTACCGTGACTGCGACTGACGATGCCGGCAATACCGGCCCATCCGCAAC
CATTGATGCGCCCAACATTCCTCTCCCCGATACACCGGTTATCACCGCCGCTATCGATGATGCCGCTCCCCTCACCGGCA
CGCTGAGCAATAATCAGTTTACGAACGACAATACCCCCACTCTGGAGGGCACCGGCAGCGCAGGCACAGTCATCCATATT
TACGCCAATGGTCAGGAAATAGGCTCAACAACAGTTGATACCAGCGGAAACTGGCATTTTGCCATTACCAGCGCGCTAGC
GGATGGGGAAAATCATTTCACCGCCATTGCGACTAACGTTAAAGGCGAAAGTAGCGAATCAGCCCGCTTTACGCTGACTA
TCGACACACTCAGCCCCGATGCTCCACGCGTTGAACTGATGGCCGATAACACCGGTTTGCTCACCGGACCGCTACAGAAT
AATGACCGGACTGACGAGGCAAAACCGCTATTTTCCGGGCAGGGAGAGGCAGGCAATACCATCACGATTAAAGAGGGTTC
AACCGTTATCGGCAGCGCTACCGTAGACGAAAATGGACGCTGGACCTTTACGCCGACTACGCCGTTAAGCGATGGCGAAC
ATACCTTTACCGTCGAACAGAGCGACAAAGCCGGAAACGCAAGCCGCGTGACGACAACGCCTACTATCATTGTGGACACC
ACGCCGCCGGACGCCGCTATCATTGATAATGTCGCGAAAGACGGCACAACCGTTAGCGGCACCGCTGAAGCTGGCAGTAC
CGTGTCGATCTATGACCCGGCGGGAAATTACCTGGGCTCCACGATTACCGGAGAAAATAACCACTTCAGCATCACGCTGA
ATCCGGCCCAGACCCACGGCGAGCGTCTGGAAGCGCGTATTCAGGACGCCGTCGGTAACATCGGCCCCGCCACAGAGTTT
ACCGCTTCTGACTCACAGTATCCTGCCCAGCCGACTATCCTTACCGTGACGGATGACGCTGGCGCCGTTACCGGGCTGTT
GAAAAATGGCGATGCCACAGATGATAACCGCCCAACCCTCAGCGGTACTGCTGAACCAGGCAGTACGATATCGATTAACG
ATAATGGCTTTCCTGTTCCGAGCTTTCCGCCCATTGTCGCTGACGCTGACGGCAAATGGAGCTTTACCCCCTCGCTGGCG
CTTGCCGATGGCGACCATGTCTTTACCGCTACCGCGACCAACGATCGCGGCACTAGCGGGCAGTCCGTCGCCTTTACCAT
TGATATCGACACGCAGCCGCCGGTGCTGGAAGGCCTGGCAGTTAGCGACGTCGGCGACAGACTCACCGGCACTACGGAAG
CTGGCAGCACTGTGGTTATCAAAGATAGCCTGGGAAATACGCTCGGGAGTGGAACGGCAGGCGACGACGGCACCTTCTCA
ATAGGTATTAGCCCGGCGAAAATTAACGGTGAAACATTAAGCATTAGCGTTACCGATAAAGCCGCGAATAGCGGTCCGGT
AGAAACGCTGAACGCGCCGGATAAAACTGCGCCTGCGGCACCGAACGGTCTTATCGTGGCGACCGACGGTCTGTCCGTAA
GCGGTCAGGCGGAAGCCGGGGCAACGGTCACTATCCGCGACAGTAGCAATACCGTACTTGGCAGCGCCGTCGCTAACGGC
AACGGACAATTTATCGTTCCGCTGAATGCGGCGCAGACTAACGGCCAGGCGCTTATCGCTACCGCCACCGATATAGCGAA
CAACGAAAGCGCCGCCGCGACGGTCGACGCGCCGGACAGTACCGCGCCAGAAATGCCGAAAAACGTGGTAATTAGTGAGG
ATGGCGCCAGTATCAGCGGCACCGCCGAACCGGGTAGCTCCATTACGATCACCACGCCGGACGGCACGCCGCTCGGCAGC
GGCAAAGCAGATGGCGAAGGTCATTTTACCCTTCCCCTCGCCCCCGCACAGACCAACGGCGAACAGGTTACCGTCACCGC
CACCGACAGCGCCAACAACGTCAGCCCGCCAACTACAGCGCAAGCGCCCGATATCACCGCCCCGGATAAGCCCATTATCA
CCCAGGTGCTGGACGATGTTGAAAGCTTCACCGGGCCGCTGGTTAACGGACAAACCACCAATGACAACCGCCCCACCCTT
AGCGGTACGGCGGAGGCTGGCGCGCGTGTCGAAATTTTTGATAACGGCGTTTCGCTGGGGCTCGCCACGCTACAGCCCAA
CGGCGGCTGGACGTTTACGCCGTCGCAAAATTTAGGTGAAGGCGCGCATCGACTGACCGTAATCGCAACCGACGCTAAAG
GCAATGCCAGTCCGGCGGGGAACGAAAGTCCGGAATCCATCAGCTTTACCCTACGCATCGATACCCAGGCGCCGGATGCG
CCGCAGATCGTGTCAGCCGCCATCACAGGCGGAGAAGGCGAGGTGCTACTGGCAAACGGCAGTATTACCAATCAGCGTAT
GCCGACCCTCAGCGGCACCGGCGAACCCGGCGCCATCATCACCCTGTACAATAACGGCGTAGAACTGGCTACCGTCCAGG
TCAATCCACAGGGTAGCTGGACCTATCCGCTAACCCGTAATCTGAGCGAAGGGTTAAACATCCTGACGGCCACCGCCACG
GATGCCGCAGGCAATAGTAGCCCGACCTCCGGCGTTTTCTCCGTTACCCTTGATACCCAGCCTCCAGCGCAGCCTGACGC
GCCGCTAATCAGCGATAACGTCGCGCCGGTTATCGGCAACATCGGCAATAATGGCGCAACGAACGATACCACGCCGACCT
TCAGCGGCACGGGAGAGATCGGCAGCACGATAATTCTCTACAATAATGGCAGTGAAATTGGTCGCACAACGGTAGGCGAT
AACGGTAGCTGGAACTTTACGCCTGCGGCACTGACGCCAGAAACCTATACCATTACCGTCACGGAAACCGATATAGCGGG
CAATATCAGTCCACCTTCCGCCTCAGTCACTTTTACGCTAGACACCACTGCGCCCGCCAATCCGGTTATCACTTTTGCCG
AAGATAACGTCGGCGAAGTCCAGGATACTATTGTCAGCGGCGCAACCACTGACGACAATACACCGGTCATTCACGGCACT
GGCGACATCGGCAGCGTGATCACGCTCTATAATGGCAGTAGTGTCGTGGGCGTAGTCACCGTCGATGAGACCGGTACCTG
GACGCTGCCGGTGACCAGCGCGTTGCCGGATGGCGTCTACACCCTGACCGCCATTGCCGCCGATGCCGCCGGAAACAGCA
GCGGCGTATCGAACAGCTTTACCTTCACCGTCGACACCGTTCCGTTGCAGCCGCCCGTCGTCAATGAGATCCTCGACGAT
GTTGCACCAGTGACCGGGCCATTAACCGATGGCGCCTTTACTAACGATCGGACGCTGACTATTAACGGCAGCGGCGAAAA
CGGCAGCACCGTCACGATTTACGACAATGGCGTGGCAATCGGGACGGCGCTCGTTACCGACGGGGTCTGGACATTCAATA
CGTCCGAATTGTCAGAAGCCAGCCATGCGCTAACCTTCAGCGCGACTGACGATGCTGGAAATACCACGGCGCAAACCCAA
CCGATCACCATTACCGTGGATATCACCGCCCCGCCCGCGCCAACGATCCAGACGGTGGCCGATGATGGCACGCGCGTCGC
CGGACTTGCCGATCCTTACGCTACCGTTGAAATTCATCATGCCGATGGCACCCTGGTCGGCAGCGCTGTCGCTAATGGCA
CCGGTGAATTTGTCGTCACGCTCAGTCCGGCGCAAACCGATGGCGGTACGCTGACGGCAATTGCTATCGATCGCGCGGGG
AATAACGGCCCGGCTACGAATTTTCCTGCTTCCGACAGCGGTCTGCCCGCCGTCCCGGCCATCACGGCGATTGAAGATGA
TGTCGGGAGCATACAGGGGAATATTGCAGCGGGCGGCGCCACGGACGACACCATGCCGACGCTGCGCGGCACCACGGATA
TCGGCTCTACCGTTGAAGTTTTCATTGATGGCGATTCGGCAGGCTTTGCCACCGTTGACGCCAGCGGGAACTGGATCTTT
GAGATCGCGACGCCATTAAGCGAAAGCACACATTACTTCACCGTCCAGGCAACCAATGCGAATGGCCCGGGCGGCCTGTC
CGCACCGGTCGGGATCACTGTCGATCTTAGCGCGCCGGCACAACCGGTTATTACCAGCGCAACGGATGATGTCCCCGGCA
TGACCGGTACGCTGGATAACGGCGCGCTCACCAATGATTCACGCCCGACGCTCAACGGAACGGGAGAAGCAGGCGCCACA
ATCCGCATTCTGGATAACGGCGTAGAAATCGGTTCCGCTACGGTAGATCAAAGCGGCAACTGGCGCTTCACCCCGAACAC
GCCGCTGGAGAGCAACGCACACATCTTTACCGCCGTGGCGACCGATCCCGCCGGCAATAGCGGTCAGCTTTCGGATGGTT
TTACGCTGAACATTGACGCGCAGGCACCAGATGTGCCGGTTATCACCTCCGTGATTGACGATAACAATCAACCGACCGTC
CCGGTGTTACCGGGGCAATCCACCGACGATCGGCAGCCAATACTGAACGGAACTGGCGAACCTGGCGCGACAATCACCAT
TTTTGATAACGGTACGCCGCTTGGCACGGCTCAGGTAGGCGAAAATGGTAGCTGGACCTTCCCGGTGCCCCGCAATTTGT
CAGAGGGAAGCCATAATCTGACGGTTAGCGCTACCGATCCGGCGGGCAATACCAGCGCGGTCTCCGCGCCGTGGACGATC
GTGGTCGATATTACGCCTCCGGCGATCCCGGTTCTCACCTCCGTTGTGGATGACCAGCCCGGTATTACCGGCAACCTGGT
TAGCGGGCAGCTAACGAACGATGCGACGCCCACCCTGAACGGGCGCGGAGAGGCAGGCGCGACGATTAATGTCTATCTTG
ACGGTAATCCCGCGTCCATCGGTACCACGACGGTGAATAGCGACGGCACGTGGAGTTTCACGCCGCAGACGCCGCTTGCA
AACGGTAGCCACACGTTCACCCTTAGCGCCACCGATCCGGCGGGTAATAGCAGCGCGGTGTCCAGCGGATTTGTGCTGAC
GATTGACACCACACCGCCCGCCGCGCCGGTTATCGCCAGCGTGGCAGATAATACGGCGCCGGTAACGGGCATCGTCCCCA
ACGGCGGCTCGACGAACGAAACCCGACCAACACTCTCGGGTACCGGTGAGGCGGGTACAACCATCTCGATTTATAATGGC
AGCGCGCTGGTCGGCACGGCGCAAGTTCAGGCCAACGGTAGCTGGAGCTTTACACCGTCTACCTCGCTGGGCGCGGGCGT
CTGGAACCTGACGGCGACAGCAACCGATGCGGCAGGCAATACCAGCGCCGCGTCCGAAATACGCTCGTTTACTATTGATA
CCACGGCTCCCGCCGCGCCTGTTATTGATACGGTCTACGACGGTACGGGCCCCATTACCGGCAATCTGAGTTCAGGGCAG
ATCACAGACGAGGCGCGCCCTGTCATTAGCGGCACCCGTGAAGCCAACACAACTATTCGTCTCTACGATAACGGCACGCT
GCTGGCTGAAATTCCCGCCGACAATAGCAGTAGCTGGCGCTACACGCCCGACGCCTCGCTGGCGACGGGCAACCATGTAA
TTACCGTCATTGCCGTTGATGCCGCAGGCAACGCCAGCCCCGTTTCGGACAGCGTTAATTTCGTCGTCGATACCACGCCG
CCGCTGACGCCGGTAATCACATCAGTCAGTGACGATCAGGCGCCAGGCCTCGGCACGATCGCGAACGGCCAAAATACCAA
CGATCCTACGCCAACCTTCAGCGGCACCGCAGAAGCCGGCGCCACGATCACGCTCTATGAAAATGGTACGGTCATTGGCA
CGACAACGGCTCAGCCTGACGGCGCGTGGAGCGTCTCCACCTCAACGTTGGCAAGCGGAACGCACGTCATCACCGCCGTC
GCCACCGATGCCGCAGGAAACAGCAGCCCGAACAGTACGGCTTTCACCCTGACGGTCGATACCACCGCGCCGCAAACGCC
AATCCTGACGTCCGTGGTGGATGACGTCGCGGGCGGGGTCACAGGAAATCTCGCTAATGGTCAGATAACCAATGATAACC
GCCCCACGCTGAACGGCACTGCCGAAGCGGGCAGCGTGGTCAGTATCTATGATGGCGACACTCTGCTTGGCGTCACCTCG
GCTAACGCTAGCGGCGCATGGAGCTTCACGCCGACGACAGGGTTAAACGACGGCACGCGCACATTAACAGTGACCGCCAC
CGACCCAGCAGGCAACGTTAGCCCGGCCACCAGCGGTTTTACTATCGTGGTCGATACCCTTGCGCCAACGGTTCCGCTTA
TAACCAGCATCGTTGATGATGTCCCGAACAATACCGGCGCCATTGGCAATGGACAATCGACCAATGACACACAGCCGACG
CTCAACGGTACCGCGGAAGCCAACAGCGCGGTAAGTATCTTCGATAATGGCGCGCTGGTCGCGACCGTGAACGCCAATGC
CAGCGGCAACTGGAGCTGGACGCCAACCGCCTCGCTCGGCCAGGGAAGTCATGCCTATAGCGTTAGCGCCGCCGATGCGG
CTGGCAACGTTAGCGCCGCTTCGCCATCGACAACGATTATCGTGGATACCATTGCGCCCGGCGCGCCCGGTAACCTAGTC
ATCAATGCTACCGGTAATCGGGTGACGGGCACCGCGGAAGCAGGCAGCACAGTGACGATTACCTCTGAGACTGGTGTGGT
ACTGGGAACCGCCACCGCCGACGGTACGGGCAGCTTCACCGCCACACTCACGCCCGCGCAGACCAATGGTCAGCCGCTAC
TGGCATTTGCCCAGGATAAAGCAGGCAACACTGGCATTGCCGCCGGATTTACCGCGCCCGATACGCGCGTGCCGGAAGCG
CCGATCATCACCAACGTAGTGGATGATGTGGGTATTTATACCGGCGCTATCGCCAACGGTCAGGTCACTAATGACGCACA
ACCCACATTGAATGGTACCGCTCAGGCGGGCGCCACGGTGAGCATTTATAACAACGGGGCGCTGCTCGGCACCACCACGG
CGAACGCCAGCGGGAACTGGAGCTTTACCCCGACAGGCAATTTGACCGAAGGCAGCCACGCCTTCACCGCTACCGCAACT
AACGCCAACGGTACAGGCAGCGTCTCCACCGCCGCGACGGTGATTGTCGATACGCTGGCGCCCGGTACGCCGTCAGGTAC
GCTCAGCGCCGATGGCGGTTCACTTTCCGGACAGGCTGAGGCAAACAGCACCGTAACCGTAACGCTGGCGGGGGGCGTGA
CGCTCACCACCACCGCCGGCAGCAACGGCGCATGGTCTCTCACCTTGCCGACAAAACAAATTGAAGGTCAACTCATTAAC
GTGACGGCCACTGACGCTGCGGGTAACGCCTCCGGCACGTTAGGCATTACCGCGCCGATTCTGCCGCTGGCGGCAAGGGA
TAACATAACCAGTCTTGATCTGACCTCTACCGCCGTCACCAGCACGCAAAACTATTCGGATTACGGCCTGCTGCTGGTTG
GCGCGCTTGGCAATGTCGCCTCGGTTTTGGGTAACGATACCGCTCAGGTTGAGTTCACCATTGCTGAAGGTGGTACGGGC
GACGTCACCATCGATGCCGCCGCAACGGGAATCGTGCTTTCGCTGCTCAGCACTCAGGAGATAGTGGTACAGCGCTATGA
CACCAGCCTCGGCACCTGGACGACGATCGTCAACACCGCCGTTGGCGACTTCGCGAATTTGCTTACCCTGACCGGGAGCG
GCGTTACCCTGAACCTGAACGGACTGGGCGAAGGCCAGTACCGGGTACTCACTTATAACACCAGTCTGCTCGCCACCGGG
TCATATACCAGCCTGGATGTCGATGTACACCAGACCAGCGCAGGTATTATTAGCGGGCCAACCATCAGTACCGGCAACGT
CATGGCTGATGATACCGCGCCGACGGGCACCACGGTCACCGCCATCACCAACGCCAACGGCGTCAGTACGCCGGTCGGCG
CGGGCGGCGTGGATATCCTGGGGCAATACGGCACGCTGCACATTAATCAGGATGGCAGTTACACCTACACGCTGACTAAG
CCCACGGCGGGATACGGACATAAAGAGAGCTTCACCTACACCATCACCCAGAATGGCGTCGGTAGCAGCGCCGCGCAACT
GGTTATTAATTTGGGTCCCGCGCCTGTACCGGGCAGCGTGATAGCGACAGACAATAACGCCTCGCTGGTCTTTGATACTC
ACGTTAGCTACGTCAACAACGGCCCCTCGACACAAAGCGGCGTCACGGTATTAAGCGTCGGACTTGGTAATGTACTGAAC
GCGAATCTGCTTGATGATATGACTAATCCGATCATCTTTAACGTTGAAGAAGGCGCTACGCGAACCATGACGTTACAGGG
AACCGTCGGCGGCGTCTCACTGGTTTCCACGTTCGATCTGTACGTTTATCGCTTCAACGATGCCATTCAACAATATGAGC
AGTTCCGGGTGCAAAAGGGCTGGATTAACACCCTGCTGTTAGCCGGACAGTCCCAGCCGCTGACCCTGACGTTGCCTGGC
GGCGAATACTTGTTCGTGCTGAATACCGCCAGCGGCATTAGCGTCCTCACTGGCTATACGTTGGCGATTTCCCAGGACCA
CACCTATGCCGTTGACAGTATCACCGCCAACACCACCGGCAACGTACTGACCAATGATGTCGCCCCTACGGACGCCCTCC
TCACTGAAGTAAACGGCGTGGCGATTGCGGCGACCGGCACGACGGAGGTAAATGGGCTGTATGGCTCGCTCATCATTGAC
GCAAGAGGCAACTATACCTACACGCTGAAGAACGGCGTCGGCGCCGACAGCATTAAAACGCCGGACAGCTTTATCTATAC
GCTCAAAGCGCCAAACGGCGATACCGATACGGCCTCGCTCAATATCACGCCAACCGCCAGGGCGCTGGATGCGATTAATG
ATGTCAGCGATACCCTTAGCGTCGCCACGCTTCAGGATACCGCTGCCTGGCTGGACTCCAGCGTCGGCAGCGCCAGTTGG
GGGCTACTCGGCAAATCGGGCAGCGGGAGCGGCACCTTTGACGTTGCAACGGGCACCGTACTTAAAGGCGCGTCACTGGT
CTTTGATGTCTCCACGCTCATTACGCTGGGCAATCTGAATATTAGCTGGGCCATTCAGGAGAACGGAACCGTCATACGCA
ACGGAACCGTCCCGGTGGCGAATATCACGCTGGGCAGCGCGACGGTGACCGTCAACCTGAGCGGCCTGGAGCTGGATGCC
GGAACGTACACGCTTAACTTTACCGGCACCAATACCCTGGCCGGGGCGGCGACAATCACGCCACGCGTCATCGGCACCAC
CGTCGATCTGGATAATTTTGAAACGTCCGGAACGCATACCGTTCTCGGCAATATTTTTGACGGCAGCGACGCGGCGGGGG
CGATGGATCAGCTTAATACGGTGAATACTCGCCTGAGCATTAGCGGGTATAACGGCAGCGCCGCCACGCTGGACGCCGCG
GCGAATACCACCAGCGCCACGATTCAGGGACATTACGGCACATTGCAAATTAACCTCGATGGCGCTTACACCTACACCCT
GAATAATGGCGTCGCGATGTCGTCCATCACCAGTAAAGAGGTCTTTACCTATCAACTGGATGACAAGATAGGTCATACGG
ATAGCGCCACATTGACCATTGATATGGCGCCGCAAATCGTCAGTACCAACCAAAACGATGTTCTTATCGGCTCCGCCTAT
GGCGATACGCTGATTTATCACCTGTTAAACGGCGCGGACGCGACCGGCGGCAACGGCGCCGATCGCTGGCAAAACTTCTC
CACCGCGCAGGGCGACAAGATCGATATCCACGAACTGCTGACCGGCTGGGATCACCAGGCGGCGACGCTAGGTAACTTTG
TTCAGGTTCATACCAGCGGCGCCAATACGGTGATATCCGTCGATCGCGACGGCGCCGGCAGCGCGTTTAAATCAACTGAC
CTTGTCACTTTGGAGAATGTGCAGCTCACGCTAAATGATCTGTTGCAGAACAACCACCTGATAACCGGCGGTTGA

Protein sequence :
MRLLAVVSKLTGVSTTVESSAVTLNAPSIVKLSVARDEISQLTRINQDLVVRLHSGETITIKNFYVTNDLGASQLVLAEN
DGTLWWVENPQAGLHFEQIADINELLVTSGASHEAGGAVWPWVLAGAVAAGGIAAIASSGGGDSHHHSDGDNPPPDNTNP
DGNPPDNSNPGGSNPNGNTPGSSNPVDTTPPLAPGELLISADGKTVSGEAEAGSLITIKDPSGNVVGEGKADSDGKFSID
LTAPQISGEQLTVTATDDAGNTGPSATIDAPNIPLPDTPVITAAIDDAAPLTGTLSNNQFTNDNTPTLEGTGSAGTVIHI
YANGQEIGSTTVDTSGNWHFAITSALADGENHFTAIATNVKGESSESARFTLTIDTLSPDAPRVELMADNTGLLTGPLQN
NDRTDEAKPLFSGQGEAGNTITIKEGSTVIGSATVDENGRWTFTPTTPLSDGEHTFTVEQSDKAGNASRVTTTPTIIVDT
TPPDAAIIDNVAKDGTTVSGTAEAGSTVSIYDPAGNYLGSTITGENNHFSITLNPAQTHGERLEARIQDAVGNIGPATEF
TASDSQYPAQPTILTVTDDAGAVTGLLKNGDATDDNRPTLSGTAEPGSTISINDNGFPVPSFPPIVADADGKWSFTPSLA
LADGDHVFTATATNDRGTSGQSVAFTIDIDTQPPVLEGLAVSDVGDRLTGTTEAGSTVVIKDSLGNTLGSGTAGDDGTFS
IGISPAKINGETLSISVTDKAANSGPVETLNAPDKTAPAAPNGLIVATDGLSVSGQAEAGATVTIRDSSNTVLGSAVANG
NGQFIVPLNAAQTNGQALIATATDIANNESAAATVDAPDSTAPEMPKNVVISEDGASISGTAEPGSSITITTPDGTPLGS
GKADGEGHFTLPLAPAQTNGEQVTVTATDSANNVSPPTTAQAPDITAPDKPIITQVLDDVESFTGPLVNGQTTNDNRPTL
SGTAEAGARVEIFDNGVSLGLATLQPNGGWTFTPSQNLGEGAHRLTVIATDAKGNASPAGNESPESISFTLRIDTQAPDA
PQIVSAAITGGEGEVLLANGSITNQRMPTLSGTGEPGAIITLYNNGVELATVQVNPQGSWTYPLTRNLSEGLNILTATAT
DAAGNSSPTSGVFSVTLDTQPPAQPDAPLISDNVAPVIGNIGNNGATNDTTPTFSGTGEIGSTIILYNNGSEIGRTTVGD
NGSWNFTPAALTPETYTITVTETDIAGNISPPSASVTFTLDTTAPANPVITFAEDNVGEVQDTIVSGATTDDNTPVIHGT
GDIGSVITLYNGSSVVGVVTVDETGTWTLPVTSALPDGVYTLTAIAADAAGNSSGVSNSFTFTVDTVPLQPPVVNEILDD
VAPVTGPLTDGAFTNDRTLTINGSGENGSTVTIYDNGVAIGTALVTDGVWTFNTSELSEASHALTFSATDDAGNTTAQTQ
PITITVDITAPPAPTIQTVADDGTRVAGLADPYATVEIHHADGTLVGSAVANGTGEFVVTLSPAQTDGGTLTAIAIDRAG
NNGPATNFPASDSGLPAVPAITAIEDDVGSIQGNIAAGGATDDTMPTLRGTTDIGSTVEVFIDGDSAGFATVDASGNWIF
EIATPLSESTHYFTVQATNANGPGGLSAPVGITVDLSAPAQPVITSATDDVPGMTGTLDNGALTNDSRPTLNGTGEAGAT
IRILDNGVEIGSATVDQSGNWRFTPNTPLESNAHIFTAVATDPAGNSGQLSDGFTLNIDAQAPDVPVITSVIDDNNQPTV
PVLPGQSTDDRQPILNGTGEPGATITIFDNGTPLGTAQVGENGSWTFPVPRNLSEGSHNLTVSATDPAGNTSAVSAPWTI
VVDITPPAIPVLTSVVDDQPGITGNLVSGQLTNDATPTLNGRGEAGATINVYLDGNPASIGTTTVNSDGTWSFTPQTPLA
NGSHTFTLSATDPAGNSSAVSSGFVLTIDTTPPAAPVIASVADNTAPVTGIVPNGGSTNETRPTLSGTGEAGTTISIYNG
SALVGTAQVQANGSWSFTPSTSLGAGVWNLTATATDAAGNTSAASEIRSFTIDTTAPAAPVIDTVYDGTGPITGNLSSGQ
ITDEARPVISGTREANTTIRLYDNGTLLAEIPADNSSSWRYTPDASLATGNHVITVIAVDAAGNASPVSDSVNFVVDTTP
PLTPVITSVSDDQAPGLGTIANGQNTNDPTPTFSGTAEAGATITLYENGTVIGTTTAQPDGAWSVSTSTLASGTHVITAV
ATDAAGNSSPNSTAFTLTVDTTAPQTPILTSVVDDVAGGVTGNLANGQITNDNRPTLNGTAEAGSVVSIYDGDTLLGVTS
ANASGAWSFTPTTGLNDGTRTLTVTATDPAGNVSPATSGFTIVVDTLAPTVPLITSIVDDVPNNTGAIGNGQSTNDTQPT
LNGTAEANSAVSIFDNGALVATVNANASGNWSWTPTASLGQGSHAYSVSAADAAGNVSAASPSTTIIVDTIAPGAPGNLV
INATGNRVTGTAEAGSTVTITSETGVVLGTATADGTGSFTATLTPAQTNGQPLLAFAQDKAGNTGIAAGFTAPDTRVPEA
PIITNVVDDVGIYTGAIANGQVTNDAQPTLNGTAQAGATVSIYNNGALLGTTTANASGNWSFTPTGNLTEGSHAFTATAT
NANGTGSVSTAATVIVDTLAPGTPSGTLSADGGSLSGQAEANSTVTVTLAGGVTLTTTAGSNGAWSLTLPTKQIEGQLIN
VTATDAAGNASGTLGITAPILPLAARDNITSLDLTSTAVTSTQNYSDYGLLLVGALGNVASVLGNDTAQVEFTIAEGGTG
DVTIDAAATGIVLSLLSTQEIVVQRYDTSLGTWTTIVNTAVGDFANLLTLTGSGVTLNLNGLGEGQYRVLTYNTSLLATG
SYTSLDVDVHQTSAGIISGPTISTGNVMADDTAPTGTTVTAITNANGVSTPVGAGGVDILGQYGTLHINQDGSYTYTLTK
PTAGYGHKESFTYTITQNGVGSSAAQLVINLGPAPVPGSVIATDNNASLVFDTHVSYVNNGPSTQSGVTVLSVGLGNVLN
ANLLDDMTNPIIFNVEEGATRTMTLQGTVGGVSLVSTFDLYVYRFNDAIQQYEQFRVQKGWINTLLLAGQSQPLTLTLPG
GEYLFVLNTASGISVLTGYTLAISQDHTYAVDSITANTTGNVLTNDVAPTDALLTEVNGVAIAATGTTEVNGLYGSLIID
ARGNYTYTLKNGVGADSIKTPDSFIYTLKAPNGDTDTASLNITPTARALDAINDVSDTLSVATLQDTAAWLDSSVGSASW
GLLGKSGSGSGTFDVATGTVLKGASLVFDVSTLITLGNLNISWAIQENGTVIRNGTVPVANITLGSATVTVNLSGLELDA
GTYTLNFTGTNTLAGAATITPRVIGTTVDLDNFETSGTHTVLGNIFDGSDAAGAMDQLNTVNTRLSISGYNGSAATLDAA
ANTTSATIQGHYGTLQINLDGAYTYTLNNGVAMSSITSKEVFTYQLDDKIGHTDSATLTIDMAPQIVSTNQNDVLIGSAY
GDTLIYHLLNGADATGGNGADRWQNFSTAQGDKIDIHELLTGWDHQAATLGNFVQVHTSGANTVISVDRDGAGSAFKSTD
LVTLENVQLTLNDLLQNNHLITGG