PAI Gene Information


Name : STY4459
Accession : NP_458559.1
PAI name : SPI-4
PAI accession : NC_003198_P8
Strain : Salmonella enterica RSK2980
Virulence or Resistance : Not determined
Product : large repetitive protein
Function : -
Note : Similar to Synechocystis sp. hypothetical 308.8 kDa protein slr0364 TR:Q55582 (EMBL:D63999) (3029 aa) fasta scores:E(): 0, 26.2% id in 2291 aa. Also similar to Salmonella typhimurium SPI-4 hypothetical proteins: TR:O85324 (EMBL:AF060869) (1512 aa) fasta s
Homologs in the searched genomes : 4 hits (4 protein-level)
Publication :
    -Parkhill,J., "Direct Submission", Submitted (25-OCT-2001) Submitted on behalf of the Salmonella sequencing team, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.

    -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., "Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18", Nature 413 (6858), 848-852 (2001) PUBMED 11677608.

    -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., "Direct Submission", Submitted (10-SEP-2013) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
GTGGAAGATCTGGCGGGGAATGTAAAAGAATCTGCGCCGTTAGAGGTACGTATTGATACCACGACAACCATTAACAATAT
CGTATTGCTTAATGATACTGGCGTGCAGAATGATCAATTAACGAATGTTGCCAAACCGTCATTCAGAATTGACGTTCCCG
GTGATGTCGTCCAGGTACGCGTAACCCTGGATGGTGGCGCTAACTGGAATGTGATACGCAAAAATGCCGACGGACAGTGG
ATTTTTGACAGCCCGAATACTCTGGTTGACGGCACATATACCCTTCGCGTAGAGGCCACGGATGAGGCAGGTAATATTGC
AAATAAAGATTTAGTATTTAATATCGATACTAATATACAGGTTCCGACTATTGCTTTAGACGCAGGACAAGATACCGGAG
CGAATACAGCCGATAATATTACTAATATTTCACGACCCACCTTTACGATTGGTAATGTTGACCCCGATGTTATCAAAGTC
GTGGTGACGATTGATGGTCATGATTATAACGCGACTAAGGTTGGGGCTGGTTGGCAATTTACACCAGGCAATGCCATTCC
GGATGGTTCTTATAATATTACCGTTACGGTTGAAGATAAGGCCGGAAATACCGCGACATCGAAACCATTACCTGTTGTGA
TAGATACGACGGCTGAAATTGAAAGCGTCACGTTGGTTACAGATAGCGGTGATAGCGATGTAGATAACATTACCAAAGTC
GACAAGCCGCAGTTTAGTATTGTTACCGCTGATGATATTACCCATGTGCGCGTTAAAATCGATAACGCCGCTAATTGGAT
TGAACTCACAAAAGGAGGGGATGGTCGCTGGATATTTAATGTCGGTTCGGCATTACCTGATGGGCAACACACTCTCTTGG
TTGATGTGACTGATATCGCCGGCAACGTTGCGCAAGAAACGCTGCAGTTTACGATTGATACGACTCTGCGAGAGCCGACA
ATTGTACTCGATCCCACCCATGATACTGGTGATGATACTAATGATAATCTTACCAGGATTAACAAACCGGTGTTTATTAT
CGGTAATGTCGATAATGATGTATCACACATTGTGGTTCATCTTGATGGTCGGGATTACACCATTGAAAACAAAGGGGGGA
ATTTAACCTTTACGCCGGATCAACCGCTGTCTGACGGTCAGCATACGATCTCTGTTACCGTAACGGATATTGCTGGTAAT
ACCAAAACATCGGCCGAACTGCAGATTGAAATCGACACGCAGGTTCAGATTGACAGTGTTACGTTAACAACAGATAGCGG
CGTCAACGATCACGATAATGTCACCAATGCTACCCGTCCCTCTTTTGAAATTGCAACGCCTGATGATGTGACATCGGTGC
TGGTTTCTTTCGATGGCGTAAACTGGACGCCCATCAGTAAAAATGCGGCCGGGCAGTGGCAATTTACTGCAGGTAGCGCA
TTGTCTGATGGTCATTATACTCTCCATGTCCAGGCGACGGATCGGGCAGGGAATACGGCAAATTCCACGCTGGGCTTTAC
CGTGGATACGCAGATTGACGGCCTGAGCGTCGTGATGCTGGACGACGCCGGAAAGGATTCTACGGATGGTATTACGAATA
TTACCTCTCCACGTTTTGAAATTTCAGCCAGAGAACAGTTGCAGAGCGTGACGGTAATTTTAAACGGGAAATCCAGCACC
CTGACTCAGGGGGCAGGTAATAAATGGCTGTTTACCCCTGATACACCGTTAGTGGATGGAACTTACAAAATAGAAATAGT
GGCTGAAGATATCGCAGGTAATAAAATTAGCAAAGAGGTATCATTCACAATAGACACTATTGTTTCTGATCCCAGTATTG
ATTTGCTGGATGCGGATGATACTGGCGAAAGCGCTGTTGATAATATTACGAGTGTCACTACACCACGTTTCGTTATTGGC
AATGTACCCGCCGATATTGATACTGTTGTAATCAGAATTAACGGCGTTTCTTATCCGGTTACGGCAAATGGCAATAACCT
CTGGGAATTTCAGGTTCCCGTTGCGTTAAACGATGGCGTATATGAAGCCGTTGTTGTCTTCAGAGATATTGCCGGAAATA
CTTCTGAAACTAAGCTGCCCTTTACCATTGATACCACGACAAGCGTCAGTGTCAGAATGGAGCCAGCGTCTGATACCGGC
AGCTCCAATAGCGATAACCTTACGAATAAGCAAAATCCCAAATTCGAAGGTACTGCAGAGCCCAATGCGAAACTGGTGAT
TACCATTGTTGACGATAAGTCAGGTCGGGAGGTTTTAAAACACACGATTACGGTTGGCGCCGATGGCAACTGGAGTGTGA
CGCCGAATATACTGCCGGATGGCATGTATACCATCAACGTCGTCGCAACAGATGTCGCGGGAAATACTGCGCAAACGCAG
GAAAGATTCACTATCGATACGGTTACGATCGATCCCACCATTCGCCTTTCGGATCCATCTATTGATGATCAGTATGAGGC
AACCAGCCTGCGTCCTGAGTTCAAAGGGCTCGCCGAAGCGTTCTCGACGATTATGATTCAGTGGGATGGGAAAGTGGTCG
GCTCGGCAAACGCCAATGCGAATGGTGAATGGAGTTGGACGCCGCCATCAGTATTAGCGCCAGGCTCCTATGTTGTGAGC
ATTGTTGCCAAAGATAAAGCGGGTAATGAATCGTCGCAGGTCGACTTTCCTGTAGTAATACCTGTTATTGATGTCACGCC
TCCAACCATAAAGCTCAGCGAGGAGAGCGATAGTGGTGCCTTAGGAGACTTTACGACGAATAATAAAACGCCGACCCTGA
TTGGGAGCACGTTACCTAATACAATTGTGAGTATTTATGTGGATGGCGTGAAGGTCGGCGAGGCGACAGCGGATACAGCG
GGCCGATATACTTTCCAGTTATCGGAAATGAAAGATGGCCATTATGTCGTCCAGGTGGGTATCGTCAACCCTCGCGATAA
TAGCGAACTGCGTTCCACCGCCGTTGATGTCACTATCGATACCGAGGTTGCTGAACTGGTATGGAATATATCTGGAATGC
ATGAGGGCGGATATATCAACACGGTGACGCCGGAGATTGGCGGCACCAGTGAGCCAAACAGCAAAATCACTATCTTTGTG
AATGGCGTTGAAAAAGCGATTGCTTATACGACAGGCGCAGGACACTGGGGCGTAGTATTACCCGCTTTGGGTAATGACGG
TAATTATGAATTAACGTTTAAAGTTGAAGACGTTGCCGGCAATATCAGAGAGTTTGGTCCGCAGAATGTGATACTGGATA
CGGTAATTTCGCCGTTAACCGTGGTATTACGCGAAGCTGATGACAGTGGCAAAGTTGGCGACTGGATCACCAATAAATCT
CATGTCACCATCGATGGTACTGCCGAAGCCGGAAGTACTTTGACCATCAGGAATCCGCAGGGAGTGGTTATTGCTACCCT
GGTGGTAGGCAATGATGGTCGATGGAGCGCAGAATTAGATCTGCGTGAAGGTAGTAATGCCTTTGTCGTGGTATCGGAAG
ATAAAGCGGGCAACGGCCAACAAAAAGATATTCTGATAGAACATGATACGCAGATTGAAATCAGCGATATTTCATTAAGT
CGGGATACTAATAGCGGTGATAAATATGATCTGATTACCAATAATAAGTCTCCGGTACTGGTTGCCATGACCGATCCCGG
CGCGACGGTACAGGTTTATATTAATGGTGTGTTACAAGGCACAGTAGAGGCGAGTTCGTCAGGTAATATTAGCTATACCA
TGCCGGCAAATAGCGCCGACGGCGAGTATCAGGTGCAATTTGTTGCTACGGATACTGCTGGTAACCGGGTTGAATCTGCG
ATTACGACCGTGACAATCGATTCTCAAATTGCTGTCTTTGATATTGATGAAGATTCATTACCGGCCCTCTCTAATAACCG
GGCGTTGTCAGTCTCAGGCGTCGGGGAGGCTGGTTCTCAGGTCAGCATCTTTGTCGATGGTAAATTAGTCAACGTTGTTA
TGGTTGAGGCTGATGGCTCATGGCGCGCGCCGATACTGCTGCAAGATGATGGTACGTTTAATATTCATTTCAGCATTACT
GACGTTGCTGGCAACACTCAAGTAAGCAAGAATTACAGTGTGGATGTCGATTCATCAACCGACTTCCCAACGCTCAACCT
TGAAGATGCGAGCAACTCTGGTTCACTTGACGATCTGATTACTAATCACAACAAGCCTGTGTTAGTTGGCACCGCAGAAG
CGGGAGCCACAATCCATATTTATGTGGATGAAAAGATCGTGGCAAATGTGCTTGTGCTTGAAGATGGAACCTGGTCCTAT
CAGTTTGATAATGCGTTAAAAGATGGTGAATATTCTATCCGTGTGGTTGCCGAAGACCCGGCAGGTAATACGGCAGAATC
GCCTCGCTTACTCGTCACGATAGATACCAGTACGTTTATCGATAATCCTGCTATGGTGGCAGGTTCTGATAACGGTATTT
TCAGTAATGATAGTATAACGAGTCAGACCCGGCCTACGTTTAGTATTTTTGGAGAAATGAACCAGAGTGTTCAGATTTTC
ATTGATGGGGTGTTAGTCGATACGATCACGGTGACGGACAGAAATCAAGTTTATCGACCTGAGTCACCGTTGGGCGATGG
TTCCCATAGCATTTATTATGTTATCACCGATAAAGCAGGCAACACGGCTACCTCGAAAACGCTAAACTTTACTATCGATA
CCTTTAATACGACGCCTGTCGCCATTGATTCTATCGGTGGACAAACGTTAGCAGAGATGACCGGTAGTGATGGCAAAATA
TATATAACGGACACGACGCGTAACTTATTGTTTAGTGGCAGTGCCGAGCCCAATAGCAAAATAGAAATCATTATTAATGG
CTTAAATGTGGGGGAAGTTTGGGTTAATGAAAAAGGCCACTGGCAGATGCCGGTGAACCCGCTTTATTTCACAGAAGGCC
AACTGGATATCACTGTTAAATCTACGGACCGTGCTGGTAACGTAAATCAGGAAAAGTATTCCATTTGGGTCGATACGCAT
ATCCAGGTATTTACCAGCGAGCTTGATGACAATAAATCATCATCGAAAACGGACTGGTGGAGTAATAGCTCCACTATTAC
CATGAGAGGTATGGGGGAAATTGGCGCTACGGTATCATTAATCGTGGCAGGGGTCACGCTGGCAACCGCTGTCGTTGCGG
CTAATGGGCAGTGGGAATTATCGACCGATCAGCTTCCGGAAGGGAAATACGATATCACTTTGAGTATTGAGGATAACGCA
GGCAACCGTAAGGAAGAGGTACATGAAATATTTATTGATCGAACGCCGCCAAACGCTCCGGTCGTAACTTATTCAGACAT
TGTCAACGATCTAATTATTATGCAGGGAACGGCGGAAGCCAAATCTCAGCTAATAATAACCGATAGTAATGGGAATACTT
ATACGTTAACCGTTCCTGATAATGGTAAATGGAGTATGGCGATCCCGTATCCATCGGAAGGGAAGTTTACCATTACGAGT
GTGGATGCGATTGGTAACCGGAGTGATGATGTCCCTCTCGATATCATGAAAGAGGTTCCCGTTATTTCATTATCTCCAGA
CTCAGACAGTGGTACGGTGGGCGATAATATTACGCGCGATAAGCAACCTACCTTTATTATCGGGAATCTGGAAAGCGATG
TTGTGGTCGTTCAGGTCGATATCAATGGGACAGTATATAATGCTGAAAAAAATGCCGATGGCGTTTGGTTCTTTACGCCA
GGTACACCGTTAGCTGATGGTTCCTATACGATATCGGTAATCGCAAGCGATGCCGCGGGTAATCAGAAAAACTCGTTACC
CATTACCGTTACGATCGACAGTACGCTGACGGTGCCGGAGATTGCGTTGGCAGCAGGTGAAGACAATGGCGTTTCAGACA
GCGATAACGTGACGAATCACACCCAGCCTAAGTTCACGCTGCAGCATATTGATGCTGATGTGACCGGGGTGACCGTAAAC
GTGACGCATAACGGCGTGACAGACACCTATCAGGCGACGCAAGGCGCGGATGGCTGGACCTTCACGCCGCCAGCCGCCTG
GAATGATGGTACCTACACGCTGAGCGTGACGGTGGTGGATCGCGCGGGGAACTCACAGCAATCTGCTTCGCTAGCGGTGA
CGGTTGACTCAACGGTGACGGTAACAGCGGATAGCCAGCATGACGATGCGAGCGATGACGCCACGCCAACAGCGGTTACT
CCACCGGAGTCTGAAACAGTGAATGCCGAAAGCGATACGCATCTTCGTACAGTGCCGTCTGCGGCGGAAGAAAGCGTGGT
GAAGGAGACGGCCTATAGTATTACATTGTTAAACGCTAACTCTGGGGATGAAATAGATCGTTCAATTAGTCAGACACCTT
CTTTTGAAATATCAGTACCTGAGAATATTGTTAATGTCAGTGTTATGTTTGAAGGAGAAGAGTTTACTCTGCCGATAACT
AACCAGAAAGCAATATTCGAAGTTCCGCTATCTTTGGAAGATGGTGAATATACTATGGACGTGAAATTCATTGATAAGGA
CGATGATTTCCTGATTAAGGAAAAAACATTCTCAGTCGATCACTCCTCGGCGGATATTGTGAACGCAATGAATGCAAGAG
GAAAGACCGAGGATGATATTAATGATTCCCCTTCCACGAGTTCTGTAGGGCACAACAATAACGGCGCTATTGATGTTTTC
GCCGTTAATGAAGTTACGCTACCTGTAGATAATCAAGAAGAACACGCATAA

Protein sequence :
MEDLAGNVKESAPLEVRIDTTTTINNIVLLNDTGVQNDQLTNVAKPSFRIDVPGDVVQVRVTLDGGANWNVIRKNADGQW
IFDSPNTLVDGTYTLRVEATDEAGNIANKDLVFNIDTNIQVPTIALDAGQDTGANTADNITNISRPTFTIGNVDPDVIKV
VVTIDGHDYNATKVGAGWQFTPGNAIPDGSYNITVTVEDKAGNTATSKPLPVVIDTTAEIESVTLVTDSGDSDVDNITKV
DKPQFSIVTADDITHVRVKIDNAANWIELTKGGDGRWIFNVGSALPDGQHTLLVDVTDIAGNVAQETLQFTIDTTLREPT
IVLDPTHDTGDDTNDNLTRINKPVFIIGNVDNDVSHIVVHLDGRDYTIENKGGNLTFTPDQPLSDGQHTISVTVTDIAGN
TKTSAELQIEIDTQVQIDSVTLTTDSGVNDHDNVTNATRPSFEIATPDDVTSVLVSFDGVNWTPISKNAAGQWQFTAGSA
LSDGHYTLHVQATDRAGNTANSTLGFTVDTQIDGLSVVMLDDAGKDSTDGITNITSPRFEISAREQLQSVTVILNGKSST
LTQGAGNKWLFTPDTPLVDGTYKIEIVAEDIAGNKISKEVSFTIDTIVSDPSIDLLDADDTGESAVDNITSVTTPRFVIG
NVPADIDTVVIRINGVSYPVTANGNNLWEFQVPVALNDGVYEAVVVFRDIAGNTSETKLPFTIDTTTSVSVRMEPASDTG
SSNSDNLTNKQNPKFEGTAEPNAKLVITIVDDKSGREVLKHTITVGADGNWSVTPNILPDGMYTINVVATDVAGNTAQTQ
ERFTIDTVTIDPTIRLSDPSIDDQYEATSLRPEFKGLAEAFSTIMIQWDGKVVGSANANANGEWSWTPPSVLAPGSYVVS
IVAKDKAGNESSQVDFPVVIPVIDVTPPTIKLSEESDSGALGDFTTNNKTPTLIGSTLPNTIVSIYVDGVKVGEATADTA
GRYTFQLSEMKDGHYVVQVGIVNPRDNSELRSTAVDVTIDTEVAELVWNISGMHEGGYINTVTPEIGGTSEPNSKITIFV
NGVEKAIAYTTGAGHWGVVLPALGNDGNYELTFKVEDVAGNIREFGPQNVILDTVISPLTVVLREADDSGKVGDWITNKS
HVTIDGTAEAGSTLTIRNPQGVVIATLVVGNDGRWSAELDLREGSNAFVVVSEDKAGNGQQKDILIEHDTQIEISDISLS
RDTNSGDKYDLITNNKSPVLVAMTDPGATVQVYINGVLQGTVEASSSGNISYTMPANSADGEYQVQFVATDTAGNRVESA
ITTVTIDSQIAVFDIDEDSLPALSNNRALSVSGVGEAGSQVSIFVDGKLVNVVMVEADGSWRAPILLQDDGTFNIHFSIT
DVAGNTQVSKNYSVDVDSSTDFPTLNLEDASNSGSLDDLITNHNKPVLVGTAEAGATIHIYVDEKIVANVLVLEDGTWSY
QFDNALKDGEYSIRVVAEDPAGNTAESPRLLVTIDTSTFIDNPAMVAGSDNGIFSNDSITSQTRPTFSIFGEMNQSVQIF
IDGVLVDTITVTDRNQVYRPESPLGDGSHSIYYVITDKAGNTATSKTLNFTIDTFNTTPVAIDSIGGQTLAEMTGSDGKI
YITDTTRNLLFSGSAEPNSKIEIIINGLNVGEVWVNEKGHWQMPVNPLYFTEGQLDITVKSTDRAGNVNQEKYSIWVDTH
IQVFTSELDDNKSSSKTDWWSNSSTITMRGMGEIGATVSLIVAGVTLATAVVAANGQWELSTDQLPEGKYDITLSIEDNA
GNRKEEVHEIFIDRTPPNAPVVTYSDIVNDLIIMQGTAEAKSQLIITDSNGNTYTLTVPDNGKWSMAIPYPSEGKFTITS
VDAIGNRSDDVPLDIMKEVPVISLSPDSDSGTVGDNITRDKQPTFIIGNLESDVVVVQVDINGTVYNAEKNADGVWFFTP
GTPLADGSYTISVIASDAAGNQKNSLPITVTIDSTLTVPEIALAAGEDNGVSDSDNVTNHTQPKFTLQHIDADVTGVTVN
VTHNGVTDTYQATQGADGWTFTPPAAWNDGTYTLSVTVVDRAGNSQQSASLAVTVDSTVTVTADSQHDDASDDATPTAVT
PPESETVNAESDTHLRTVPSAAEESVVKETAYSITLLNANSGDEIDRSISQTPSFEISVPENIVNVSVMFEGEEFTLPIT
NQKAIFEVPLSLEDGEYTMDVKFIDKDDDFLIKEKTFSVDHSSADIVNAMNARGKTEDDINDSPSTSSVGHNNNGAIDVF
AVNEVTLPVDNQEEHA
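
Consistency check :
The DNA sequence above begins with GTG, while the protein sequence begins with M. This is expected for a bacterial coding sequence, where an alternative start codon such as GTG is still read as the initiating methionine. The following is a minimal sketch, not part of the original record, of how the two fields could be cross-checked. The function name translate_cds and the set of accepted start codons are assumptions made for illustration; the codon table is the standard genetic code.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: bacterial start codons, each read as methionine at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence (hypothetical helper for this sketch).

    Whitespace and line breaks are removed first, so the DNA block can be
    pasted in as it appears in the record. Translation stops at the first
    stop codon. Ambiguous bases (e.g. N) are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("CDS does not begin with a recognised start codon")

    protein = ["M"]  # the start codon is translated as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the DNA sequence above, followed by a stop codon.
example = "GTGGAAGATCTGGCGGGGAATGTAAAAGAA" + "TAA"
print(translate_cds(example))  # MEDLAGNVKE
```

If the record is internally consistent, passing the full DNA sequence field to translate_cds should reproduce the protein sequence field once its line breaks are removed.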