Gene Information

Name : SARI_01743 (SARI_01743)
Accession : YP_001570771.1
Strain : Salmonella enterica RSK2980
Genome accession: NC_010067
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 1669131 - 1673285 bp
Length : 4155 bp
Strand : -
Note : 'KEGG: nma:NMA0905 5.0e-94 iga; IgA1 protease K01347; COG: COG3468 Type V secretory pathway, adhesin AidA; Psort location: nuclear, score: 23'
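
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is an assumption, although it agrees with the stated length. The function name `feature_length` is illustrative and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
length = feature_length(1669131, 1673285)
codons = length // 3  # codon count, including the terminal stop codon

print(length, length % 3, codons)
```

For this record the script prints `4155 0 1385`: the length matches the Length field, it is a whole number of codons, and 1385 codons less one stop codon gives a 1384-residue product.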

DNA sequence :
ATGAATAAAATATATGCACTGAAGTATAGTGTCAGACAAGGTGCTCTTGTGCCTGTTTCGGAACTGGCAACCCATGCAAA
AAAATCATCTCGTACAGGTCTGATAAAAAGACTTATTCCATCGGTTATTATTAACACCCTTCTTTTAGGATATTCAGTCA
CATCGCTGGCCTCAGTCGTCAGGTATGATATCCCCTACCAGACAATAAGAGACTTTGCGGAGAATAAAGGGCAATTTACG
CCGGGTAGTATGAATATTCCTGTTTACAGTAAAACAGGCACAGTTATTGGTTATCTGAATAATGCTCCAATGCCCGATTT
CAGTAGCGCCAACCACCAGTCAGCGGTTGCCACACTTGTTTCGCCACAGTATATCATGAGCGTAAAACATAATGGCGGCT
ATCAGAGTGTCAGTTTTGGTGATGGCGAAAATGGGTATCGTCTTGTTGACAGAAATAATCAGCCTGGACGAGATTTTCAT
GCCCCACGTTTAAATAAATTAGTTACCGAAGTCGAGCCTTCATTGATGACGCAGTCGGGGATGGTGTCTGGGGCTTATAG
TGACAAAAATCGTTATCCCGCGTTTTACCGTGTAGGGTCCGGAACCCAGGAAATCAAAGAGACAGATGGCCATATCATCT
CAATTTCTGGGGCATACAGTTATCTGACCGGCGGTACGGCAGGTGCGCCGGGATCTTATAATCAGGGCCAGATGATCAGT
ACTAATACTAATAAAGAGTTGTATAGCCTGGCGCAGGGGCCGATGGGAACACACCCACGCGTTGGTGATAGTGGTTCACC
TTTATTCGCCTATGATTCTGTTTTACAAAAGTGGGTCATTGTCGGGGTGGATAGCTCCGGCGGCGGCGGTGGAACTAACT
GGGCAGTGGTTGATGCGAATTTTGTAAACCAGGCCATTCAGGACGATACCGATGCGCTCGTAACTTTTATGGCAGACCAG
GGCCCTCTGCGCTGGGCGTTTAATTCAACCGATGGTACAGGAACGCTTATCCAGCAAGAGACTGTCTATCAGATGCATGG
TCAGAAAGGCACAGACCTGAATGAGGGGAAAAATCTGGTATTCAATGGCGCTGACGGCCAGATTGCGCTGGAAGACGCGG
TTAACCAGGGGGCGGGGGCGCTGGCTTTTAATGGTAATTATACGGTATTCACCACTAACGGCTCCACCTGGAGGGGCGCA
GGCCTGGACATTACCCAAGATGCGGAGGTGAGCTGGCAGGTGAACGGTGTTCAGGGCGACAATTTGCATAAGATTGGCGA
GGGCGTGCTGAAGGTCAATGGAACCGGTATTAACCCAGGCGGACTGAAGGTTGGGGACGGTACGGTTATTCTGGCTCAGC
GTCCTGATGAGGACGGAAAGGTTCAGGCCTTCAGTTCGGTGAATATCGCCAGTGGCCGACCAACGGTGATCCTTACCGAT
AGCCGGCAGGTGAACTCGGATAACATTAGCTGGGGCTTTAGGGGAGGCCGGCTTGATATAAACGGCAATGATGTCACTTT
TCATAAAATTAACGCCGCGGATAATGGCGCGAATATCATTAATACCAGTGATACCTTTGCTACCGTTTCAATTAAGCCCC
TCACGGACATGACGGTTACTATTAATGACTGGGATAAAAATAAGCCTTCCGGCGGTGCTGCCGGGTTATTGTATAAATAT
AATAACATTTACACCCATACAGTAGATTATTTTATCCAGAAGCGAAAAGGGTATGGATATTATCCTGTAAATCAGTCAGA
CAACGACAGTTGGGAGTATGTAGGACACAATGAGATGCAGGCTATTGAACGGGTAAAATCGCGGCGACCTATTGATGATA
GAATTTATCACGGTAATATCGTAGGCAATATTCATCTTAATATTGATACCTCTCACAGTAACGGCGGCGTTATTTTTGAC
GGCAATATCGATACGCCTGATGGCGAGTTGATGCAGTCTGGCGGCCAACTGACTTTCCAGGGCCATCCGGTTATCCATGC
CTATAACGGCAAATGGTTGACTGATAAGCTTAAATCACTTGGTGATGACTCTGTCAGAAATCAACCAACGTCTTTTGACC
AGCCTGACTGGGAGAACCGGACATTTCATCTGAAAACATTAGTACTGAAAAATACCTATTTCGGCCTGGCCAGAAACGCG
TCCCTGAACGGGAATATTGAGGCTGTTCATTCGTCCGTAACGTTGGGTACGCCGAATTTGTATATTGACCTGAATGACGG
TAATGGCACAAAAGTAACGCCTCAAAAGGGAATCTCGGTTGCCGGACAAGAATCGGATATGAGCCGCTATGCCGGGAAAG
TGACGCTTAGCGAGCAGTCAACTCTCGACGTGCGTGAAATTTTCACCGGCAGTATCCAAAGCCAGGACAGCGACGTCACC
GTTTCCTCGCGTCATGCGAAGCTGGATGATTACAGCCGGTTCGGCAACACGTCACTGACTCTTCAGGAAGGGGCCAGATT
AACCGCCACCGGCGGGTGGTGGAGTGATTCAGACGTCATTGTCGGACCAGCCGCTACGCTGAGCCTGACAGGTACGCCCG
CAGCTAGCCTGCCAGGGCAGATGAGTCCGGCGTTTTATTCCACCAATTATGGTGCGGGTTATCAACTGGGGGCCGCTAGT
GAATTGCAATTTTCTCCTTACACCTTTGTAACCGGCGATATCCGGGCAGCGGGAGATGCCCGTATCGCCATTGGTAGTAG
TGATAATGTCGTCCTGGCAGATAATCTACCCCTGGAAGAGCAGATGATGTACGGTTTGTTTAATGGTTTCCGTAACGTCT
ATTCGGGTAATGTTAGCGCCCCGCAAGGGCGAATGGCGATGACGGATACGCAGTGGCAGATGCCCGGCGATTCTCGTATA
GGGGCACTGCGTATGACACGGTCGCTGGCCGGCTTTACCGGGCGCGGATTTCACACCCTGGCTACCGACACATTACAGGC
CAACCAGTCTGCTTTTGCGCTCAGAACGGACCTGAAGGACAGTGACAAGATTGTAGTAAACCAGAACGCAGAGGGCCGGG
ACAACACCCTGTTTGTGAACTTCCTGAAAAAACCGTCCGGGAAAGAATCCCTGAATATTCCACTGGTCAGCGCCCCGGCG
GGGACAAGTCCAACGATGTTTAAGGCCGCAGAACGGGTGACCGGGTTTAGTCTGGTGACGCCGACCCTGCACACGACAGA
GCAGGATGGCAAAATACAGTGGATACTGGACGGGTTTAAAACCGCGCCGGACAAAGGGACAGCCACGTCGGCCAACAGTT
TTATGGGCATGGGGTATAAAAACTTCATGACCGAAGTGAACAATCTGACCAAACGTATGGGTGATCTGCGGGATACTCAG
GGCGAGGACGGGATGTGGGTACGTATCATGAATGGTGCCGGAACCGGTGACGCCGGATATTCTGATCGCTACACCCATCT
GCAAACGGGGTTTGATAAAAAACATCGGTTGCCAGGGGCTGACTTATTCACCGGTGTCCTGATGAGTTATACCGACAGCA
GCGCCAGCGGTCGGGCCTACAGTGGCGACACACATTCGCTCGGCGGCGGGTTGTACGCCTCCGTGATGTTTGACTCGGGA
GCATATATGGATGTTATCGGCAAGTATATCCACCATGATAATGACTATAACGCCGGATTTGCCGGTCTGGGTAAACGGAA
TTACGGTACACACTCATGGTATGCCGGTCTGGAGGGCGGATATCGCTATCATCTCACAGAAAGCCTGTATATTGAGCCTC
AGGCGGAACTGGTTTACGGAACCGTGTCCGGAACGACGCTGAAATGGAACGATAATGGCATGGATGTGTCGATGCGCAGC
AAGACGTATAACCCGTTGATAGGGCGTACAGGCGTGGCCCTGGGGAAAACGTTCAGTGGAAAGGACTGGAGCGTTACGGC
CCATACTGGCGTGGATTACCAGTTTGACCTGGTAACCAATGGCGAGAAGGCGCTACGTGATGCATCCGGCGAGAAACGTT
TTACCGGTGATAAGGACAGCAGAATGCTGTACAACGTGGGCCTGAATGCGCAGGTGAAGGACAATGTACGTTTTGGCCTG
GAGCTGGAGCAGTCGGCATTTGGCAAATATAATGTTGACCATGCCATAAACGCTAACTTCCGCTATATGTTCTGA

Protein sequence :
MNKIYALKYSVRQGALVPVSELATHAKKSSRTGLIKRLIPSVIINTLLLGYSVTSLASVVRYDIPYQTIRDFAENKGQFT
PGSMNIPVYSKTGTVIGYLNNAPMPDFSSANHQSAVATLVSPQYIMSVKHNGGYQSVSFGDGENGYRLVDRNNQPGRDFH
APRLNKLVTEVEPSLMTQSGMVSGAYSDKNRYPAFYRVGSGTQEIKETDGHIISISGAYSYLTGGTAGAPGSYNQGQMIS
TNTNKELYSLAQGPMGTHPRVGDSGSPLFAYDSVLQKWVIVGVDSSGGGGGTNWAVVDANFVNQAIQDDTDALVTFMADQ
GPLRWAFNSTDGTGTLIQQETVYQMHGQKGTDLNEGKNLVFNGADGQIALEDAVNQGAGALAFNGNYTVFTTNGSTWRGA
GLDITQDAEVSWQVNGVQGDNLHKIGEGVLKVNGTGINPGGLKVGDGTVILAQRPDEDGKVQAFSSVNIASGRPTVILTD
SRQVNSDNISWGFRGGRLDINGNDVTFHKINAADNGANIINTSDTFATVSIKPLTDMTVTINDWDKNKPSGGAAGLLYKY
NNIYTHTVDYFIQKRKGYGYYPVNQSDNDSWEYVGHNEMQAIERVKSRRPIDDRIYHGNIVGNIHLNIDTSHSNGGVIFD
GNIDTPDGELMQSGGQLTFQGHPVIHAYNGKWLTDKLKSLGDDSVRNQPTSFDQPDWENRTFHLKTLVLKNTYFGLARNA
SLNGNIEAVHSSVTLGTPNLYIDLNDGNGTKVTPQKGISVAGQESDMSRYAGKVTLSEQSTLDVREIFTGSIQSQDSDVT
VSSRHAKLDDYSRFGNTSLTLQEGARLTATGGWWSDSDVIVGPAATLSLTGTPAASLPGQMSPAFYSTNYGAGYQLGAAS
ELQFSPYTFVTGDIRAAGDARIAIGSSDNVVLADNLPLEEQMMYGLFNGFRNVYSGNVSAPQGRMAMTDTQWQMPGDSRI
GALRMTRSLAGFTGRGFHTLATDTLQANQSAFALRTDLKDSDKIVVNQNAEGRDNTLFVNFLKKPSGKESLNIPLVSAPA
GTSPTMFKAAERVTGFSLVTPTLHTTEQDGKIQWILDGFKTAPDKGTATSANSFMGMGYKNFMTEVNNLTKRMGDLRDTQ
GEDGMWVRIMNGAGTGDAGYSDRYTHLQTGFDKKHRLPGADLFTGVLMSYTDSSASGRAYSGDTHSLGGGLYASVMFDSG
AYMDVIGKYIHHDNDYNAGFAGLGKRNYGTHSWYAGLEGGYRYHLTESLYIEPQAELVYGTVSGTTLKWNDNGMDVSMRS
KTYNPLIGRTGVALGKTFSGKDWSVTAHTGVDYQFDLVTNGEKALRDASGEKRFTGDKDSRMLYNVGLNAQVKDNVRFGL
ELEQSAFGKYNVDHAINANFRYMF
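
The DNA and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, assuming the standard genetic code and assuming the DNA is listed in coding orientation (it starts with ATG even though the Strand field is "-"). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace and line breaks are ignored; a codon containing anything
    other than A, C, G or T raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the DNA sequence in this record.
print(translate("ATGAATAAAATATATGCACTG"))
print(translate("TATATGTTCTGA"))
```

The two calls print `MNKIYAL` and `YMF`, which match the start and end of the listed protein sequence. Passing the full DNA block to `translate` would allow a complete comparison.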

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
pic | NP_838464.1 | serine protease precursor | Virulence | SHI-1 | Protein | 0.0 | 52
pic | NP_708747.3 | serine protease | Not tested | SHI-1 | Protein | 0.0 | 52
she | AAB58244.1 | mucinase | Virulence | SHI-1 | Protein | 0.0 | 52
pic | AAK00464.1 | Pic | Virulence | SHI-1 | Protein | 0.0 | 52
vat | AAO21903.1 | vacuolating autotransporter toxin | Virulence | Not named | Protein | 0.0 | 51
vat | YP_851472.1 | vacuolating autotransporter | Not tested | PAI III APEC-O1 | Protein | 0.0 | 51
unnamed | CAD66214.1 | putative hemoglobin protease | Not tested | PAI III 536 | Protein | 0.0 | 51
unnamed | CAC39286.1 | hypothetical protein | Not tested | LPA | Protein | 0.0 | 46

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
SARI_01743 | YP_001570771.1 | hypothetical protein | VFG0903 | Protein | 0.0 | 52
SARI_01743 | YP_001570771.1 | hypothetical protein | VFG0861 | Protein | 0.0 | 52
SARI_01743 | YP_001570771.1 | hypothetical protein | VFG0635 | Protein | 0.0 | 52
SARI_01743 | YP_001570771.1 | hypothetical protein | VFG0904 | Protein | 0.0 | 51
SARI_01743 | YP_001570771.1 | hypothetical protein | VFG1689 | Protein | 0.0 | 51