Gene Information

Name : CFSAN002050_05940 (CFSAN002050_05940)
Accession : YP_008259134.1
Strain : Salmonella enterica CFSAN002050
Genome accession: NC_021818
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 892899 - 896009 bp
Length : 3111 bp
Strand : +
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.

DNA sequence :
ATGAAACGAGGTCTGAACACCAGCTACCGGCTGGTATGGAATACGGTACAGGGGGCATTTGTGGTGGTCTCCGAACTGGC
AAAAGCAAACGGTAAACGTGCCAGCGCCGTGCTGGTGGCTGCGGGGCTGGTGGTCAGCCCGATAGTGATGGCGGAGGAGA
CAACCAGCGTTCCTTTTGGTGAAGTGGTTAACGGCGCGGTCATCGACAATCACGACACACAACGGGTGTCTGGTGAAACC
AACGACACGGTGATTAATACCGGCCTTGGCCCTGACAGTGAGGGCAATGGTGGAGGTCAGGGTGTTGAAGATGGCGGTGT
ATCAAACCGAACAACTATCAATGATGGTGGATTACAGTGGGTGGATGCTGGCGGCCTGGCCACTGATACCACGGTAAACT
CCGGTGGCGGTCAGTCTGTCGCGGGGCAAACTAAAGGGACGGTTCTGAATGGGGGTGAGCAGTATGTTCATTCAGGTGGT
GTTGCCGATGGTACGGTGATCAATACTGGTGGCTATCAGACCGTCAAAGCGGGAGGAACAGCAGCAAACACCGTTGTTAA
TACGGGTTCAGAAGGTGGTCCCGATTCAGAAAATGATGATGCCGGTCAGTGGGTTTATGGGACGGCTCAGAATACCACTA
TCAATGCCAATGGTCGCCAGGTTGTCCAGGAAGGCGGTGTATCTACTGATACTGTAATTAATTCAGGCGGCGATCAGAGC
GTACATGGTAATGCAGAAAACACTCAGCTTGATGGCGGGTATCAGTATGTTCATTCAGGTGGAGTTGCTGATGGCACAGT
AATTAATGAGGGAGGGTATCAAACGGTTAAAGCCGGAGGAACAGCGACAAATACCGTTGTTAATACAGGTTCTGAAGGTG
GCCCGGATGCAGAAAATAGTGACGGCCAGTGGGTCTACGGAACCGCAGAGAATACCACCATCAATGCGAATGGACGCCAG
GTTATTAATGAAGGGGGAGTTGCCACAACGACAACCATTTATTCAGGCGGCGATCAGAGCGTACATGGGTATGCAGAAGA
TACGGTACTCGATGGTGGGTATCAGTACGTTCATTCAGGTGGCATGGCGGTTAATACAACCATTAATTCCAGTGGCTGGC
AGGTGGTCAAAGAGGGCGCTCAGGCGCGCGATACCACGGTAAACAGCAACGGCATTCTCCAGGTCAATGCGGACGGCGTG
GCCTCAGATGTCACCATCAACGCCGGTGGTGCGCTGGTCACCAATACCGACGCCGATGTTTCCGGCACCAACCGGCTGGG
CGGTTTTAGCGTGGATTCGGTTACCGGCAGCGCCAGCAACGTGGTGCTGGAAAACGGCGGGCAGCTCAACGTGTTCACCG
GTGATTCTGCCAGTGCCACCACCGTGGATAATGGCGGCACGCTCAGCGTAGCGGCTGGCGGTACGGCGACTGATACCACG
ATTTACGCAGGCGGGGCGTTAATCGCCGATACCCGCTCTACGGTCAGCGGCACCAACGCGCAGGGCGAGTTCAGCATTGA
CGGCGCAACCGGTCAGGCCAGCAGCCTGTATCTGGAAAACGGCGGCTACTTTTCCGTACGCTCCGGCGGGAATGCGGCAG
ATACTCGCGTTGGCAGTGGCGGTGAACTGAACGTGGAGGATGGCGGGACGTTGAGCGGCACGACCCGTCTGACCGAAGAC
GCCACCCTGACGCTTACCGGCGATGCCGTCAGCACCGGCACCATCGACAGCGCCGGGACGATCACTTTTGCGCAGGCTGA
TGATACCGGTTTTACGCCGCATACCCTCACGACCACCAGCCTGGTGGGCAACGGCGGCACCATCAACATGTCCGTTAACC
TCAACGATCCGGCGTTCCCGACCGATATGCTGATCATTGACGGCGGGCAGGCGACGGGCACCACCAACCTGAATATCACC
AATACCGGCAGCGTCGGGCTGGGGCTTGCCACCACGGGTGAGGGCATCAAAGTTGTGGATGCTGTGAATGGCGCAACCAC
CGACGATAACGCTTTCGCTCTCAGCCAGCCGCTTCAGGCGGGCGCATACAATTACACCCTGGATCACGGCACGACGGATG
AGGACTGGTATCTGTCCAGCGAAGCGGACTATCGCGCTGAAGCCGCTCTCTACTCCTCGATGCTCACGCAAAGCATGGAT
TATGATCGCCTGCTGGCGGGCAGCTACAACCAGCGCCGCGCGGCGCAGGGTGGTCAGGGTGTGTGGGCGCGGATTCAGGG
CGGGCACATTGGGCATGACGATAATGGCGGCATTGCCGGAGGCGACACGCCGGAAAGCAGCGGCAGCTACGGTTTTATCC
AGACAGGCGCTGACCTGCTGCGCAGCGACGCAGGTCCGGTCTCGCTGACTACCGGGATCTACGGCGCGGCAGGTTTGTCC
TCGGTTGACGTCAAAAACGACGATCACTCTGATGCAGGCGACGTGCGCGACAACGTGTACAGCCTGGGCGGCTACCTGAC
GATGGTGCATAACGCCAGCGGCGGTTGGGTGGATATGGTTGCGCAGGGCTCGCGTCACTATCTTGAAGCGACGTCTGATA
ATAATGACTTTAACACTCACGGCTGGGGCTGGCTGGGATCGCTGGAAACGGGGCTTCCGCTCAGCGTCGGCCATGGCCTG
GTGCTGGAACCGCAGATCCAGTACATCTGGCAGGGGCTTTCACTCGAGGATGGTCACGACAACGGTGGCTACGTGAACTT
CGGCGACGGCAGCGCTCAGCATCTCCGCGCCGGTCTGCGCTTCGGCAATATGTCAGAAATGGCGTTTGGCCGGGGAAGCT
CATCGCAGGCAGGATTCGGCGACAGCATGAAACACCGCGTCAGCGAACTGCCGGTAAACTGGTGGGTGCGCCCCTCGGTT
ATTCGCACCTTCAGCTCTGACGGCGACATGAGTATGGGGACCGACACGGCGGGCAGCAATGTGATCTTCAGCCCGTCGCA
GGATGGCACCTCGCTGGATTTGCAGGTGGGCGTTGAAACCCTCGTGCGCCAGAACGTCAGCCTGGGCGTACAGGGCGGCT
ACACCCGCAGCGTCAGCGGGGAAAGCGCCGACGGATATAACGGTCAGGCCACGCTGAAGGTGACATTCTGA

Protein sequence :
MKRGLNTSYRLVWNTVQGAFVVVSELAKANGKRASAVLVAAGLVVSPIVMAEETTSVPFGEVVNGAVIDNHDTQRVSGET
NDTVINTGLGPDSEGNGGGQGVEDGGVSNRTTINDGGLQWVDAGGLATDTTVNSGGGQSVAGQTKGTVLNGGEQYVHSGG
VADGTVINTGGYQTVKAGGTAANTVVNTGSEGGPDSENDDAGQWVYGTAQNTTINANGRQVVQEGGVSTDTVINSGGDQS
VHGNAENTQLDGGYQYVHSGGVADGTVINEGGYQTVKAGGTATNTVVNTGSEGGPDAENSDGQWVYGTAENTTINANGRQ
VINEGGVATTTTIYSGGDQSVHGYAEDTVLDGGYQYVHSGGMAVNTTINSSGWQVVKEGAQARDTTVNSNGILQVNADGV
ASDVTINAGGALVTNTDADVSGTNRLGGFSVDSVTGSASNVVLENGGQLNVFTGDSASATTVDNGGTLSVAAGGTATDTT
IYAGGALIADTRSTVSGTNAQGEFSIDGATGQASSLYLENGGYFSVRSGGNAADTRVGSGGELNVEDGGTLSGTTRLTED
ATLTLTGDAVSTGTIDSAGTITFAQADDTGFTPHTLTTTSLVGNGGTINMSVNLNDPAFPTDMLIIDGGQATGTTNLNIT
NTGSVGLGLATTGEGIKVVDAVNGATTDDNAFALSQPLQAGAYNYTLDHGTTDEDWYLSSEADYRAEAALYSSMLTQSMD
YDRLLAGSYNQRRAAQGGQGVWARIQGGHIGHDDNGGIAGGDTPESSGSYGFIQTGADLLRSDAGPVSLTTGIYGAAGLS
SVDVKNDDHSDAGDVRDNVYSLGGYLTMVHNASGGWVDMVAQGSRHYLEATSDNNDFNTHGWGWLGSLETGLPLSVGHGL
VLEPQIQYIWQGLSLEDGHDNGGYVNFGDGSAQHLRAGLRFGNMSEMAFGRGSSSQAGFGDSMKHRVSELPVNWWVRPSV
IRTFSSDGDMSMGTDTAGSNVIFSPSQDGTSLDLQVGVETLVRQNVSLGVQGGYTRSVSGESADGYNGQATLKVTF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed AAL08472.1 putative autotransporter Not tested SRL Protein 0.0 61
unnamed CAE85197.1 antigen43 protein orthologue Not tested PAI V 536 Protein 0.0 61
Z1211 NP_286746.1 adhesin Not tested TAI Protein 0.0 61
Z1651 NP_287154.1 adhesin Not tested TAI Protein 0.0 61
aidA ADD91708.1 AidA Not tested PAI-I AL862 Protein 0.0 49
unnamed CAD66200.1 hypothetical protein Virulence PAI III 536 Protein 0.0 49
sap AAK00474.1 Sap Not tested SHI-1 Protein 0.0 48
SF2991 NP_708765.1 outer membrane fluffing protein Not tested SHI-1 Protein 0.0 48
flu CAI43838.1 antigen 43 Not tested LEE Protein 2e-178 48

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
CFSAN002050_05940 YP_008259134.1 hypothetical protein VFG1063 Protein 0.0 61
CFSAN002050_05940 YP_008259134.1 hypothetical protein VFG1675 Protein 0.0 49
CFSAN002050_05940 YP_008259134.1 hypothetical protein VFG0655 Protein 0.0 48