Gene Information

Name : pic (S3178)
Accession : NP_838464.1
Strain : Shigella flexneri 2457T
Genome accession: NC_004741
Putative virulence/resistance : Virulence
Product : serine protease precurser
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 3059914 - 3063126 bp
Length : 3213 bp
Strand : -
Note : residues 1 to 1070 of 1070 are 96.16 pct identical to residues 304 to 1373 of 1373 from GenPept : >gb|AAK00464.1| (AF200692) Pic [Shigella flexneri 2a]

DNA sequence :
ATGCAGGACGATTTCGATGCCCCCGTAGACTTTGTTTCCGGACTGGGCCCCCTGAACTGGACATACGACAAAACATCAGG
CACAGGTACCCTGAGCCAGGGCAGTAAAAACTGGACCATGCACGGGCAGAAAGACAATGACCTCAATGCCGGTAAAAATC
TGGTATTCAGCGGGCAGAATGGTGCAATTATCCTGAAAGACAGTGTGACTCAGGGTGCCGGTTATCTCGAATTTAAAGAC
AGTTACACCGTATCTGCTGAATCCGGAAAAACATGGACGGGTGCCGGCATTATTACTGACAAGGGGACGAATGTAACCTG
GAAGGTCAACGGCGTTGCCGGTGACAACTTGCATAAGCTGGGGGAAGGAACCCTGACCATAAACGGAACAGGTGTAAACC
CGGGAGGACTGAAAACGGGAGACGGTATCGTTGTACTTAACCAGCAGGCAGACACTGCAGGTAATATCCAGGCCTTCAGT
TCAGTGAACCTCGCCAGCGGACGTCCGACCGTGGTGCTCGGGGATGCCCGTCAGGTCAATCCGGATAACATTTCATGGGG
ATACCGGGGAGGTAAGCTTGACCTTAATGGTAATGCCGTTACCTTCACCCGACTGCAGGCTGCTGATTACGGGGCGGTGA
TTACAAATAATGCACAGCAAAAATCCCAGCTTTTACTGGATCTTAAGGCTCAGGATACAAATGTCAGTGAACCGACGATT
GGAAATATATCCCCCTTTGGTGGTACCGGCACTCCAGGAAACCTGTACAGCATGATACTCAACAGCCAGACCCGCTTCTA
TATTCTGAAATCTGCCAGCTATGGTAACACTCTGTGGGGGAACAGCCTGAATGATCCGGCTCAGTGGGAGTTTGTTGGCA
TGAACAAAAACAAAGCAGTTCAGACAGTAAAAGATAGGATCCTGGCCGGGCGGGCAAAACAACCCGTTATCTTTCATGGT
CAGCTGACCGGGAATATGGATGTCGCCATTCCACAGGTGCCGGGGGGAAGAAAGGTAATCTTTGATGGTAGCGTGAACCT
GCCGGAAGGTACCCTGAGTCAGGACAGTGGCACCCTGATATTCCAGGGACATCCGGTTATCCATGCCTCCATCAGTGGCA
GTGCACCGGTCAGCCTGAACCAGAAAGACTGGGAAAACCGTCAGTTTACAATGAAAACACTGTCGCTGAAAGACGCTGAC
TTCCATCTTTCACGTAACGCCTCGCTGAACAGTGACATTAAGTCGGATAACAGCCATATCACACTGGGAAGTGACAGGGC
ATTTGTGGATAAAAATGACGGAACAGGAAATTATGTCATTCCGGAGGAAGGTACCTCTGTCCCGGACACCGTGAATGACA
GGAGCCAGTATGAAGGGAATATTACGCTGAACCATAACTCAGCCCTGGATATCGGCAGCAGGTTCACCGGGGGGATTGAC
GCTTATGACAGTGCCGTCAGCATCACCTCTCCGGACGTCCTGTTGACAGCCCCGGGTGCTTTTGCCGGCAGTTCACTGAC
AGTGCATGATGGCGGTCATCTTACAGCACTGAACGGTCTTTTCAGCGACGGGCATATTCAGGCCGGTAAGAACGGCAAAA
TCACCCTGAGCGGTACACCGGTTAAAGATACGGCTAATCAGTATGCCCCTGCTGTATATCTGACGGACGGATATGACCTG
ACCGGCGATAACGCAGCACTGGAAATTACCCGTGGAGCACATGCTTCCGGTGATATTCATGCCTCTGCGGCATCAACAGT
TACCATCGGGTCTGACACGCCGGCAGAACTGGCTTCTGCGGAAACGGCTGCATCGGCGTTTGCCGGCAGTCTTCTTGAGG
GCTATAACGCAGCATTCAATGGTGCCATAACCGGTGGCAGGGCTGATGTCAGTATGCATAATGCACTGTGGACTCTGGGT
GGGGACTCTGCCATCCACAGTCTTACCGTCAGAAACAGCCGTATTAGTTCTGAAGGAGACCGTACATTCCGTACCCTGAC
GGTGAATAAACTGGATGCAACAGGCAGTGATTTTGTTTTGCGTACGGACCTGAAAAATGCCGATAAAATTAATGTGACTG
AAAAAGCCACTGGTTCAGATAACAGCCTGAACGTCAGCTTTATGAATAATCCTGCTCAGGGACAGGCCCTGAATATTCCT
CTGGTCACGGCACCGGCGGGAACTTCAGCAGAGATGTTTAAGGCCGGCACCCGGGTGACAGGTTTCAGTCGGGTGACCCC
AACCCTGCATGTTGATACCAGTGGTGGCAATACGAAGTGGATACTGGATGGTTTTAAAGCGGAGGCTGATAAAGCCGCTG
CCGCGAAGGCTGACAGTTTCATGAATGCCGGGTATAAAAACTTCATGACGGAAGTTAACAATCTGAACAAACGTATGGGT
GACCTGCGTGACACAAACGGTGATGCCGGTGCCTGGGCGCGCATCATGAGTGGTGCCGGTTCTGCAGACGGTGGTTACAG
TGATAATTACACCCATGTTCAGGTCGGCTTTGACAAAAAACATGAACTGGACGGTGTGGACCTGTTTACCGGTGTCACGA
TGACCTATACCGACAGCAGTGCAGACAGCCATGCATTCAGCGGAAAGACGAAATCGGTGGGGGGCGGTCTGTATGCTTCA
GCATTGTTTGAGTCCGGTGCCTATATCGATTTGATTGGTAAATATATTCACCATGACAATGATTACACAGGTAACTTTGC
TAGCCTGGGAACGAAACACTACAACACCCATTCCTGGTATGCCGGTGCTGAAACGGGTTACCGCTATCACCTGACAGAGG
ACACGTTCATTGAGCCGCAGGCTGAACTGGTTTACGGCGCCGTGTCCGGGAAAACATTCCGCTGGAAAGACGGTGATATG
GACCTGAGCATGAAGAACAGGGACTTCAGTCCGCTGGTTGGAAGAACAGGGGTTGAACTGGGCAAGACCTTCAGTGGTAA
GGACTGGAGTGTGACGGCCCGTGCCGGAACCAGCTGGCAGTTTGACCTGCTGAATAATGGAGAGACCGTACTGCGTGATG
CGTCCGGGGAGAAACGGATAAAAGGAGAGAAGGACAGCCGGATGCTGTTTAATGTTGGTATGAATGCGCAGATAAAGGAC
AATATGCGCTTTGGTCTGGAGTTTGAGAAGTCAGCCTTTGGTAAATATAACGTGGATAATGCGGTAAACGCGAATTTCCG
GTATATGTTCTGA

Protein sequence :
MQDDFDAPVDFVSGLGPLNWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIILKDSVTQGAGYLEFKD
SYTVSAESGKTWTGAGIITDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGIVVLNQQADTAGNIQAFS
SVNLASGRPTVVLGDARQVNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSQLLLDLKAQDTNVSEPTI
GNISPFGGTGTPGNLYSMILNSQTRFYILKSASYGNTLWGNSLNDPAQWEFVGMNKNKAVQTVKDRILAGRAKQPVIFHG
QLTGNMDVAIPQVPGGRKVIFDGSVNLPEGTLSQDSGTLIFQGHPVIHASISGSAPVSLNQKDWENRQFTMKTLSLKDAD
FHLSRNASLNSDIKSDNSHITLGSDRAFVDKNDGTGNYVIPEEGTSVPDTVNDRSQYEGNITLNHNSALDIGSRFTGGID
AYDSAVSITSPDVLLTAPGAFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNGKITLSGTPVKDTANQYAPAVYLTDGYDL
TGDNAALEITRGAHASGDIHASAASTVTIGSDTPAELASAETAASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLG
GDSAIHSLTVRNSRISSEGDRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMNNPAQGQALNIP
LVTAPAGTSAEMFKAGTRVTGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMG
DLRDTNGDAGAWARIMSGAGSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYAS
ALFESGAYIDLIGKYIHHDNDYTGNFASLGTKHYNTHSWYAGAETGYRYHLTEDTFIEPQAELVYGAVSGKTFRWKDGDM
DLSMKNRDFSPLVGRTGVELGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKD
NMRFGLEFEKSAFGKYNVDNAVNANFRYMF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 0.0 100
pic NP_708747.3 serine protease Not tested SHI-1 Protein 0.0 100
she AAB58244.1 mucinase Virulence SHI-1 Protein 0.0 99
pic AAK00464.1 Pic Virulence SHI-1 Protein 0.0 99
unnamed CAC39286.1 hypothetical protein Not tested LPA Protein 0.0 54
unnamed CAD66214.1 putative hemoglobin protease Not tested PAI III 536 Protein 0.0 49
vat YP_851472.1 vacuolating autotransporter Not tested PAI III APEC-O1 Protein 0.0 49
vat AAO21903.1 vacuolating autotransporter toxin Virulence Not named Protein 0.0 48
espC AAG37043.1 enterotoxin EspC Virulence espC PAI Protein 0.0 46
sigA NP_838462.1 serine protease Virulence SHI-1 Protein 2e-170 43
sigA NP_708742.1 serine protease Virulence SHI-1 Protein 2e-170 43
sigA AAF67320.1 exported serine protease SigA Virulence SHI-1 Protein 1e-170 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
pic NP_838464.1 serine protease precurser VFG0635 Protein 0.0 100
pic NP_838464.1 serine protease precurser VFG0861 Protein 0.0 99
pic NP_838464.1 serine protease precurser VFG0903 Protein 0.0 97
pic NP_838464.1 serine protease precurser VFG1689 Protein 0.0 49
pic NP_838464.1 serine protease precurser VFG0904 Protein 0.0 49
pic NP_838464.1 serine protease precurser VFG0772 Protein 0.0 46
pic NP_838464.1 serine protease precurser VFG0630 Protein 7e-171 43
pic NP_838464.1 serine protease precurser VFG0844 Protein 3e-175 43
pic NP_838464.1 serine protease precurser VFG0862 Protein 3e-171 42