Gene Information

Name : pic (S3178)
Accession : NP_838464.1
Strain : Shigella flexneri 2457T
Genome accession: NC_004741
Putative virulence/resistance : Virulence
Product : serine protease precurser
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 3059914 - 3063126 bp
Length : 3213 bp
Strand : -
Note : residues 1 to 1070 of 1070 are 96.16 pct identical to residues 304 to 1373 of 1373 from GenPept : >gb|AAK00464.1| (AF200692) Pic [Shigella flexneri 2a]

DNA sequence :
ATGCAGGACGATTTCGATGCCCCCGTAGACTTTGTTTCCGGACTGGGCCCCCTGAACTGGACATACGACAAAACATCAGG
CACAGGTACCCTGAGCCAGGGCAGTAAAAACTGGACCATGCACGGGCAGAAAGACAATGACCTCAATGCCGGTAAAAATC
TGGTATTCAGCGGGCAGAATGGTGCAATTATCCTGAAAGACAGTGTGACTCAGGGTGCCGGTTATCTCGAATTTAAAGAC
AGTTACACCGTATCTGCTGAATCCGGAAAAACATGGACGGGTGCCGGCATTATTACTGACAAGGGGACGAATGTAACCTG
GAAGGTCAACGGCGTTGCCGGTGACAACTTGCATAAGCTGGGGGAAGGAACCCTGACCATAAACGGAACAGGTGTAAACC
CGGGAGGACTGAAAACGGGAGACGGTATCGTTGTACTTAACCAGCAGGCAGACACTGCAGGTAATATCCAGGCCTTCAGT
TCAGTGAACCTCGCCAGCGGACGTCCGACCGTGGTGCTCGGGGATGCCCGTCAGGTCAATCCGGATAACATTTCATGGGG
ATACCGGGGAGGTAAGCTTGACCTTAATGGTAATGCCGTTACCTTCACCCGACTGCAGGCTGCTGATTACGGGGCGGTGA
TTACAAATAATGCACAGCAAAAATCCCAGCTTTTACTGGATCTTAAGGCTCAGGATACAAATGTCAGTGAACCGACGATT
GGAAATATATCCCCCTTTGGTGGTACCGGCACTCCAGGAAACCTGTACAGCATGATACTCAACAGCCAGACCCGCTTCTA
TATTCTGAAATCTGCCAGCTATGGTAACACTCTGTGGGGGAACAGCCTGAATGATCCGGCTCAGTGGGAGTTTGTTGGCA
TGAACAAAAACAAAGCAGTTCAGACAGTAAAAGATAGGATCCTGGCCGGGCGGGCAAAACAACCCGTTATCTTTCATGGT
CAGCTGACCGGGAATATGGATGTCGCCATTCCACAGGTGCCGGGGGGAAGAAAGGTAATCTTTGATGGTAGCGTGAACCT
GCCGGAAGGTACCCTGAGTCAGGACAGTGGCACCCTGATATTCCAGGGACATCCGGTTATCCATGCCTCCATCAGTGGCA
GTGCACCGGTCAGCCTGAACCAGAAAGACTGGGAAAACCGTCAGTTTACAATGAAAACACTGTCGCTGAAAGACGCTGAC
TTCCATCTTTCACGTAACGCCTCGCTGAACAGTGACATTAAGTCGGATAACAGCCATATCACACTGGGAAGTGACAGGGC
ATTTGTGGATAAAAATGACGGAACAGGAAATTATGTCATTCCGGAGGAAGGTACCTCTGTCCCGGACACCGTGAATGACA
GGAGCCAGTATGAAGGGAATATTACGCTGAACCATAACTCAGCCCTGGATATCGGCAGCAGGTTCACCGGGGGGATTGAC
GCTTATGACAGTGCCGTCAGCATCACCTCTCCGGACGTCCTGTTGACAGCCCCGGGTGCTTTTGCCGGCAGTTCACTGAC
AGTGCATGATGGCGGTCATCTTACAGCACTGAACGGTCTTTTCAGCGACGGGCATATTCAGGCCGGTAAGAACGGCAAAA
TCACCCTGAGCGGTACACCGGTTAAAGATACGGCTAATCAGTATGCCCCTGCTGTATATCTGACGGACGGATATGACCTG
ACCGGCGATAACGCAGCACTGGAAATTACCCGTGGAGCACATGCTTCCGGTGATATTCATGCCTCTGCGGCATCAACAGT
TACCATCGGGTCTGACACGCCGGCAGAACTGGCTTCTGCGGAAACGGCTGCATCGGCGTTTGCCGGCAGTCTTCTTGAGG
GCTATAACGCAGCATTCAATGGTGCCATAACCGGTGGCAGGGCTGATGTCAGTATGCATAATGCACTGTGGACTCTGGGT
GGGGACTCTGCCATCCACAGTCTTACCGTCAGAAACAGCCGTATTAGTTCTGAAGGAGACCGTACATTCCGTACCCTGAC
GGTGAATAAACTGGATGCAACAGGCAGTGATTTTGTTTTGCGTACGGACCTGAAAAATGCCGATAAAATTAATGTGACTG
AAAAAGCCACTGGTTCAGATAACAGCCTGAACGTCAGCTTTATGAATAATCCTGCTCAGGGACAGGCCCTGAATATTCCT
CTGGTCACGGCACCGGCGGGAACTTCAGCAGAGATGTTTAAGGCCGGCACCCGGGTGACAGGTTTCAGTCGGGTGACCCC
AACCCTGCATGTTGATACCAGTGGTGGCAATACGAAGTGGATACTGGATGGTTTTAAAGCGGAGGCTGATAAAGCCGCTG
CCGCGAAGGCTGACAGTTTCATGAATGCCGGGTATAAAAACTTCATGACGGAAGTTAACAATCTGAACAAACGTATGGGT
GACCTGCGTGACACAAACGGTGATGCCGGTGCCTGGGCGCGCATCATGAGTGGTGCCGGTTCTGCAGACGGTGGTTACAG
TGATAATTACACCCATGTTCAGGTCGGCTTTGACAAAAAACATGAACTGGACGGTGTGGACCTGTTTACCGGTGTCACGA
TGACCTATACCGACAGCAGTGCAGACAGCCATGCATTCAGCGGAAAGACGAAATCGGTGGGGGGCGGTCTGTATGCTTCA
GCATTGTTTGAGTCCGGTGCCTATATCGATTTGATTGGTAAATATATTCACCATGACAATGATTACACAGGTAACTTTGC
TAGCCTGGGAACGAAACACTACAACACCCATTCCTGGTATGCCGGTGCTGAAACGGGTTACCGCTATCACCTGACAGAGG
ACACGTTCATTGAGCCGCAGGCTGAACTGGTTTACGGCGCCGTGTCCGGGAAAACATTCCGCTGGAAAGACGGTGATATG
GACCTGAGCATGAAGAACAGGGACTTCAGTCCGCTGGTTGGAAGAACAGGGGTTGAACTGGGCAAGACCTTCAGTGGTAA
GGACTGGAGTGTGACGGCCCGTGCCGGAACCAGCTGGCAGTTTGACCTGCTGAATAATGGAGAGACCGTACTGCGTGATG
CGTCCGGGGAGAAACGGATAAAAGGAGAGAAGGACAGCCGGATGCTGTTTAATGTTGGTATGAATGCGCAGATAAAGGAC
AATATGCGCTTTGGTCTGGAGTTTGAGAAGTCAGCCTTTGGTAAATATAACGTGGATAATGCGGTAAACGCGAATTTCCG
GTATATGTTCTGA

Protein sequence :
MQDDFDAPVDFVSGLGPLNWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIILKDSVTQGAGYLEFKD
SYTVSAESGKTWTGAGIITDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGIVVLNQQADTAGNIQAFS
SVNLASGRPTVVLGDARQVNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSQLLLDLKAQDTNVSEPTI
GNISPFGGTGTPGNLYSMILNSQTRFYILKSASYGNTLWGNSLNDPAQWEFVGMNKNKAVQTVKDRILAGRAKQPVIFHG
QLTGNMDVAIPQVPGGRKVIFDGSVNLPEGTLSQDSGTLIFQGHPVIHASISGSAPVSLNQKDWENRQFTMKTLSLKDAD
FHLSRNASLNSDIKSDNSHITLGSDRAFVDKNDGTGNYVIPEEGTSVPDTVNDRSQYEGNITLNHNSALDIGSRFTGGID
AYDSAVSITSPDVLLTAPGAFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNGKITLSGTPVKDTANQYAPAVYLTDGYDL
TGDNAALEITRGAHASGDIHASAASTVTIGSDTPAELASAETAASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLG
GDSAIHSLTVRNSRISSEGDRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMNNPAQGQALNIP
LVTAPAGTSAEMFKAGTRVTGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMG
DLRDTNGDAGAWARIMSGAGSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYAS
ALFESGAYIDLIGKYIHHDNDYTGNFASLGTKHYNTHSWYAGAETGYRYHLTEDTFIEPQAELVYGAVSGKTFRWKDGDM
DLSMKNRDFSPLVGRTGVELGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKD
NMRFGLEFEKSAFGKYNVDNAVNANFRYMF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 0.0 100
pic NP_708747.3 serine protease Not tested SHI-1 Protein 0.0 100
she AAB58244.1 mucinase Virulence SHI-1 Protein 0.0 99
pic AAK00464.1 Pic Virulence SHI-1 Protein 0.0 99
unnamed CAC39286.1 hypothetical protein Not tested LPA Protein 0.0 54
vat YP_851472.1 vacuolating autotransporter Not tested PAI III APEC-O1 Protein 0.0 49
unnamed CAD66214.1 putative hemoglobin protease Not tested PAI III 536 Protein 0.0 49
vat AAO21903.1 vacuolating autotransporter toxin Virulence Not named Protein 0.0 48
espC AAG37043.1 enterotoxin EspC Virulence espC PAI Protein 0.0 46
sigA NP_708742.1 serine protease Virulence SHI-1 Protein 2e-170 43
sigA AAF67320.1 exported serine protease SigA Virulence SHI-1 Protein 1e-170 43
sigA NP_838462.1 serine protease Virulence SHI-1 Protein 2e-170 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
pic NP_838464.1 serine protease precurser VFG0635 Protein 0.0 100
pic NP_838464.1 serine protease precurser VFG0861 Protein 0.0 99
pic NP_838464.1 serine protease precurser VFG0903 Protein 0.0 97
pic NP_838464.1 serine protease precurser VFG1689 Protein 0.0 49
pic NP_838464.1 serine protease precurser VFG0904 Protein 0.0 49
pic NP_838464.1 serine protease precurser VFG0772 Protein 0.0 46
pic NP_838464.1 serine protease precurser VFG0630 Protein 7e-171 43
pic NP_838464.1 serine protease precurser VFG0844 Protein 3e-175 43
pic NP_838464.1 serine protease precurser VFG0862 Protein 3e-171 42