Gene Information

Name : pic (EC042_4593)
Accession : YP_006098864.1
Strain : Escherichia coli 042
Genome accession: NC_017626
Putative virulence/resistance : Virulence
Product : serine protease (autotransporter)
Function : -
COG functional category : -
COG ID : -
EC number : 3.4.21.-
Position : 4924840 - 4928958 bp
Length : 4119 bp
Strand : -
Note : -
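
The Length field follows from the Position field when the coordinates are read as a 1-based, inclusive interval: 4928958 - 4924840 + 1 = 4119 bp, which is 1373 codons (1372 residues plus one stop codon). A minimal Python sketch of that check follows. The function name `feature_length` is hypothetical, and the inclusive-coordinate convention is an assumption that happens to agree with the reported length.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(4924840, 4928958)

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)
print(length_bp, codons, remainder)
```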

DNA sequence :
GTGAATAAAGTTTATTCTCTTAAATATTGCCCCGTCACCGGGGGGCTTATTGCTGTCTCTGAACTTGCCCGCAGGGTAAT
AAAAAAGACATGCCGAAGATTAACGCATATTCTTCTGGCTGGCATTCCAGCAATCTGTCTGTGTTACTCTCAGATATCCC
AGGCGGGTATTGTCCGTTCCGATATTGCCTATCAGATTTATCGTGATTTCGCCGAAAACAAAGGGCTTTTTGTACCTGGT
GCCAATGATATTCCGGTATATGATAAGGACGGAAAACTTGTGGGAAGACTGGGTAAAGCCCCAATGGCCGATTTCAGCAG
TGTGAGCTCAAATGGCGTTGCTACGCTTGTATCACCTCAGTATATCGTCAGCGTAAAGCATAACGGAGGATATCGGAGTG
TGAGCTTTGGTAATGGGAAAAATACATATTCCCTTGTTGACCGTAATAACCACCCTTCTATTGACTTCCATGCTCCACGT
CTGAATAAACTGGTTACAGAAGTTATTCCCTCAGCGGTAACATCAGAAGGAACCAAAGCCAATGCTTATAAATACACTGA
ACGTTACACCGCTTTTTATCGGGTGGGTAGTGGTACGCAGTACACTAAGGACAAGGACGGAAATTTAGTTAAGGTTGCCG
GTGGATATGCTTTTAAAACAGGAGGAACCACAGGAGTTCCTCTGATATCTGATGCAACAATAGTCTCTAATCCCGGGCAA
ACTTATAATCCTGTAAACGGCCCTTTACCTGACTATGGAGCCCCTGGGGATAGTGGTTCTCCTTTGTTTGCTTATGATAA
ACAACAAAAAAAATGGGTTATTGTTGCTGTATTAAGAGCATATGCAGGTATTAATGGTGCTACGAACTGGTGGAATGTCA
TACCAACAGATTATCTGAACCAGGTTATGCAGGACGATTTCGATGCCCCCGTAGACTTTGTTTCCGGACTGGGCCCCCTG
AACTGGACATACGACAAAACATCAGGCACAGGTACCCTGAGCCAGGGCAGTAAAAACTGGACCATGCACGGGCAGAAAGA
CAATGACCTCAATGCCGGTAAAAATCTGGTATTCAGCGGGCAGAATGGTGCAATTATCCTGAAAGACAGTGTGACTCAGG
GTGCCGGTTATCTCGAATTTAAAGACAGTTACACCGTATCTGCTGAATCCGGAAAAACATGGACGGGTGCCGGCATTATT
ACTGACAAGGGGACGAATGTAACCTGGAAGGTCAACGGCGTTGCCGGTGACAACTTGCATAAGCTGGGGGAAGGAACCCT
GACCATAAACGGAACAGGTGTAAACCCGGGAGGACTGAAAACGGGAGACGGTATCGTTGTACTTAACCAGCAGGCAGACA
CTGCAGGTAATATCCAGGCCTTCAGTTCAGTGAACCTCGCCAGCGGACGTCCGACCGTGGTGCTCGGGGATGCCCGTCAG
GTCAATCCGGATAACATTTCATGGGGATACCGGGGAGGTAAGCTTGACCTTAATGGTAATGCCGTTACCTTCACCCGACT
GCAGGCTGCTGATTACGGGGCGGTGATTACAAATAATGCACAGCAAAAATCCCAGCTTTTACTGGATCTTAAGGCTCAGG
ATACAAATGTCAGTGAACCGACGATTGGAAATATATCCCCCTTTGGTGGTACCGGCACTCCAGGAAACCTGTACAGCATG
ATACTCAACAGCCAGACCCGCTTCTATATTCTGAAATCTGCCAGCTATGGTAACACTCTGTGGGGGAACAGCCTGAATGA
TCCGGCTCAGTGGGAGTTTGTTGGCATGGACAAAAACAAAGCAGTTCAGACAGTAAAAGATAGGATCCTGGCCGGGCGGG
CAAAACAACCCGTTATCTTTCATGGTCAGCTGACCGGGAATATGGATGTCGCCATTCCACAGGTGCCGGGGGGAAGAAAG
GTCATCTTTGATGGTAGCGTGAACCTGCCGGAAGGTACCCTGAGTCAGGACAGTGGCACCCTGATATTCCAGGGACATCC
GGTTATCCATGCCTCCATCAGTGGCAGTGCACCGGTCAGCCTGAACCAGAAAGACTGGGAAAACCGTCAGTTTACAATGA
AAACACTGTCGCTGAAAGACGCTGACTTCCATCTTTCACGTAACGCCTCGCTGAACAGTGACATTAAGTCGGATAACAGC
CATATCACACTGGGAAGTGACAGGGCATTTGTGGATAAAAATGACGGAACAGGAAATTATGTCATTCCGGAGGAAGGTAC
CTCTGTCCCGGACACCGTGAATGACAGGAGCCAGTATGAAGGGAATATTACGCTGAACCATAACTCAGCCCTGGATATCG
GCAGCAGGTTCACCGGGGGGATTGACGCTTATGACAGTGCCGTCAGCATCACCTCTCCGGACGTCCTGTTGACAGCCCCG
GGTGCTTTTGCCGGCAGTTCACTGACAGTGCATGATGGCGGTCATCTTACAGCACTGAACGGTCTTTTCAGCGACGGGCA
TATTCAGGCCGGTAAGAACGGCAAAATCACCCTGAGCGGTACACCGGTTAAAGATACGGCTAATCAGTATGCCCCTGCTG
TATATCTGACGGACGGATATGACCTGACCGGCGATAACGCAGCACTGGAAATTACCCGTGGAGCACATGCTTCCGGTGAT
ATTCATGCCTCTGCGGCATCAACAGTTACCATCGGGTCTGACACGCCGGCAGAACTGGCTTCTGCGGAAACGGCTGCATC
GGCGTTTGCCGGCAGTCTTCTTGAGGGCTATAACGCAGCATTCAATGGTGCCATAACCGGTGGCAGGGCTGATGTCAGTA
TGCATAATGCACTGTGGACTCTGGGTGGGGACTCTGCCATCCACAGTCTTACCGTCAGAAACAGCCGTATTAGTTCTGAA
GGAGACCGTACATTCCGTACCCTGACGGTGAATAAACTGGATGCAACAGGCAGTGATTTTGTTTTGCGTACGGACCTGAA
AAATGCCGATAAAATTAATGTGACTGAAAAAGCCACTGGTTCAGATAACAGCCTGAACGTCAGCTTTATGAATAATCCTG
CTCAGGGACAGGCCCTGAATATTCCTCTGGTCACGGCACCGGCGGGAACTTCAGCAGAGATGTTTAAGGCCGGCACCCGG
GTGACAGGTTTCAGTCGGGTGACCCCAACCCTGCATGTTGATACCAGTGGTGGCAATACGAAGTGGATACTGGATGGTTT
TAAAGCGGAGGCTGATAAAGCCGCTGCCGCGAAGGCTGACAGTTTCATGAATGCCGGGTATAAAAACTTCATGACGGAAG
TTAACAATCTGAACAAACGTATGGGTGACCTGCGTGACACAAACGGTGATGCCGGTGCCTGGGCGCGCATCATGAGTGGT
GCCGGTTCTGCAGACGGTGGTTACAGTGATAATTACACCCATGTTCAGGTCGGCTTTGACAAAAAACATGAACTGGACGG
TGTGGACCTGTTTACCGGTGTCACGATGACCTATACCGACAGCAGTGCAGACAGCCATGCATTCAGCGGAAAGACGAAAT
CGGTGGGGGGCGGTCTGTATGCTTCAGCATTGTTTGAGTCCGGTGCCTATATCGATTTGATTGGTAAATATATTCACCAT
GACAATGATTACACAGGTAACTTTGCTAGCCTGGGAACGAAACACTACAACACCCATTCCTGGTATGCCGGTGCTGAAAC
GGGTTACCGCTATCACCTGACAGAGGACACGTTCATTGAGCCGCAGGCTGAACTGGTTTACGGCGCCGTGTCCGGGAAAA
CATTCCGCTGGAAAGACGGTGATATGGACCTGAGCATGAAGAACAGGGACTTCAGTCCGCTGGTTGGAAGAACAGGGGTT
GAACTGGGCAAGACCTTCAGTGGTAAGGACTGGAGTGTGACGGCCCGTGCCGGAACCAGCTGGCAGTTTGACCTGCTGAA
TAATGGAGAGACCGTACTGCGTGATGCGTCCGGGGAGAAACGGATAAAAGGAGAGAAGGACAGCCGGATGCTGTTTAATG
TTGGTATGAATGCGCAGATAAAGGACAATATGCGCTTTGGTCTGGAGTTTGAGAAGTCAGCCTTTGGTAAATATAACGTG
GATAATGCGGTAAACGCGAATTTCCGGTATATGTTCTGA

Protein sequence :
MNKVYSLKYCPVTGGLIAVSELARRVIKKTCRRLTHILLAGIPAICLCYSQISQAGIVRSDIAYQIYRDFAENKGLFVPG
ANDIPVYDKDGKLVGRLGKAPMADFSSVSSNGVATLVSPQYIVSVKHNGGYRSVSFGNGKNTYSLVDRNNHPSIDFHAPR
LNKLVTEVIPSAVTSEGTKANAYKYTERYTAFYRVGSGTQYTKDKDGNLVKVAGGYAFKTGGTTGVPLISDATIVSNPGQ
TYNPVNGPLPDYGAPGDSGSPLFAYDKQQKKWVIVAVLRAYAGINGATNWWNVIPTDYLNQVMQDDFDAPVDFVSGLGPL
NWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIILKDSVTQGAGYLEFKDSYTVSAESGKTWTGAGII
TDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGIVVLNQQADTAGNIQAFSSVNLASGRPTVVLGDARQ
VNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSQLLLDLKAQDTNVSEPTIGNISPFGGTGTPGNLYSM
ILNSQTRFYILKSASYGNTLWGNSLNDPAQWEFVGMDKNKAVQTVKDRILAGRAKQPVIFHGQLTGNMDVAIPQVPGGRK
VIFDGSVNLPEGTLSQDSGTLIFQGHPVIHASISGSAPVSLNQKDWENRQFTMKTLSLKDADFHLSRNASLNSDIKSDNS
HITLGSDRAFVDKNDGTGNYVIPEEGTSVPDTVNDRSQYEGNITLNHNSALDIGSRFTGGIDAYDSAVSITSPDVLLTAP
GAFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNGKITLSGTPVKDTANQYAPAVYLTDGYDLTGDNAALEITRGAHASGD
IHASAASTVTIGSDTPAELASAETAASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLGGDSAIHSLTVRNSRISSE
GDRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMNNPAQGQALNIPLVTAPAGTSAEMFKAGTR
VTGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMGDLRDTNGDAGAWARIMSG
AGSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYASALFESGAYIDLIGKYIHH
DNDYTGNFASLGTKHYNTHSWYAGAETGYRYHLTEDTFIEPQAELVYGAVSGKTFRWKDGDMDLSMKNRDFSPLVGRTGV
ELGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKDNMRFGLEFEKSAFGKYNV
DNAVNANFRYMF
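
The DNA sequence begins with GTG rather than ATG, yet the protein begins with M: in bacteria a GTG initiation codon is read as methionine. The listed DNA also translates directly into the listed protein, so it appears to be given in the coding orientation even though the gene lies on the minus strand. The sketch below illustrates both points on the first 30 and last 39 nucleotides of the sequence above. The helper name `translate_cds` and its `is_start` parameter are hypothetical, and the codon table is the standard one written out by hand, not a call into any particular library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (last base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons treated as Met at position 1 (assumption: bacterial usage).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon.

    If is_start is True and the first codon is a start codon,
    the first residue is reported as M regardless of the table.
    """
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


head = "GTGAATAAAGTTTATTCTCTTAAATATTGC"           # first 30 nt of the record
tail = "GATAATGCGGTAAACGCGAATTTCCGGTATATGTTCTGA"  # last 39 nt of the record

print(translate_cds(head))                  # N-terminal residues
print(translate_cds(tail, is_start=False))  # C-terminal residues plus stop
```

The two printed fragments can be compared against the first ten and last twelve residues of the protein sequence listed above.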

• Homologs from PAI DB

Gene     GenBank Accn  Product                            Virulence or Resistance  PAI or REI       Alignment Type  E-val  Identity (%)
she      AAB58244.1    mucinase                           Virulence                SHI-1            Protein         0.0    99
pic      NP_838464.1   serine protease precursor          Virulence                SHI-1            Protein         0.0    99
pic      NP_708747.3   serine protease                    Not tested               SHI-1            Protein         0.0    99
pic      AAK00464.1    Pic                                Virulence                SHI-1            Protein         0.0    99
unnamed  CAC39286.1    hypothetical protein               Not tested               LPA              Protein         0.0    53
vat      YP_851472.1   vacuolating autotransporter        Not tested               PAI III APEC-O1  Protein         0.0    48
vat      AAO21903.1    vacuolating autotransporter toxin  Virulence                Not named        Protein         0.0    48
unnamed  CAD66214.1    putative hemoglobin protease       Not tested               PAI III 536      Protein         0.0    47

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                            ID of source DB  Alignment Type  E-val  Identity (%)
pic   YP_006098864.1  serine protease (autotransporter)  VFG0635          Protein         0.0    99
pic   YP_006098864.1  serine protease (autotransporter)  VFG0861          Protein         0.0    99
pic   YP_006098864.1  serine protease (autotransporter)  VFG0903          Protein         0.0    97
pic   YP_006098864.1  serine protease (autotransporter)  VFG1689          Protein         0.0    47
pic   YP_006098864.1  serine protease (autotransporter)  VFG0904          Protein         0.0    47