Gene Information

Name : S4031 (S4031)
Accession : NP_839201.1
Strain : Shigella flexneri 2457T
Genome accession: NC_004741
Putative virulence/resistance : Unknown
Product : ISSfl3 orfC
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 3923034 - 3924635 bp
Length : 1602 bp
Strand : +
Note : residues 1 to 533 of 533 are 96.99 pct identical to residues 1 to 533 of 533 from GenPept : >gb|AAL72376.1| (AF386526) hypothetical protein [Shigella flexneri 2a]

DNA sequence :
ATGAACAATGAACTCCCCGATGATATTGAGCTGCTTAAAGCCATGTTGCGTAAGCAACAGAGTCGGCTTCGACAGTATGC
CTGTCAGGTCGCGGGCTATGAGCAGGAAATTGAACGGCTGAAAGCGCAACTCGACAGGTTGCGTCGTATGTTGTTCGGCC
AGAGTTCAGAGAAAAAGCGTCATAAGCTTGAAAATCAGATCCGACAGGCAGAAAAACGACTGTCGGAACTGGAAAACCGG
CTGAACACAGCCAGAAATCTTCTGGAAGATGCATCGTCAGTCACAGATTCACCTGACACCAGTCCCCCGTCAGAAAACCC
GATCGCCAGTAAGCCTGAATCCCCGGGACGAAAATCTTCACGAAAACCGCTGCCGGCAGAACTTCCCCGGGAGACACATC
GCCTTCTGCCTGCTGAAACCAGTTGCCCGGCCTGTGGAGGTGTTCTGAAAGAAATGGGGGAAACAATCTCAGAGCAACTG
GATATCATTAATACCGCCTTTAAAGTTATCGAAACCATACGTCCCAAACTGGCCTGTAGCCGGTGTGATGTCATCGTTCA
GGCACCACTTCCTCCTAAACCGATCGAACGCGGTTATGCCAGTGCAGGGTTACTTGCACGGATCCTGGTCAGCAAATATA
TGGAACATATCCCTTTATATCGCCAGTCAGAAATATACGCGCGACAGGGCGTGGAGCTGAGCCGTAATACCATGGTGCGC
TGGGTATCAGAAATGGCAGACAAACTCCGTCCTCTGTATATAGCGCTGAATGACTATGTTCTGGAGGCAGGAAAGGTGCA
CGCAGATGACACTCCGGTGAAAGTACTGGCCCCGGGGAACGGAAAGACGAAAACGGGTCGTCTGTGGGTATACGTCAGGG
ATGATCGTAATGCGGGTTCATCCCTGCCGGCAGCCGTCTGGTTCGCGTATTCGGCAGATCGCAAAGGAGAACATCCGCAG
CTCCACCTGGCAAAGTATCAGGGCGTACTGCAGGCTGATGCCTATGCAGGTTATAACGTACTGTACGAAACGGGCCGGGT
GAAGGAAGCCGGGTGCCTGGCCCACGCCCGCCGAAAAATCCATGACGAGGATGTGCGCCGTCCGACAGAAATGACTCAGG
AAGCGCTCAGACGGATAGCAGAGTTATACGACATAGAAGCGGAGATACGTGGCAGTCCGGCAGAGGAACGGCTTGCAGTC
AGAAAAGCCAGAAGCGTCCAGTTGATGCAGTCGTTGTACGACTGGATACAGTTGCAGAGGAAAACGCTGTCGAAACATGC
GAAGATGGCGAAGGCGTTCGACTATATCCTGAATCACTGGAATGCGCTGAACGAGTTCTGTCGTGACGGCCGGGTGGAAA
TAGACAACAACATCGGTGAAAACGCGTTACGATCGGTGGCGGTTGGAAGAAAAAATTATCTCTTTTTCGGCTCAGACAAG
GGAGGAGAAAGTGCGGCGATCATCTACAGTCTGCTGGTCACCTGCAAACAGAACGAAGTGGAGCCGGAGGACTGGTTGCG
CGAAGTGATCGAGAAGCTCAATGACTGGCCGTCGAACCAAGTGCATGAACTGCTGCCCTGGAACTTCTCGTCTGTAAAAT
AA

Protein sequence :
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRMLFGQSSEKKRHKLENQIRQAEKRLSELENR
LNTARNLLEDASSVTDSPDTSPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLKEMGETISEQL
DIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVR
WVSEMADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGSSLPAAVWFAYSADRKGEHPQ
LHLAKYQGVLQADAYAGYNVLYETGRVKEAGCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAKMAKAFDYILNHWNALNEFCRDGRVEIDNNIGENALRSVAVGRKNYLFFGSDK
GGESAAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
Z4317 NP_289543.1 hypothetical protein Not tested OI-122 Protein 2e-149 64
ECO103_3554 YP_003223421.1 hypothetical protein Not tested LEE Protein 6e-150 64
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 2e-127 61
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 6e-149 59
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 3e-117 57
tnp AEA34686.1 transposase Not tested Not named Protein 4e-135 56
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 9e-134 56
unnamed AAC31494.1 L0015 Not tested LEE Protein 6e-134 55
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 5e-133 55
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 5e-133 55
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 2e-133 55
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 2e-133 55
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 7e-134 55
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 7e-134 55
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 6e-134 55
unnamed AAL57570.1 unknown Not tested LEE Protein 1e-133 55
l0015 CAD33775.1 L0015 protein Not tested PAI I 536 Protein 4e-87 54
unnamed AAL08460.1 unknown Not tested SRL Protein 3e-97 53
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 3e-119 51
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 3e-119 51
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 2e-121 50
tnp AEA34687.1 IS66 family transposase Not tested Not named Protein 5e-76 47

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
S4031 NP_839201.1 ISSfl3 orfC VFG0793 Protein 3e-134 55
S4031 NP_839201.1 ISSfl3 orfC VFG1516 Protein 2e-87 54
S4031 NP_839201.1 ISSfl3 orfC VFG1051 Protein 1e-97 53
S4031 NP_839201.1 ISSfl3 orfC VFG1736 Protein 6e-99 52