Gene Information

Name : EcE24377A_0228 (EcE24377A_0228)
Accession : YP_001461387.1
Strain : Escherichia coli E24377A
Genome accession: NC_009801
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3523
EC number : -
Position : 243795 - 247319 bp
Length : 3525 bp
Strand : -
Note : identified by match to protein family HMM PF06744; match to protein family HMM PF06761; match to protein family HMM TIGR03348
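
The Position and Length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, and the one that reproduces the listed length); the helper name feature_length is illustrative and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 243795, 247319
length_bp = feature_length(start, end)

# A complete coding sequence should be a whole number of codons;
# the final codon is the stop codon and is not part of the protein.
codons, remainder = divmod(length_bp, 3)

print(length_bp)               # 3525, matching the Length field
print(codons - 1, remainder)   # 1174 residues expected, remainder 0
```

The expected residue count (1174) agrees with the length of the protein sequence listed in this record.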

DNA sequence :
GTGTTCAGATTACCAACACCCCGACTACTCAGCGGACTCAAATCAGCCCTGCGACCGGCGATGCCCCGGTTTAAAATCTC
TGCTTTCTGGCTGCTGATACTGGCGTGGATTTTTCTGCTGGTGTGGATCTGGTGGAAAGGCCCGATGTGGACGCTGTATG
AAGAACAGTGGCTCAAACCACTGGCGAACCGCTGGCTGGCAACGGCGGCGTGGGGGATTATTGCCCTGGTGTGGCTTACC
GTCCGGGTGATGAAGCGCCTGCAACAGCTTGAAAAAATGCAAAAGCAACAGCGCGAGGAAGCCGTTGATCCGCTCAGCGT
GGAACTGAACGCCCAGCAGCGTTATCTTGACCGCTGGCTGCTGCGCCTGCAACGCCATCTCGACAACCGCCGTTTCCTGT
GGCAGTTGCCGTGGTATATGGTCATCGGCCCGGCGGGCAGTGGCAAAACGACACTGCTGCGCGAGGGGTTTCCGTCCGAC
ATTATTTATGCCCCGGAGGGCGCACGCGGCGCAGAACAACGCCTGTACCTCACGCCCCATGTCGGTAAACAGGCGGTGAT
CTTTGATATCGACGGCACACTCTGCGCTCCCGCTGATGCGGATATCCTGCATCGCCGTCTGTGGGAACATGCCCTCGGCT
GGCTGAAAGAAAAGCGCGCGCGCCAGCCGCTGAACGGGATTATTCTGACACTCGATTTACCCGATCTACTGACCGCAGAC
AAACGCCGCCGCGAGCATCTGTTACAGACGCTGCGCAGCCGCTTACAGGATATACGCCAGCATCTTCACTGCCAGTTACC
GGTTTACGTGGTACTGACCCGGCTTGATTTATTACAGGGTTTTGCCGCCCTGTTCCAGTCACTCAACAGACAGGATCGCG
ATGCGATTCTGGGCGTCACGTTCACCCGCCGTGCCCATGAAAATGATGACTGGCGAACAGAGTTGAATGCTTTCTGGCAG
ACATGGGTGGATCGAATGAATCTGGCGTTGCCGGATCTGATGGTCGCTCAGACTCACACCCGCGCGTCTTTATTCAGTTT
TTCCCGCCAGATGCAGGGAAGCCGTGAACCGCTGGTGTCACTGCTTGAGGGTCTGCTTGATGGCGAAAATATGAACGTGA
TGCTGCGTGGTGTCTATCTCACCTCTTCGCTTCAGCGTGGACAGATGGATGATATATTCACCCAGTCTGCCGCCCGCCAG
TACCGGCTGGGCAATAACCCACTGGCGTCCTGGCCCCTGGTGGACACCGCGCCTTATTTCACCCGCAGCCTGTTCCCGCA
GGCATTACTCGCAGAGCCTAATCTGGCAACAGAGAGCCGCGCCTGGCTGATACGTTCCCGTCGCCGCCTGACGGTTTTCT
CTGCCACAGGCGGCGTGGCAGCACTGCTGCTCATCACCGGCTGGCATCACTATTACAACGGTAACTATCAGTCCGGCATC
ACCGTGCTTAAGCAGGCCAAAGCCTTTATGGACGTGCCGCCTCCGCAGGGGGAAGATGACTTTGGCAACCTGCAACTGCC
GCTCCTGAACCCGGTACGCGATGCCACACTGGCCTATGGCGACTGGGGCGACCGCAGCCGTCTGGCCGATATGGGACTGT
ACCAGGGACGACGTATCGGGCCTTATGTGGAACAGACCTATCTGCAACTGCTGGAGCAACGTTACCTGCCCTCGCTGTTT
AACGGGCTGGTCAAAGCGATGAACGCCGCGCCGCCGGAGAGTGAAGAAAAACTCGCGGTGCTGCGCGTGATGCGAATGCT
GGAGGACAAAAGCGGACGTAACAATCAGGTGGTGAAGCAGTATATGGCAAAACGCTGGAGCGAAAAATTCCACGGCCAGC
GCGATATCCAGGCACAACTGATGTCCCATCTTGACTACGCGCTGGCTCATACTGACTGGCATGCAGAGCGTCAGGCGGGC
GACGGTGACGCCATCAGCCGCTGGACGCCATATGACAAGCCCGTGGTATCAGCACAGAAAGAACTGAGCAAACTGCCTGT
CTACCAGCGGGTTTACCAGAGTCTGAAAACGCGGGCGCTGGGCGTTCTTCCTGCCGACCTCAATCTGCGTGACCAGGTAG
GGCCAACCTTTGACCAGGTGTTTACATCTGCCGATGACAACAAACTGGTTGTTCCACAGTTTCTTACCCGTTACGGCCTG
CAAAGCTATTTTGTAAAACAGCGCGATGAACTGGTTGAACTGACGGCGATGGATTCCTGGGTACTTAACCTCACCCGCAA
TGTGAAATACAGTGACGCCGACCGCGCGGAAATCCAGCGCCAGTTGACCGAGCAGTATATCAGCGACTACACCGCCACCT
GGCGGGCCGGGATGGACAATCTGAATATCCGCAATTTTGAGTCCATCGGACAACTGACCGGGGCGCTGGAGCAGGTTATC
AGCGGCGACCAGCCTTTGCAGCGGGCGCTGACCGTGCTGCGTGACAACACACAGCCAGGCGTCTTTTCTGAAAAACTCTC
TGCCAAAGAACGGGAGGAAGCCCTGGCAGAGCCGGATTACCAGTTACTCACCCGCCTCGGGCATGAATTCGCCCCGGAAA
ACAGTACCCTGGCAGTACAGAAAGACAAAGAAAGCACGATGCAGGCCGTGTATCAGCAACTCACCGAGTTGCACCGCTAC
CTGCTGGCAATCCAGAACGCGCCTGTACCAGGGAAATCGGCGCTGAAAGCCGTGCAGTTACGGCTTGATCAGAACAGCAG
CGATCCGATATTCGCCACCCGCCAGATGGCAAAAACGCTGCCTGCTCCGCTCAACCGCTGGGTTGGCAGGCTGGCTGACC
AGGCCTGGCATGTGGTGATGGTGGAGGCTGTTCATTATATGGAAGTGGACTGGCGCGACAGCGTGGTGAAACCGTTTAAC
GAGCAACTGGCAAATAACTATCCGTTTAATCCGCGTTCTGCACAGGATGCCTCACTGGATGCCTTCGAACGCTTCTTTAA
ACCGGATGGCATACTGGATACCTTCTACCAGCAGAACCTGAAGCTGTTTATCGATAATGACCTGAGTCTGGAGGATGGCG
ATAACAACGTCATTATTCGCGAAGATATTATTGCGCAACTGGAAACTGCGCAGAAAATCCGTGACATCTTCTTCAGCAAA
CAGAACGGTCTGGGAACATCCTTTGCCGTGGAAACGGTATCGCTTTCAGGCAATAAACGCCGCAGTGTACTGAACCTTGA
CGGTCAGTTAGTCGATTACAGCCAGGGCCGTAACTATACCGCCCATCTGGTCTGGCCTAACAACATGCGCGAAGGCAACG
AAAGTAAGCTGACGCTCATCGGCACCAGCGGCAACGCGCCGCGCAGTATCAGCTTCAGCGGGCCGTGGGCGCAGTTCCGC
CTGTTCGGGGCCGGACAACTGACCGGAGTACAGGATGGCAACTTTACCGTGCGCTTTAGCGTGGACGGTGGCGCGATGAC
CTACCGTGTGCATACCGACACGGAAGATAACCCGTTCAGCGGTGGGTTGTTCAGCCAGTTTGGTCTGTCAGACACACTGT
ACTGA

Protein sequence :
MFRLPTPRLLSGLKSALRPAMPRFKISAFWLLILAWIFLLVWIWWKGPMWTLYEEQWLKPLANRWLATAAWGIIALVWLT
VRVMKRLQQLEKMQKQQREEAVDPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSD
IIYAPEGARGAEQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTAD
KRRREHLLQTLRSRLQDIRQHLHCQLPVYVVLTRLDLLQGFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQ
TWVDRMNLALPDLMVAQTHTRASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQ
YRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWLIRSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGI
TVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQGRRIGPYVEQTYLQLLEQRYLPSLF
NGLVKAMNAAPPESEEKLAVLRVMRMLEDKSGRNNQVVKQYMAKRWSEKFHGQRDIQAQLMSHLDYALAHTDWHAERQAG
DGDAISRWTPYDKPVVSAQKELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGL
QSYFVKQRDELVELTAMDSWVLNLTRNVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESIGQLTGALEQVI
SGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRY
LLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFN
EQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSK
QNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFR
LFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
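
The protein sequence can be related back to the DNA sequence with a short translation routine. The following is a minimal sketch, not the database's own method: it assumes the DNA is listed in coding orientation (it begins with GTG and ends with TGA, even though the gene is annotated on the minus strand), uses the standard codon table, and treats an alternative start codon (GTG or TTG) in first position as methionine, as is common for bacterial genes. The names translate, CODON_TABLE and START_CODONS are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG/TTG is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the DNA sequence in this record.
print(translate("GTGTTCAGATTACCAACACCCCGACTACTC"))  # MFRLPTPRLL
# Last 15 nucleotides, ending in the TGA stop codon.
print(translate("GACACACTGTACTGA"))  # DTLY
```

Both fragments reproduce the corresponding ends of the listed protein (MFRLPTPRLL... and ...DTLY). Passing the full 3525 bp sequence to translate should give the complete 1174-residue product under the same assumptions.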

Homologs from PAI DB

Gene   GenBank Accn  Product               Virulence or Resistance  PAI or REI                             Alignment Type  E-val  Identity (%)
aec30  YP_851415.1   hypothetical protein  Not tested               PAI II APEC-O1                         Protein         0.0    78
aec30  AAQ96724.1    Aec30                 Not tested               AGI-1                                  Protein         0.0    78
pmt1   AAN64194.1    Pmt1                  Not tested               macrophage toxin pathogenicity island  Protein         0.0    57