Gene Information

Name : CAP2UW1_4715 (CAP2UW1_4715)
Accession : YP_003165285.1
Strain :
Genome accession: NC_013193
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG4644
EC number : -
Position : 145042 - 148008 bp
Length : 2967 bp
Strand : -
Note : -

DNA sequence :
ATGCCTCGTCGTTCCATCCTGTCCGCCGCCGAGCGGGAAAGCCTGCTGGCGTTGCCGGACACCAAGGACGATTTGATCCG
GCATTACTCGCTCAGTGATAGTGACCTTTCGATCATCCGGCAGCGGCGCGGGCCTGCGAACCGGTTGGGCTTCGCGGTGC
AGCTTTGCTACCTGCGCTTTCCCGGCATTCTCCTTGGCGTCGATCAGCCGCTGTTCCTGCCCTTGCTGAAACTGGTCGCC
GACCAGCTCAAGGTCGGTGTTGAAAGCTGGAACGACTACGGGCAGCGGGAACAGACCCGGCGCGAGCACCTGGTCGAACT
GCAAACGGTGTTCGGCTTCCAGCCGTTCACCATGAGCCATTACCGACAGGCCGTCCACACGCTGACCGAGCTGGCCATGC
AGACCGACAAAGGCATCGTGCTGGCCAGCGCCTTGATTGAGCAACTGCGGCGGCAGTCGATCATCCTGCCCGCGCTCAAC
GCCATCGAGCGCGCCAGTGCTGAGGCCATCACTCGCGCCAACCGACGCATCTATGAAGCCCTGTCCGAGCCGCTGTCGAA
CGGACACCGGCATCGCCTCGACGATCTGCTGAAACGCCGCGACAACGGCAAGATGACCTGGCTGGCCTGGCTGCGTCAGT
CGCCTGTCAAACCGAACTCCCGCCACATGCTCGAACACATTGAGCGCCTCAAAGCCTGGCAGGCCCTCGACCTGCCTTCT
GGCATCGAGCGGCTGGTTCACCAGAACCGGCTGCTCAAAATCGCCCGCGAGGGCGGCCAGATGACGCCCGCCGACCTGGC
CAAGTTCGAGCCGCGGCGGCGCTACGCCACCCTCGTGGCGCTGGCCATCGAAGGCATGGCCACCGTCACCGACGAAATCA
TCGACTTGCACGACCGCATCCTGGGGAAGCTGTTCAACGCCGCCAAGAACAAGCATCAGCAGCAGTTCCAGGCTTCTGGT
AAAGCCATCAACGCCAAGGTGCGACTGTACGGGCGCATCGGTCAGGCGCTGATCGACGCCAAACAGTCAGGCCGCGACCC
GTTCGCGGCCATTGAGGCCGTCATGTCCTGGGACGCTTTCGCCGAGAGCGTCACCGAGGCGCAAAAGCTCGCGCAGCCCG
ATGACTTCGACTTCCTGCATCGCATCGGCGAGAGTTACGCCACCTTGCGCCGCTACGCGCCGGAATTCCTTGCCGTGCTC
AAGCTGCGGGCCGCGCCCGCCGCCAAGGGTGTGCTTGATGCCATCGAGGTGCTGCGCGGTATGAACACCGACAACGCCCG
CAAGGTGCCAACCGATGCCCCGACCGACTTCATCAAGCCGCGCTGGCAGAAACTGGTGATGACCGACGCTGGTATCGACA
GACGCTACTACGAACTGTGCGCCCTGTCGGAACTCAAGAACTCCCTTCGCTCGGGCGACATTTGGGTGCAGGGTTCGCGC
CAGTTCAAGGACTTCGAGGACTACCTGGTGCCGCCCGATAAGTTCACCAGCCTCAAGCAGTCCAGCACATTGCCGCTGGC
CGTTGTCACCGACTGCGACCAATATCTGCACGACCGGCTGACCCTACTGGAAACGCAGCTTGCCACCGTCAACCGCATGG
CGGCGGCCAACGATCTGCCCGATGCCATCATCACCGAGTCCGGCTTGAAAATCACGCCGCTGGATGCGGCGGTGCCCGAC
ACCGCGCAGGCGCTGATCGACCAGACGGCCATGATCCTGCCACACGTCAAGATCACCGAATTGCTGCTCGAAGTCGATGA
GTGGACGGGCTTCACCCGCCACTTCACGCACCTGAAATCGGGCGATCTGGCCAAGGACAAAAACTTGCTGCTAACGACGA
TCCTGGCCGACGCAATCAACCTGGGCCTGACGAAGATGGCTGAGTCCTGTCCCGGCACGACCTACGCCAAGCTCGCCTGG
TTGCAAGCCTGGCATACCCGCGACGAAACCTATTCGACGGCGCTGGCTGAATTGGTGAACGCCCAATTCCGGCATCCCTT
CGCCGAGCATTGGGGCGACGGCACCACGTCATCATCGGACGGCCAGAACTTCCGCACCGGCAGCAAGGCCGAGAGTACCG
GCCACATCAACCCGAAATATGGCAGCAGCCCAGGACGGACGTTCTACACCCACATCTCCGACCAGTACGCGCCGTTCCAC
ACCAAGGTGGTAAATGTCGGCGTGCGCGACTCCACCTATGTGCTCGATGGCCTGCTGTACCACGAGTCCGACCTGCGCAT
CGAAGAACACTACACCGACACGGCGGGCTTCACCGATCATGTCTTTGCTCTGATGCACCTCTTGGGCTTCCGCTTCGCGC
CGCGCATCCGCGATTTGGGAGACACCAAGCTCTACATTTCCAAGGGCGAAACCGCCTATGACGCGCTCAAGCCGATGATC
GGCGGCACGCTCAACATCAAGCATGTCCGCGCCCATTGGGACGAAATTTTGCGGCTGGCCACCTCGATCAAGCAGGGCAC
GGTGACGGCCTCGCTGATGCTCAGGAAGCTCGGCAGTTACCCGCGCCAGAACGGCCTGGCCGTCGCCCTGCGCGAGTTGG
GCCGCATCGAGCGCACGCTGTTCATCCTCGACTGGCTGCAAAGCGTCGAGCTGCGCCGCCGTGTGCATGCCGGGTTGAAC
AAGGGCGAAGCCCGCAACGCACTGGCCCGGGCCGTGTTCTTCAACCGCTTGGGCGAAATCCGCGACCGCAGTTTCGAGCA
GCAGCGCTATCGGGCCAGCGGCCTCAACCTGGTGACGGCGGCCGTTGTGCTGTGGAACACGGTCTATCTGGAGCGCGCAG
CGCAAGCGTTACGTGGCAATGGCCATACTGTCGATAACGCGTTGTTGCAGTACCTGTCGCCGCTCGGCTGGGAACACATC
AACCTGACCGGCGACTACCTCTGGCGCAGCAGCGCCAAGATCGGCGCGGGCAAGTTCAGGCCGCTACGACCATTGCAACC
GGCTTAG

Protein sequence :
MPRRSILSAAERESLLALPDTKDDLIRHYSLSDSDLSIIRQRRGPANRLGFAVQLCYLRFPGILLGVDQPLFLPLLKLVA
DQLKVGVESWNDYGQREQTRREHLVELQTVFGFQPFTMSHYRQAVHTLTELAMQTDKGIVLASALIEQLRRQSIILPALN
AIERASAEAITRANRRIYEALSEPLSNGHRHRLDDLLKRRDNGKMTWLAWLRQSPVKPNSRHMLEHIERLKAWQALDLPS
GIERLVHQNRLLKIAREGGQMTPADLAKFEPRRRYATLVALAIEGMATVTDEIIDLHDRILGKLFNAAKNKHQQQFQASG
KAINAKVRLYGRIGQALIDAKQSGRDPFAAIEAVMSWDAFAESVTEAQKLAQPDDFDFLHRIGESYATLRRYAPEFLAVL
KLRAAPAAKGVLDAIEVLRGMNTDNARKVPTDAPTDFIKPRWQKLVMTDAGIDRRYYELCALSELKNSLRSGDIWVQGSR
QFKDFEDYLVPPDKFTSLKQSSTLPLAVVTDCDQYLHDRLTLLETQLATVNRMAAANDLPDAIITESGLKITPLDAAVPD
TAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFTHLKSGDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAW
LQAWHTRDETYSTALAELVNAQFRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKYGSSPGRTFYTHISDQYAPFH
TKVVNVGVRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLYISKGETAYDALKPMI
GGTLNIKHVRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLN
KGEARNALARAVFFNRLGEIRDRSFEQQRYRASGLNLVTAAVVLWNTVYLERAAQALRGNGHTVDNALLQYLSPLGWEHI
NLTGDYLWRSSAKIGAGKFRPLRPLQPA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
EC042_4089 YP_006098372.1 transposase Not tested Tn2411 Protein 0.0 95
tnpA AAL08440.1 transposase TnpA Not tested SRL Protein 0.0 93
tnpA ACF06153.1 transposase Not tested Tn5036-like Protein 0.0 88
tnpA ACY75538.1 TnpA Not tested Tn6060 Protein 0.0 73
EXA24 ABD94633.1 truncated Tn1721 tranposase Not tested ExoU island A Protein 0.0 73

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
CAP2UW1_4715 YP_003165285.1 hypothetical protein VFG1031 Protein 0.0 93