Gene Information

Name : XF2739 (XF2739)
Accession : NP_300016.2
Strain : Xylella fastidiosa 9a5c
Genome accession: NC_002488
Putative virulence/resistance : Unknown
Product : type I restriction-modification system endonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 2630677 - 2633811 bp
Length : 3135 bp
Strand : -
Note : similar to GI|1075407 (percent identity: 42 %/query alignment coverage: 99.9 %/subject alignment coverage: 100.2 %); identified by sequence similarity; ORF located using Glimmer/RBSfinder
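
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, as is usual for GenBank-style annotations, and that the ORF ends in a single stop codon; the function and variable names are illustrative and not part of the database record.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 2630677, 2633811

length_bp = feature_length(start, end)   # should agree with the Length field
codons = length_bp // 3                  # whole codons in the ORF
protein_len = codons - 1                 # assumes exactly one terminal stop codon

print(length_bp, codons, protein_len)    # prints: 3135 1045 1044
```

Strand does not affect this calculation: for a minus-strand gene the coordinates still describe the same interval on the reference genome.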

DNA sequence :
ATGGCATTTTTATCAGAAGCCACCTTGGAACAAGCACTGCTGGAACAGCTGCGTGCCCTGGGCTACAGCATCGAGCGTGA
AGAAAACATCGGCCCTGATGGCTATCGTCCGGAACGCGCCAGTCACGACGAGGTTGTCCTCAAACAACGCTTCGTTGATG
CCGTTGCACGTCTCAACCCAGGGCTGCCACCAGAGGCGAGGCAGGATGCAATCCGTAAGGTCACACAGTCTGAGTTGCCA
TCATTACTCGAAGAAAACCGCCGCATCCACAGACTGATCACCGAAGGTGTCGATGTCGAATACTATGCCCAGGACGGTAC
CCTGACAGCGGGCAACGTTGCGCTCATCGACTTCGACCACCCGCAACAGAACGATTGGCTGGCCGTGAACCAGTTCGTGG
TGATTGCTGATCACTACAACCGGCGTCCTGACGTGGTCCTCTTCGTGAACGGCTTGCCATTGGCCGTGATCGAATTGAAG
GCACCAGGAAGTATCAATGCAACCTTGATTGCAGCCTTCAATCAGTTGCAGACCTATAAAGAGCAGATCCCGGCACTGTT
CAACACCAACGCCCTGCTAGTGACTTCTGACGGGATGACGGCCCGCTTCGGCGCGCTGTCGGCCGATTTAGAACGCTTCA
TGCCTTGGCGCACCACCGACGGCACCGACATTGCTCCAAAAGGGGTGCCGGAACTTGCAACACTGATTGAGGGGATATGC
GAGCCGCATCGCCTGCTCGACCTACTGCGCCATTTCACCGTTTTCCGGAAGACGGATGCTGGCTTGGCGAAGATCATCGC
CGGTTACCACCAATTCCACGCCGTACGGCAGGCCATCAACAGCACCGTGGCGGCTTCCTCGCCCCAGGGGAATCAACGGA
TTGGTGTCATCTGGCACACCCAAGGTTCCGGAAAAAGCCTACTGATGGCTTTCTATGCCGGACAACTGGTCAAACACCCC
GCAATGGCCAACCCAACGCTTGTTGTACTCACAGACCGTAACGATCTTGACGATCAGTTATTCACCACATTCTCACAGTG
CAGTGATTTAATCCGGCAAACCCCGGTACAGGCCCAGAGCCGCGACCAGGTGCGGAAACTCTTGAACCGTGCATCCGGTG
GAGTGATTTTTACCACCTTGCAGAAATTCGGTGACATCACAGAACCACTGACCACACGACGTAATGTCGTCGTCATCGCC
GATGAAGCACACCGCAGCCAATATGGCTTCAAAGCCAAAGTGGACACCAAAACCGGCAAAATCTCCTATGGCTTCGCCAA
GTACATGCGTGATGCCCTGCCCAACGCATCTTGCATCGGTTTTACCGGCACACCGATTGAGGCCGACGACGTCAATACCC
CAGCCGTATTCGGCAACTACATCGACATCTACGACATCAGCCGTGCGGTCGAAGATGGCGCGACCGTGCCGATCTACTAC
GAATCGCGGTTGGCACGTATTGCACTGGATGAAGCCGAGAAACCGCAGATTGATGCTGAAGTCAACGCACTGACCGAAGC
CGATTCCGAGGCCGAGCAAGAGCGCTTCAAGAAAAAATGGGCAACGGTCGAAACCTTAGTCGGCAGTGATAAACGGCTTG
CTTTAATCGCCAAGGACATCATCACCCATTGTGAAGATCGCCTGGCCGCTCTGGATGGGAAGGCGATGGTGGTCTGTATG
AGCCGTCGTATCTGTGTCGCCCTATACGATCACATCGTGGCACTGCGGCCCGATTGGCACAGCACCGATCACAAGGCCGG
ATCGCTCAAAATCGTCATGACCGGCACGGCCAGTGACCCGCCGCAGTGGCGACAGCATATCGGCAACAAAGCCCGGCGTG
ATCTGCTGGCCGAACGTGCCCGTGACCCCAAAGACCCACTCAAACTAGTGATTGTCCGGGATATGTGGCTCACTGGTTTT
GACGCCCCCTGCATGCACACCATGTACCTTGACAAACCGATGCAAGGGCATGGATTGATGCAGGCCATTGCACGGGTGAA
CCGCGTGTTCCGCGACAAACCCGCCGGATTGATCGTGGACTACATCGGCATCGCACAAAATCTCAGAACGGCCCTGCAAC
AGTATTCGAAGAACGATCAGCAACACACCGGTGTTGATGAAGCACAGGCCATCGCACTCATGATGGAGAAATACCAGATC
GTCCGGGATATGTACCACGGCTACGATTACCACACCGCAATGAGTGGTACCCCACAAGAGCGCCTTGCCATGATGGGTGG
AGCCATTGAGTGGATACTCAATCTTCAGCAGCAACTGGCAGCCATAGCGCAGACCAAAGAAGGCAAAAAAGAGGCCCACC
GCCGCTATCAGGATGCCGTGCTAGCGTTGTCTAAGGCGTTCGCCCTGGCATCGGCCTCCGATGAGGCCCGCCACATCCGC
GAAGACGTCGGCTTCTTCCAGGCGATCCGTGCCGCCCTCGTCAAGAGCGCCGATCATTCCGGGGGCAGCGAACCGCAACG
TGACCTGGCCATCCAGCAGATCGTGAGCCGAGCCGTCGTCTCGACGGAGATCGTCGATATCCTGGCGGCTTCTGGAATCA
ACAGCCCAGATATTTCCATCCTGTCCGACGCATTTCTCGCTGAAATTCAGCAGATGCAACGAAAGCACCTCGCCTTAGAA
GCCTTGCGCAAGCTGTTGAACGACCGCATTCGTTCCCGCAGCACCGTCAACCTCGTGCAAACCAAGGCCTTCTCCGAGCG
CCTGGAAGGTGCTGTGGCGCGCTACCACGCCAACGCGATCACCACTGCCCAGGTGCTCCAGGAACTGATCCAATTGGCCA
AAGACATCCGTGCTGCTCGCCAGCGCGGCGAAGCCTCCGGATTGTCTGATGAAGAAGTTGCCTTCTACGACGCACTGGCC
GAAAACGAAAGCGCTGTACAGATGATGGGTGATCATACCCTCCGGCTGATCGCTCACGAATTACTCATGCGCCTGCGCGA
AAACGTCTCAGTGGATTGGGCCCATCGTGACTCCGCCCGTGCCCGGATGCGTGTCCTAGTGAAGCGCATCCTGCGCAAGT
ACGGCTACCCGCCTGATCTACAGAACACGGCTGTGCAGACCGTCCTCCAGCAGGCCGAAGCCTTCTCGTCCCAGTGGAGT
GCTTCTGGACCCTAA

Protein sequence :
MAFLSEATLEQALLEQLRALGYSIEREENIGPDGYRPERASHDEVVLKQRFVDAVARLNPGLPPEARQDAIRKVTQSELP
SLLEENRRIHRLITEGVDVEYYAQDGTLTAGNVALIDFDHPQQNDWLAVNQFVVIADHYNRRPDVVLFVNGLPLAVIELK
APGSINATLIAAFNQLQTYKEQIPALFNTNALLVTSDGMTARFGALSADLERFMPWRTTDGTDIAPKGVPELATLIEGIC
EPHRLLDLLRHFTVFRKTDAGLAKIIAGYHQFHAVRQAINSTVAASSPQGNQRIGVIWHTQGSGKSLLMAFYAGQLVKHP
AMANPTLVVLTDRNDLDDQLFTTFSQCSDLIRQTPVQAQSRDQVRKLLNRASGGVIFTTLQKFGDITEPLTTRRNVVVIA
DEAHRSQYGFKAKVDTKTGKISYGFAKYMRDALPNASCIGFTGTPIEADDVNTPAVFGNYIDIYDISRAVEDGATVPIYY
ESRLARIALDEAEKPQIDAEVNALTEADSEAEQERFKKKWATVETLVGSDKRLALIAKDIITHCEDRLAALDGKAMVVCM
SRRICVALYDHIVALRPDWHSTDHKAGSLKIVMTGTASDPPQWRQHIGNKARRDLLAERARDPKDPLKLVIVRDMWLTGF
DAPCMHTMYLDKPMQGHGLMQAIARVNRVFRDKPAGLIVDYIGIAQNLRTALQQYSKNDQQHTGVDEAQAIALMMEKYQI
VRDMYHGYDYHTAMSGTPQERLAMMGGAIEWILNLQQQLAAIAQTKEGKKEAHRRYQDAVLALSKAFALASASDEARHIR
EDVGFFQAIRAALVKSADHSGGSEPQRDLAIQQIVSRAVVSTEIVDILAASGINSPDISILSDAFLAEIQQMQRKHLALE
ALRKLLNDRIRSRSTVNLVQTKAFSERLEGAVARYHANAITTAQVLQELIQLAKDIRAARQRGEASGLSDEEVAFYDALA
ENESAVQMMGDHTLRLIAHELLMRLRENVSVDWAHRDSARARMRVLVKRILRKYGYPPDLQNTAVQTVLQQAEAFSSQWS
ASGP
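
The relationship between the DNA and protein sequences above can be illustrated with a minimal translation sketch. It assumes the listed DNA is the coding strand (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments; the `translate` helper is hypothetical and written only for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First seven and last five codons of the DNA sequence in this record.
print(translate("ATGGCATTTTTATCAGAAGCC"))  # prints: MAFLSEA
print(translate("GCTTCTGGACCCTAA"))        # prints: ASGP
```

Both fragments match the start and end of the protein sequence listed in the record. Passing the full DNA sequence, with line breaks included, should work the same way, since the helper strips newlines before translating.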

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
-----|--------------|---------|-------------------------|------------|----------------|-------|-------------
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 47
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 47
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 47
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 47