Gene Information

Name : XF2409 (XF2409)
Accession : NP_299688.2
Strain : Xylella fastidiosa 9a5c
Genome accession: NC_002488
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 2285562 - 2289128 bp
Length : 3567 bp
Strand : +
Note : similar to SP|P39369 (percent identity: 47 %/query alignment coverage: 96.8 %/subject alignment coverage: 99.4 %); identified by sequence similarity; ORF located using Glimmer/RBSfinder/Start codon shift: 2523

DNA sequence :
ATGCAAGCACGTTCTTCTGCTTACGCACGCTATTGGCGCCATTCACTGGCTGACAGTGGAACAGACCGTGGTGTACTTCG
TGATGCTGCTGACACTTCATTCTTGGAGGTGTCCAAGCAGCAATTAGAAAGCGGCTGTGTGGACCGCAACATTGTCGCGG
CCTGTTTCAATAAAGAAGGCGAACAGGTCCAAACGGTCGAAGTGGTGATCCGCCTCAAGGTCTATGTGCTGCTGCTACAA
CACGGCACGCCGCAGCGCGCGGGAAACCCCAAGATCGTCACGCCGCTGGTGACACAGGCGTTATTGGGGCGGGATGGCAG
GATCTATCCGTTGGCGAAGACGATGATCTGTCGTGACATTTTGGAGCCTTTGGCAGGGGGACATTTCGCGATTGGCACGG
TGGCGGATCAGGATGCGTTTCTGACAAGGCAGCCAGTGCCTGGCATTGCTTTTACGGGGACAGAAAATAATGGGGACTCA
TTGGAGGAGTCCAACAATGCCGACTTTGATCGGCAATGGAAGGAGTTTCTAGCGGGTTGTGATCGGCTATTAGCGGAGGT
CGGTGGTGCCTGGCCCGGCGCTCAGGGGGATTATCTGGAGGCCGACTATGGGTATCTCACCAAGAAAGCAGCGTTCTCCA
GCCGACATATCCTGGCTTTATACGACCATCTTTTGCTCGAAACAACGCCACAGGTGCCGTTGTTTGAGCGCTATGCCAGC
GGTGAGATGGCGCCGCCGGAACCGTGTTTGCCGCCGCATGCCGGGTTCTCACAACGGCTGGCGCATGGCAACAATGAGCA
TCCTTTGACCCGGACGCAACGCGATGCGCTAACGCATCTGCTGGTAGCTCGACAAGGGGAAATTCTTGCGGTGAATGGCC
CGCCGGGCACTGGAAAAACCACGCTGGTGTTATCGGTGGTGGCGTCGTTATGGGCGCATGCGGCGCTGGCAGGGGGGGAG
CCGCCGGTCATTGTCGCTGCGTCGACCACGAATCAGGCGGTGACCAATATCATTGATGCGTTTGGGAAGAATTTTGCAAA
GGGAGAAGGTCCTTTTGCGGGGCGTTGGCTACCCAAAATTAATAGTTTCGGCGCGTTATTAGGCTCGGACGACAAAAAGA
AAAAGACGGCAGACAAATACCACACCGAGGATTTTTTTAATGCGGTGGAATCAGCCGGCTACATCGCTGAGGCAGAACAG
CATTATTTGCGTGCGGCGGCGGCGGCATTTCCAGAGATCCCGCAGGCGTCGTTCAATGTCAAGAGTGTCACCGCGGCATT
GCTGGGGAAGATTCGGAAGGAGGCGGCAACACTGGCCGACATTGAGCAGGCGTGGTCAGGCTTGAATGATGCGCGTGATG
CGCTGCGGGCCGAGTTGGGGGAGAGGCCCGCTACGACGATGACGCAACGCCGCACCCAACGGGACGCAGCGCAGGCGGAG
AAGCAGGTCATTGAAACGCTCATCACCCAGTGGGAGCGCTATGCTGCGGACGAGTCGCTGGTGTATGGGTTGTTTTCTTG
GCTGCCTGCGGTGGCCAAAAAGCGGATGCGGTTGATCCGTGTGTTTTTGAACTCCATCTGGCCAGTGGATTACCCAACAC
AGACGTGGGAGAGCATTGAGCAGGTTGATGGGTGGCTCAGCGAGATACAGCGCCAATGTGATCACAGGCTTCAGGAGCAC
TCCTGCGCTGTGACACGCGGTGAAGCGGTGCTGGATGCGGAGCAGTGCCACATCACCACATGGCAGAAGGCGTTAGCACC
TCTAGAGCTGACGGAGGCGGAGGCGGCAGGCCAGGTGTCTCTTGCGGACTGTGATGCACACGCCGATACATCCATCCGCT
TTCGGATCTTTCTGCTAACCACGCACTATTGGGAGGGGCGTTGGTTGTCGGAGATGCAGGCCCAGTTGCCAGGGGATCTG
GATGCGGAGAAGAAAAAGACGGGGCGTACCGCCGTGGAACCGCGCTGGCGGCGACGGATGAAATTAACGCCGTGTATTGT
GTCGACGTTTTTCATGCTGCCCAAAAAGATGCAGGTTTTCAGGAGTGATGAGAAAAGCTTCAGCCCTCACTACCTCTATG
ACTTTGCGGATTTGCTGATTGTTGACGAGGCGGGCCAGGTGCTTCCTGAAGTAGCTGGTGCGTCATTTGCACTGGGCAAG
CAAGCTTTGGTGATTGGGGACCAGCTGCAGATTGAACCGATTTGGTCGATTCCGAAAAGCATTGATATTGGTAACCTGCA
GGCTGCTGAGCTGCTGGGGAAGGAGGAGGATGCCTACGAGCGGGTTTGTGAGTCTGGAAGATCCGCGGCGTCAGGCAGAG
TGATGCAGATCGCACAACGGCTGAGCCGGTATCACTATGACCCTGCGATGGCACGCGGCATGTTTCTGCATGAGCACCAT
CGCTGCTTTGATGAGATTGTCAGTTACTGCAATGCGCTGTGCTATCAAGGCAAACTCATCCCCCAACGCGGACCCAAGGT
GGCAGCGCTGAAGAATGCAGAAGGAAAGGAAACAGGAGACAGTTTGCCGGCGCTGGGATATCTGCACGTCGATGGTCTCT
GCCAGAAAAGTAGCGGCGGCAGTCGATATAACCTCTATGAGGCGGAAACCATTGCGGCGTGGCTTGCCGAGCACCACGAG
TCGCTGCAAGCACAGTATGGGGAGCCGCTGCATCGCATTGTGGGCATCCTCACGCCGTTTGAGACGCAGGCGCGTGCAAT
TTCACAAGCCTGCCGTGACGTGGGAATTGAGGTGGGCCACGAAAAGGATGGGATCACGGTTGGGACGGTGCATGCGCTGC
AAGGGGCGGAACGGCCTGTGGTGATCTTTTCTGCGGTGTATAGCAAGCATGCCGATGGCGGTTTTATCGACCAGCGCGCA
AGCATGTTGAATGTGGCGGTATCGCGTGCCAAAAACACTTTTTTAGTGTTTGGGGATATGGATGTCTTCACGGCGGCACC
AAAGAGCAGGCCGCGTGGCCTGTTGGCGCATTATCTTTTCAAAGATCCAAGCAACGCGCTGTGTTTTCAACCGTTAGTCC
GCAAAGATCTGCAACAGGTCAGTACAGCGGTGGAGGTACTGCAGGACGCCGCCGAACATGACGCTTTTTTATTGCAGGCG
CTCAACAAGGTGCAGCGTGAAATCCATATTGTGAGTCCCTGGATCAACAAAGATCGCATTCAAGACATCGGTGCCTTCAA
AGCAATGCAGGAGGCGGTGGAGCGTCAGGTGCAGGTCACGGTCTACACGGATCAGGATTTGAACACCGACGATAAGAAAG
ACACAAAGAAGATCGCTGAAGTGCTGCAAGCCGCCAGGGCGTTGCGCGGTGTGGGGATTGAGGTGCATTTTGTCGACAGA
GTCCACAGCAAAATGGTGATTGGGGACGATGAGGTGTTTTGTGTTGGGTCGTTCAATTGGTTCAGTGCCAACAGAAGCGC
AATGTACGCCAAGCATGAAACGTCATTGGTTTATCGTGGTCGCGGGCTTGCAGATGAGAGGCAAACCAGGCTGAACAGTT
TACGGCAGCGAATCACGGATGATCCTCAGCAAGTCGGCCAGGCGTGA

Protein sequence :
MQARSSAYARYWRHSLADSGTDRGVLRDAADTSFLEVSKQQLESGCVDRNIVAACFNKEGEQVQTVEVVIRLKVYVLLLQ
HGTPQRAGNPKIVTPLVTQALLGRDGRIYPLAKTMICRDILEPLAGGHFAIGTVADQDAFLTRQPVPGIAFTGTENNGDS
LEESNNADFDRQWKEFLAGCDRLLAEVGGAWPGAQGDYLEADYGYLTKKAAFSSRHILALYDHLLLETTPQVPLFERYAS
GEMAPPEPCLPPHAGFSQRLAHGNNEHPLTRTQRDALTHLLVARQGEILAVNGPPGTGKTTLVLSVVASLWAHAALAGGE
PPVIVAASTTNQAVTNIIDAFGKNFAKGEGPFAGRWLPKINSFGALLGSDDKKKKTADKYHTEDFFNAVESAGYIAEAEQ
HYLRAAAAAFPEIPQASFNVKSVTAALLGKIRKEAATLADIEQAWSGLNDARDALRAELGERPATTMTQRRTQRDAAQAE
KQVIETLITQWERYAADESLVYGLFSWLPAVAKKRMRLIRVFLNSIWPVDYPTQTWESIEQVDGWLSEIQRQCDHRLQEH
SCAVTRGEAVLDAEQCHITTWQKALAPLELTEAEAAGQVSLADCDAHADTSIRFRIFLLTTHYWEGRWLSEMQAQLPGDL
DAEKKKTGRTAVEPRWRRRMKLTPCIVSTFFMLPKKMQVFRSDEKSFSPHYLYDFADLLIVDEAGQVLPEVAGASFALGK
QALVIGDQLQIEPIWSIPKSIDIGNLQAAELLGKEEDAYERVCESGRSAASGRVMQIAQRLSRYHYDPAMARGMFLHEHH
RCFDEIVSYCNALCYQGKLIPQRGPKVAALKNAEGKETGDSLPALGYLHVDGLCQKSSGGSRYNLYEAETIAAWLAEHHE
SLQAQYGEPLHRIVGILTPFETQARAISQACRDVGIEVGHEKDGITVGTVHALQGAERPVVIFSAVYSKHADGGFIDQRA
SMLNVAVSRAKNTFLVFGDMDVFTAAPKSRPRGLLAHYLFKDPSNALCFQPLVRKDLQQVSTAVEVLQDAAEHDAFLLQA
LNKVQREIHIVSPWINKDRIQDIGAFKAMQEAVERQVQVTVYTDQDLNTDDKKDTKKIAEVLQAARALRGVGIEVHFVDR
VHSKMVIGDDEVFCVGSFNWFSANRSAMYAKHETSLVYRGRGLADERQTRLNSLRQRITDDPQQVGQA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
ORF_2 AAZ04413.1 superfamily I DNA helicase Not tested PAI I APEC-O1 Protein 0.0 47
APECO1_3532 YP_854230.1 superfamily I DNA helicase Not tested PAI I APEC-O1 Protein 0.0 47
S3169 NP_838460.1 superfamily I DNA helicase Not tested SHI-1 Protein 0.0 46
SF2965 NP_708739.1 superfamily I DNA helicase Not tested SHI-1 Protein 0.0 46
unnamed CAD42018.1 hypothetical protein Not tested PAI II 536 Protein 0.0 45

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
XF2409 NP_299688.2 hypothetical protein VFG0627 Protein 0.0 46
XF2409 NP_299688.2 hypothetical protein VFG1537 Protein 0.0 45