Gene Information

Name : XfasM23_1514 (XfasM23_1514)
Accession : YP_001830198.1
Strain : Xylella fastidiosa M23
Genome accession: NC_010577
Putative virulence/resistance : Virulence
Product : phospholipase D/transphosphatidylase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 1675754 - 1679326 bp
Length : 3573 bp
Strand : +
Note : PFAM: phospholipase D/Transphosphatidylase; KEGG: ecv:APECO1_3532 superfamily I DNA helicase
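
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that check. The function name feature_length is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record, not something the record states.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Coordinates copied from the Position field of this record.
length_bp = feature_length(1675754, 1679326)

# A protein-coding gene should span a whole number of codons;
# one of those codons is the stop codon, so it is not counted as a residue.
whole_codons, remainder = divmod(length_bp, 3)
residues = whole_codons - 1

print(length_bp, remainder, residues)
```

Under these assumptions the computed length matches the Length field, and the residue count can be compared against the protein sequence listed further down.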

DNA sequence :
ATGCAAGCACGCTCTTCTGCTTACGCGCGCTATTGGCGCCATTCACTGGCTGACAGTGGAACAGACCGTGGTGTACTTCG
TGATGCTGCTGACAGTTCATTCTTGGAGGTGTCCAAGCAGCAATTAGAAAGCGGCTGTGTGGACCGCGACATTGTCGCGG
CCTGTTTCAATAAAGAAGCTGAACAGGTCCAAACGGTCGAAGTGGTGATCCGCCTCAAGGTCTATGTGCTGCTGCTGCAA
CACGGCACGCCGCAGCGCGCGGGACACCCCAAGATCGTCACGCCGCTGGTGACACAGGCGTTATTGGCGCGGGATGGCAG
GATCTATCCGTTGGCGCGGACGATGATCCCTCGTGACATTCTGAAGCCTTTGGTAGGGGGACATTTCGCCATTGGCACGG
TGTCGGATCAGGATGCGTTTCTGACAAGGCAGCCGGTGCCTGGCATTGCTTTTACGGGGACGGAAAATAATGCGGACTCA
TTGGGGGATTCCAACAATGCCGACTTTGATCGGCAATGGAAGCAATTTTTAGCGGGCTGTGATCGGCTATTAGCGGAGGT
CGGTGGTGGCTGGCCTGGCGCTCAGGGGGATTATCTGGAGGCCGACTATGGGTATCTCACCAAGAAAGCAGCGTTCTCCA
GCCGACATATCCTGGCTTTATACGACCATCTTTTGCTCGAAACAACGCCACAGGTGCCGTTGTTTGAGCGCTATGCCAGC
GGTGAGATGGCGCCGCCTGAACTATGTTTGCCGCCGCATGCCGGGTTCTCACAACGGCTGGCGCATGGCAGCGATAAGCG
TGCTTTGAGCCGGACGCAACGCGATGCGCTAACGCATCTGCTGGTGGCTCGACAAGGGGAAATTCTTGCGGTGAATGGCC
CGCCGGGCACTGGAAAAACCACGCTGGTGTTATCGGTGGTGGCGTCGTTATGGGCGCATGCGGCGCTGGCAGGGGGGGAG
CCGCCGGTCATTGTCGCTGCGTCGACCACGAATCAGGCGGTGACCAATATCATTGATGCGTTTGAGAAGAATTTTGCAAA
GGGAGAGGGTCCTTTTGCGGGGCGTTGGCTGCCCAAGATCAAGAGTTTTGGTGCTTTATTAGGCTCAGCCGACAAAGAGA
GCAAAACAGCAGACAAGTATCACACCGAAGATTTTTTTAATGCGGTGGAATCAGTCGGCTACATCGCTGAGGCAGAGCAG
CATTATTTGCGTGCGGCGGCGGCGGCATTTCCAGAGATCCCGCAGGCGTCGTTCAATGTCAAGGGTGTCACCGCGGCATT
GCAGGAGGCGATTCGGAAGGAGGCGGCAACACTGGCCGCCATTGAGCAGGCGTGGTCAGGCTTGAATGATGCGCGTGATG
CGCTGCGGGCCGAGTTGGGGGAGACGCCCGCGACGACGATGACGCAACGCCGCACCCAACGGGACGCAGCGCAGGCGGAG
AAGCAGGTCATTGAAACGCTCATCACGCAGTGGGAGCGCTATGGTGCGGACGAGTCGCTGGTGTATGGGTTGTTTTCTTG
GCTGCCTGCGGTGGCCAAAAAGCGGATGCGGTTGGTCCGGGTGTTTTTGAACTCCATCTGGCCAGTGCATTACCCAACAC
AGACGTGGGAGAGCATTGAGCAGGTTGATGGGTGGCTCAGTGGGATGCAGCGCCAATGTGATGACAGGCTTCAGGAGCAC
TCCTGGGCTGTGACACGCGGTGAAAAGGTGCTGCATGAGGAGCAGTGCCACATCACCACATGGCAGGCCGCGTTAGCAAA
GGAGTCAGCACTTACAGGGCTGATGGAGGCGGCAGGCCAGGTCTCTCTTGCGGACTGTGATGCACTCGCCGATACATCCA
TCCGCTTTCGGATCTTTCTGCTAACCACGCACTATTGGGAGGGGCGTTGGTTGTCGGAGATGCAGGCCCAGTTGCCAGCG
GATCTGGATGGGGAGAAGAAAAAGACGGGGGGTACCGCCGTGGAACCGCGCTGGCGGCGACGGATGAAATTAACGCCGTG
TATTGTGTCGACGCTCTTCATGCTGCCCAAAAAGATGCAGGTTGCCAGGCGTGATGAGAAAAACTTCAGCCGTAACTACC
TGTATGACTTTGCGGATTTGCTGATTGTTGACGAGGCGGGCCAGGTGCTTCCTGAAGTAGCTGGTGCATCATTTGCACTG
GGCAAGCAAGCTTTGGTGATTGGGGACCAGCTGCAGATTGAACCGATTTGGTCGATTCCGGAAAGCATTGATATTGGTAA
CCTGCATGCTGCTGAGCTGCTGGGGAAGGAGGAGGATGCCTACGAGCGGCTTTGTCAGTCTGGAAGATCCGCAGCATCAG
GCAGCGTGATGCAGATCGCACAACGACTGAGCCGGTATCACTATGACCCTGAGATGGCACGCGGCATGTTTCTGTACGAG
CACCATCGCTGCTTTGATGAGATTGTCAGTTACTGCAATGCGCTGTGCTATCAAGGCAAACTCATCCCCAAGCGCGGACC
CAAGGTGGCAGCGCTGAAGAATGCAGCAGGAAAGGAAACAGGAGATGGTTTGCCGGCGCTGGGGTATCTGCACGTCGATG
GTCTCTGCCAGAAAAGTAGCGGCGGCAGCCGCCATAACCTCTATGAGGCACAGACCATTGCGGCGTGGCTTGCCGAGCAC
CGCGAGTCGTTGCAAGCACAGTATGGGAAGCCGCTGCATTGCATTGTGGGCATCGTGACGCCGTTTGGGGCGCAGGTACG
TGCCATTTCACAAGCCTGCCGTGACGTGGGAATTGAGGTGGGCCACGAAAAGGATGGGATCACGGTTGGGACGGTGCATG
CGCTGCAAGGGGCGGAACGGCCTGTGGTGATCTTTTCTGCGGTGTATAGCAAGCATGCCGATGGCGGTTTTATCGACCAG
CGTACAAGCATGTTGAATGTGGCGGTGTCGCGTGCCAAAAACACTTTTTTAGTGTTTGGGGATATGGATGTCTTCACGGC
GGCACCAAAGAGCAGGCCGCGTGGCCTGTTGGCGCATTATCTTTTTAAAGATGCAAGCAACGCGCTGTGTTTTCAACCGT
TAGTCCGCAAAGATCTGCAACAGGTCAGTACAGCGGTGGAGGTACTGCAGGACGCCGCCGAACATGACGCTTTTTTATTG
AAGGCGCTCAACAAGGTGCAGCGTGAAATTCATATTGTGAGTCCCTGGATCAACAAAGATCGCATTCAAGACATCGGTGC
CTTCAAGGCAATGCAGGAGGCGGTGAAGCGTCAGGTGCAGGTCACGGTCTACACGGATCAGGATTTGAACACCGACGACA
AGAAAGACATAAAGAAGATCACTAAAGTGCTGCAAGCCGCCAGGGCGTTGCGCGGTGTGGGGATTGAGGTGAATTTTGTC
GACAGAGTCCACAGCAAAATGGTGATTGGGGACGATGAGGTGTTTTGTGTTGGGTCGTTCAATTGGTTCAGTGCCAACAG
AAGCGCAATGTACGCCAAGCATGAAACGTCATTGGTTTATCGTGGTCGCGGGCTTGCAGATGAGAGGCAAACCAGGCTGA
ACAGTTTACGGCAGCGAATCACGGATGATCCTCAGCAAATCGGCCAGGCATGA

Protein sequence :
MQARSSAYARYWRHSLADSGTDRGVLRDAADSSFLEVSKQQLESGCVDRDIVAACFNKEAEQVQTVEVVIRLKVYVLLLQ
HGTPQRAGHPKIVTPLVTQALLARDGRIYPLARTMIPRDILKPLVGGHFAIGTVSDQDAFLTRQPVPGIAFTGTENNADS
LGDSNNADFDRQWKQFLAGCDRLLAEVGGGWPGAQGDYLEADYGYLTKKAAFSSRHILALYDHLLLETTPQVPLFERYAS
GEMAPPELCLPPHAGFSQRLAHGSDKRALSRTQRDALTHLLVARQGEILAVNGPPGTGKTTLVLSVVASLWAHAALAGGE
PPVIVAASTTNQAVTNIIDAFEKNFAKGEGPFAGRWLPKIKSFGALLGSADKESKTADKYHTEDFFNAVESVGYIAEAEQ
HYLRAAAAAFPEIPQASFNVKGVTAALQEAIRKEAATLAAIEQAWSGLNDARDALRAELGETPATTMTQRRTQRDAAQAE
KQVIETLITQWERYGADESLVYGLFSWLPAVAKKRMRLVRVFLNSIWPVHYPTQTWESIEQVDGWLSGMQRQCDDRLQEH
SWAVTRGEKVLHEEQCHITTWQAALAKESALTGLMEAAGQVSLADCDALADTSIRFRIFLLTTHYWEGRWLSEMQAQLPA
DLDGEKKKTGGTAVEPRWRRRMKLTPCIVSTLFMLPKKMQVARRDEKNFSRNYLYDFADLLIVDEAGQVLPEVAGASFAL
GKQALVIGDQLQIEPIWSIPESIDIGNLHAAELLGKEEDAYERLCQSGRSAASGSVMQIAQRLSRYHYDPEMARGMFLYE
HHRCFDEIVSYCNALCYQGKLIPKRGPKVAALKNAAGKETGDGLPALGYLHVDGLCQKSSGGSRHNLYEAQTIAAWLAEH
RESLQAQYGKPLHCIVGIVTPFGAQVRAISQACRDVGIEVGHEKDGITVGTVHALQGAERPVVIFSAVYSKHADGGFIDQ
RTSMLNVAVSRAKNTFLVFGDMDVFTAAPKSRPRGLLAHYLFKDASNALCFQPLVRKDLQQVSTAVEVLQDAAEHDAFLL
KALNKVQREIHIVSPWINKDRIQDIGAFKAMQEAVKRQVQVTVYTDQDLNTDDKKDIKKITKVLQAARALRGVGIEVNFV
DRVHSKMVIGDDEVFCVGSFNWFSANRSAMYAKHETSLVYRGRGLADERQTRLNSLRQRITDDPQQIGQA
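
The DNA and protein sequences above can be cross-checked by translating the coding strand. The following is a rough sketch that translates only the first 30 and the last 18 nucleotides of the DNA sequence. It assumes the standard codon assignments (which the bacterial translation table shares for elongation codons), and the names CODON_TABLE and translate are hypothetical helpers, not part of any database tool.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 0; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 18 nt of the DNA sequence in this record.
head = translate("ATGCAAGCACGCTCTTCTGCTTACGCGCGC")
tail = translate("CAAATCGGCCAGGCATGA")

print(head)
print(tail)
```

The head should correspond to the first ten residues of the protein sequence, and the tail to its last five residues followed by the stop codon. Translating the full 3573 bp sequence the same way would extend the check to the whole record.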

• Homologs from PAI DB

Gene        | GenBank Accn | Product                    | Virulence or Resistance | PAI or REI    | Alignment Type | E-val | Identity (%)
S3169       | NP_838460.1  | superfamily I DNA helicase | Not tested              | SHI-1         | Protein        | 0.0   | 47
SF2965      | NP_708739.1  | superfamily I DNA helicase | Not tested              | SHI-1         | Protein        | 0.0   | 47
APECO1_3532 | YP_854230.1  | superfamily I DNA helicase | Not tested              | PAI I APEC-O1 | Protein        | 0.0   | 47
ORF_2       | AAZ04413.1   | superfamily I DNA helicase | Not tested              | PAI I APEC-O1 | Protein        | 0.0   | 47
unnamed     | CAD42018.1   | hypothetical protein       | Not tested              | PAI II 536    | Protein        | 0.0   | 46

• Homologs from VFDB (virulence genes)

Gene         | GenBank Accn   | Product                              | ID of source DB | Alignment Type | E-val | Identity (%)
XfasM23_1514 | YP_001830198.1 | phospholipase D/transphosphatidylase | VFG0627         | Protein        | 0.0   | 47
XfasM23_1514 | YP_001830198.1 | phospholipase D/transphosphatidylase | VFG1537         | Protein        | 0.0   | 46