Open Access

Open Access Resources

The data hosted on this page are open access resources freely available to the scientific community to use and analyze for research purposes.

Resistance Gene Discovery Project

This research was funded by the Gordon and Betty Moore Foundation through Grant GBMF4725 from 2015-2019.

The Challenge    

Plants contain a broad repertoire of immune receptors (R genes) that mediate recognition of disease-causing organisms (pathogens). The major class of Rgenes is the NB-LRR (nucleotide-binding, leucine rich repeat) type, and an individual plant contains hundreds of different receptors. Activation of an NB-LRR receptor by a pathogen generally results in resistance to that pathogen, so the complement of NB-LRRs a plant possesses is a major determinant of which diseases it resists or succumbs to. Both pathogens and R genes evolve rapidly. Knowledge of R gene diversity within and across species would provide insight into the evolution of these important genes, improve our understanding of plant resistance, and aid the development of disease-resistant crops. However, the diversity of NB-LRRs within a given species and across related species is still poorly understood. Despite the advances in genome sequencing and explosion of available sequence data, until recently NB-LRR genes have been hard to sequence and annotate for technical reasons, because they are relatively similar to one another and are often found in clusters of highly similar alleles.

The Strategy

Scientists at the Sainsbury Laboratory in Norwich developed a fast, accurate, and cost-efficient technique for isolating and sequencing just the NB-LRR genes from a plant. Using this method, termed Resistance Gene Enhancement Sequencing (RenSeq), our collaborators sequenced resistance genes from hundreds of different plants within three important plant families. The analysis focused on crop plants and their wild relatives in three important plant families: the Solanaceae (tomatoes, potatoes, and their relatives) Triticeae (wheat, barley and related grains) and Brassicaceae (mustard greens, cabbage, turnips, canola and their relatives). The analysis included different individual isolates from a given species that evolved in different places in order to elucidate the diversity within species as well as between species. The results are publicly available in a database (below) to assist others analyzing or breeding for disease resistance in these species.

The Science

Our collaborators used the large dataset to answer important questions about R gene evolution, such as whether certain R gene families are evolving rapidly under pathogen pressure, whether some R genes have costs to the host plant in the absence of pathogen pressure, whether some classes of R genes have been selected or lost during domestication of crops, and whether there are novel classes of R genes that have not yet been described.

PROJECT: AT A GLANCE

CROP
Solanaceae (tomatoes, potatoes, and their relatives) Triticeae (wheat, barley and related grains) and Brassicaceae (mustard greens, cabbage, turnips, canola and their relatives)

COLLABORATORS
Prof. Jeff Dangl, University of North Carolina

Prof. Jonathan Jones, Sainsbury Laboratory, UK

Prof. Brian Staskawicz, University of California, Berkeley

Prof. Dr. Detlef Weigel, Max Planck Institute for Developmental Biology, Germany

FUNDING SUPPORT
This project was funded by the
Gordon and Betty Moore Foundation to 2Blades through Grant GBMF4725 from 2015-2019.

Arabidopsis thaliana

The collection of resistance genes from 69 accessions of Arabidopsis thaliana was sequenced as part of the GBMF-funded Resistance Gene Diversity (pan-NLR’ome) project performed by the laboratories of Detlef Weigel (MPI), Jeff Dangl (UNC), and Jonathan Jones (TSL) and conducted under the auspices of the 2Blades Foundation. The provided data contain circular consensus sequence (CCS) reads from Resistance Gene Enrichment Sequencing (Ren-Seq) produced on the PacBio platform. These reads were subsequently assembled into contigs.

Additional resources related to this data set are described and linked in Van der Weyer et al., 2019.

DOWNLOAD
(Data hosted at Max Planck Institute for Developmental Biology Tübingen)

Nicotiana and Solanum species

The collection of resistance genes from four species of Nicotiana and four Solanum species were sequenced as part of the GBMF-funded Resistance Gene Diversity (pan-NLR’ome) project performed by the laboratory of Jonathan Jones (TSL) and conducted under the auspices of the 2Blades Foundation. The data were uploaded to European Nucleotide Archive (ENA) and contain subreads from Resistance Gene Enrichment Sequencing (RenSeq) produced on the PacBio platform.

Species Sample NCBI Accession
Nicotiana tabacum ERS1972100 (SAMEA104347122) ERX2225148
Nicotiana sylvestris ERS1972101 (SAMEA104347123) ERX2225149
Nicotiana tomentosiformis ERS1972102 (SAMEA104347124) ERX2225150
Solanum tuberosum Group Phureja ERS1972103 (SAMEA104347125) ERX2225151
Solanum tuberosumcv Maris Piper ERS1972104 (SAMEA104347126) ERX2225152
Solanum lycopersicum accession FLA8059 ERS1972105 (SAMEA104347127) ERX2225153
Solanum americanum accession 944750095 ERS1972106 (SAMEA104347128) ERX2225154
Nicotiana benthamiana ERS1972107 (SAMEA104347129) ERX2225147

Tomatoes and other Solanaceae

The collection of resistance genes from tomato, pepper, N. benthamiana and wild Solanum species were sequenced as part of the GBMF-funded Resistance Gene Diversity (pan-NLR’ome) project performed by the laboratory of Brian Staskawicz and conducted under the auspices of the 2Blades Foundation. The data were uploaded to the National Center for Biotechnology Information (NCBI)  under the BioProject PRJNA496490 and contain circular consensus sequence (CCS) reads from Resistance Gene Enrichment Sequencing (RenSeq) produced on the PacBio platform.

Restrictions on data use

You are welcome to use these data to advance your studies of the plant immune system.

By accessing these data, you agree not to publish any articles containing analyses of genes or genomic data on a genome-wide, whole-assembly scale prior to publication by the producers (Reserved Analyses). Reserved Analyses include the identification of complete sets of genomic features such as genes, gene families, regulatory elements, repeat structures, GC content, or any other genome feature, and comparisons with other Solanaceae.

The embargo on publication of Reserved Analyses by researchers outside of the pan-NLR’ome Project is expected to extend until a first publication of the results of the project is accepted. Academic users are free to publish papers dealing with specific genes or small sets of genes. If these data are used for publication, please contact the responsible persons below to determine the acknowledgement.

Species NCBI Accession
S. lycopersicum Heinz SRX4891006
S. cheesmaniae 0422 SRX4891000
S. cheesmaniae 1039 SRX4891003

S. chmielewskii 1316 SRX4891008
S. chmielewskii 1330 SRX4891001
S. galapagense 0483 SRX4891007
S. galapagense 1136 SRX4891004
S. neorickii 1716 SRX4891009
S. pimpinellifolium 0722 SRX4891005
S. pimpinellifolium 1269 SRX4891002
C. annuum ECW SRX4891011
C. annuum ECW 20R SRX4891012
N. benthamiana SRX4891010

Rice

NLR-encoding genes were sequenced from 66 accessions of Oryza sativa as part of the GBMF-funded Resistance Gene Diversity (pan-NLR’ome) project performed by the laboratory of Jonathan Jones (TSL) in collaboration with Jean-Benoit Morel and Thomas Kroj (INRA) and managed by the 2Blades Foundation. Data were obtained via Resistance Gene Enrichment Sequencing (Ren-Seq) for 14 accessions via PacBio circular consensus sequence (CCS) and for 61 accessions (including those 14) via paired end Illumina reads on the HighSeq 2500 platform.

Restrictions on data use

You are welcome to use these data to advance your studies of the plant immune system.

By accessing these data, you agree not to publish any articles containing analyses of genes or genomic data on a genome-wide, whole-assembly scale prior to publication by the producers (Reserved Analyses). Reserved Analyses include the identification of complete sets of genomic features such as genes, gene families, regulatory elements, repeat structures, GC content, or any other genome feature, and comparisons with other Poaceae.

The embargo on publication of Reserved Analyses by researchers outside of the pan-NLR’ome Project is expected to extend until a first publication of the results of the project is accepted. Academic users are free to publish papers dealing with specific genes or small sets of genes. If these data are used for publication, please contact the responsible persons below to determine the acknowledgement.

All rice sequences can be downloaded through ENA under projects PRJEB23459and PRJEB29200.

Rice Variety Sample Accession
K1_Xiaogu SAMEA104389502 (ERS2007444)
K8_Xiaogu SAMEA104389503 (ERS2007445)
YYT72_Zinuo SAMEA104389523 (ERS2007465)
P04_Baijao SAMEA104389512 (ERS2007454)
T09_Acuce SAMEA104389518 (ERS2007460)
YYT73_Acuce SAMEA104389524 (ERS2007466)
V18_Baijao SAMEA104389521 (ERS2007463)
YYT31_Acuce SAMEA104389522 (ERS2007464)
E1_Hongjiao SAMEA104389492 (ERS2007434)
Q7_Acuce SAMEA104389514 (ERS2007456)
G9_Hongjiao SAMEA104389495 (ERS2007437)
G13_Hongjiao SAMEA104389494 (ERS2007436)
H15_Hongjiao SAMEA104389497 (ERS2007439)
R17_Acuce SAMEA104389515 (ERS2007457)
J05_Hongyang2 SAMEA104389501 (ERS2007443)
M10_Acuce SAMEA104389506 (ERS2007448)
S10_Acuce SAMEA104389516 (ERS2007458)
B05_Baijao SAMEA104389489 (ERS2007431)
V11_Baijao SAMEA104389520 (ERS2007462)
F22_Hongjiao SAMEA104389493 (ERS2007435)
H05_Hongjiao SAMEA104389496 (ERS2007438)
HYYT76B_Huangpinuo SAMEA104389498 (ERS2007440)
YYT78_Huangpinuo SAMEA104389526 (ERS2007468)
Maratelli SAMEA104389507 (ERS2007449)
Padi_Boenor SAMEA104389513 (ERS2007455)
Kasalath SAMEA104389504 (ERS2007446)
Kitaake SAMEA104389505 (ERS2007447)
IRAT13 SAMEA104389500 (ERS2007442)
Tetep SAMEA104389519 (ERS2007461)
Oryzica_Llanos_5 SAMEA104389511 (ERS2007453)
Azucena SAMEA104389488 (ERS2007430)
Moroberekan SAMEA104389508 (ERS2007450)
IR64 SAMEA104389499 (ERS2007441)
Sariceltik SAMEA104389517 (ERS2007459)
Nipponbare SAMEA104389509 (ERS2007451)
Chepugu SAMEA4975981 (ERS2794052)
Lubaigu SAMEA4975982 (ERS2794053)
Zaogu SAMEA4975983 (ERS2794054)
Epugu SAMEA4975984 (ERS2794055)
Jianshuigu SAMEA4975985 (ERS2794056)
Honglueduolu SAMEA4975986 (ERS2794057)
Mingliangyou 527 SAMEA4975987 (ERS2794058)
Liangyou 2161 SAMEA4975988 (ERS2794059)
Jinpinggu SAMEA4975989 (ERS2794060)
Luhonggu SAMEA4975990 (ERS2794061)
B06 Acuce SAMEA4975991 (ERS2794062)
P03 Acuce SAMEA4975992 (ERS2794063)
D15 Baijiao SAMEA4975993 (ERS2794064)
D09 Baijiao SAMEA4975994 (ERS2794065)
T18 Acuce SAMEA4975995 (ERS2794066)
YYT31 Bijiaolaojing SAMEA4975996 (ERS2794067)
F13 Hongjiao SAMEA4975997 (ERS2794068)
E5 Hongjiao SAMEA4975998 (ERS2794069)
J08 Hongyang 2 SAMEA4975999 (ERS2794070)
J11 Hongyang 2 SAMEA4976000 (ERS2794071)
L13 Xiaogu SAMEA4976001 (ERS2794072)
M1 Xiaogu SAMEA4976002 (ERS2794073)
YYT60 Xiaogu SAMEA4976003 (ERS2794074)
YYT62 Xiaogu SAMEA4976004 (ERS2794075)
YYT65 Xiaogu SAMEA4976005 (ERS2794076)
YYT24 Xiaogu SAMEA4976006 (ERS2794077)
YYT76B_Huangpinuo SAMEA104389525 (ERS2007467)
NSF-TV116 SAMEA104389510 (ERS2007452)
Cocodrie_301379 SAMEA104389491 (ERS2007433)
C101A51 SAMEA104389490 (ERS2007432)
Zhenshan2 SAMEA104389527 (ERS2007469)

Triticeae

Triticum aestivum (Bread wheat)

NLR-encoding genes were sequenced from 100 wheat landraces as part of the GBMF-funded Resistance Gene Diversity (pan-NLR’ome) project performed by the laboratory of Jonathan Jones (TSL) in collaboration with Dr. Brande Wulff (John Innes Centre) and under the auspices of 2Blades Foundation. The data were uploaded to the European Nucleotide Archive and can be downloaded under project PRJEB35692

Restrictions on data use

You are welcome to use these data to advance your studies of the plant immune system.

By accessing these data, you agree not to publish any articles containing analyses of genes or genomic data on a genome-wide, whole-assembly scale prior to publication by the producers (Reserved Analyses). Reserved Analyses include the identification of complete sets of genomic features such as genes, gene families, regulatory elements, repeat structures, GC content, or any other genome feature, and comparisons with other Triticeae.

The embargo on publication of Reserved Analyses by researchers outside of the pan-NLR’ome Project is expected to extend until a first publication of the results of the project is accepted. Academic users are free to publish papers dealing with specific genes or small sets of genes. If these data are used for publication, please contact the responsible persons below to determine the acknowledgement.


Triticum turgidum

NLR-encoding genes were sequenced from 59 accessions of the tetraploid wheat Triticum turgidum as part of the GBMF-funded Resistance Gene Diversity (pan-NLR’ome) project performed by the laboratory of Jonathan Jones (TSL) in collaboration with Dr. Ksenia Krasileva (University of California, Berkeley) and under the auspices of 2Blades Foundation. The data were uploaded to the European Nucleotide Archive and can be downloaded under project PRJEB34907

Restrictions on data use

You are welcome to use these data to advance your studies of the plant immune system.

By accessing these data, you agree not to publish any articles containing analyses of genes or genomic data on a genome-wide, whole-assembly scale prior to publication by the producers (Reserved Analyses). Reserved Analyses include the identification of complete sets of genomic features such as genes, gene families, regulatory elements, repeat structures, GC content, or any other genome feature, and comparisons with other Triticeae.

The embargo on publication of Reserved Analyses by researchers outside of the pan-NLR’ome Project is expected to extend until a first publication of the results of the project is accepted. Academic users are free to publish papers dealing with specific genes or small sets of genes. If these data are used for publication, please contact the responsible persons below to determine the acknowledgement.


Aegilops sharonensis

NLR-encoding genes were sequenced from 100 accessions of Aegilops sharonensis as part of the GBMF-funded Resistance Gene Diversity (pan-NLR’ome) project performed by the laboratory of Jonathan Jones (TSL) in collaboration with Dr. Brande Wulff (John Innes Centre) and under the auspices of 2Blades Foundation. The data were uploaded to the European Nucleotide Archive and can be downloaded under projects PRJEB35211 and PRJEB35215.

Restrictions on data use

You are welcome to use these data to advance your studies of the plant immune system.

By accessing these data, you agree not to publish any articles containing analyses of genes or genomic data on a genome-wide, whole-assembly scale prior to publication by the producers (Reserved Analyses). Reserved Analyses include the identification of complete sets of genomic features such as genes, gene families, regulatory elements, repeat structures, GC content, or any other genome feature, and comparisons with other Triticeae.

The embargo on publication of Reserved Analyses by researchers outside of the pan-NLR’ome Project is expected to extend until a first publication of the results of the project is accepted. Academic users are free to publish papers dealing with specific genes or small sets of genes. If these data are used for publication, please contact the responsible persons below to determine the acknowledgement.