Data Store | Map downloads

Data for bulk download at legumeinfo is now stored at the Data Store. Layout looks like:
    Genus_species/
      Genotype.DatasetTypeVersion.Subtype.Key/
        README.Key.md
        gensp.Genotype.DatasetTypeVersion.Subtype.KeyA.DESCRIPTION1.ext.gz
        gensp.Genotype.DatasetTypeVersion.Subtype.KeyA.DESCRIPTION2.ext.gz

Example:
    Trifolium_pratense/
      MilvusB.gnm2.ann1.gNmT/
        README.gNmT.md
        tripr.MilvusB.gnm2.gNmT.pchr_plus_unachored.fna.gz
      MilvusB.gnm2.ann1.DFgp/
          README.DFgp.md
          tripr.MilvusB.gnm2.ann1.DFgp.ahrd.txt.gz
          [more files]
      etc...
Abbreviations in the directory names:

We use three-letter abbreviations to indicate data types. This is to avoid the ambiguity of unlabeled version numbers (e.g. how to indicate assembly version 1, annotation version 2)? Here are the abbreviations and corresponding data types:

  • ann => annotation
  • gnm => genome assembly
  • tcp => transcriptome
  • map => map
  • syn => synteny
  • div => diversity
  • gws => GWAS
Dataset key names:

The four-letter string in the README and the filenames (e.g. Key or gNmT above) is a unique key, which associates the file(s) in a directory (a data collection) with the metadata for the file(s). The keys are also recorded at a Registry, along with other curatorial and status information.

Searching for information in the Data Store:

Access the search text-entry box by clicking on the magnifying glass in the upper left corner. Then enter, for example, "protein" to find all files with that text in the name, or a key name (e.g. "Qq0N", if you know that from the Registry), or "Lupinus", or "lupan" (the five-letter GENus SPecies code). Also, the data is organized in a standard way, so you can probably find what you want by navigating through the directory tree.

Contributing data:

Yes, please! contact us. For developers of other sites: if you like this system, and would like to host a similar one, please contact us about the installation and configuration details (this uses the h5ai browser package). This file system and metadata schema are part of the Legume Federation project, and we would like to see addition of other instances.

The relationship of data in this repository to other instances of the data:

In many cases, data in the Data Store is re-hosted from another primary repository. We host it here in order to provide access in regularized processes at this site. The metadata file (README.KeyX.md) in every terminal directory gives the data provenance, in cases where there is another primary repository for this data. This file also contains citation information, information about any file name changes from the previous repository, and transformations on the data contents (if any).

Metadata templates and protocols:

See more information about the Data Store directory, including metadata templates, the Registry, and associated protocols, at the metadata directory. See more about the Legume Federation project, which funds part of the development and curation work on the Data Store.

Go to the Data Store.


Download Maps
File namesort descendingSizeModifiedDescription
alt textBAT93_x_JALOEEP558_b.txt23.71 KBMon, 06/01/2015 - 14:53Mapping Population: BAT93_x_JALOEEP558
Species: P. vulgaris
Publication: Freyre, Skroch et al., 1998a
alt textPv_Blair_2017.txt77.17 KBMon, 07/17/2017 - 09:20
alt textPv_COS_Cook_UC_Davis_2009.txt68.56 KBMon, 04/27/2015 - 08:21Mapping Population: unknown
Species: Phaselous vulgaris
Publication: Blair, Cortes et al., 2013a
alt textPv_Jackson_Purdue_FPC_2007.txt5.66 MBMon, 04/27/2015 - 08:30Mapping Population:
Species: Phaselous vulgaris
Publication: unknown
alt textPv_McClean_NDSU_-_2007.txt36.78 KBMon, 04/27/2015 - 08:29Mapping Population: unknown
Species: Phaselous vulgaris
Publication: unknown
alt textPvConsensus_GaleanoFernandez2011_a.txt171.2 KBMon, 04/27/2015 - 08:28Mapping Population: N/A
Species: Phaselous vulgaris
Publication: Galeano, Fernandez et al., 2011a