HuBMAP Frequently Asked Questions (FAQ)

General Questions

What is HuBMAP & what are its goals? HuBMAP is the Human BioMolecular Atlas Program. The vision for HuBMAP is to catalyze the development of a framework for mapping of the human body at single-cell resolution to transform our understanding of normal tissue organization and function. This will be achieved by:
  • Accelerating the development of the next generation of tools and techniques for constructing high resolution spatial tissue maps that quantify multiple types of biomolecules either sequentially or simultaneously.
  • Generating foundational 3D human tissue maps using validated high-content, high-throughput imaging and omics assays.
  • Establishing an open data platform that will develop novel approaches to integrating, visualizing, and modelling imaging and omics data to build multi-dimensional tissue maps, and making data rapidly findable, accessible, interoperable, and reusable by the global research community.
  • Coordinating and collaborating with other funding agencies, programs, and the biomedical research community to build the framework and tools for mapping the human body at single-cell resolution.
  • Supporting pilot projects that demonstrate the value of the resources developed by the program to study normal individual variations and tissue changes across the lifespan and the health-disease continuum.
Better insights into the principles governing the tissue organization-function relationship will:
  • Potentially lead to better understanding of the significance of normal inter-individual variability and changes across the lifespan.
  • Inform about the emergence of disease at the biomolecular level before the appearance of clinical symptoms.
Despite vastly improved imaging and omics technologies and many important foundational discoveries, our understanding of how tissues are organized is still restricted by main challenges:
  1. Integrating high content, high resolution spatial and omics information to comprehensively profile biomolecular distribution and morphology of tissues in a high throughput manner.
  2. Arranging this information into 3D tissue maps amenable to modelling.
Generating foundational 3D human tissue maps is one of the core goals of HuBMAP. HuBMAP projects will generate _high resolution, high content, high-throughput_ biomolecular 3D tissue maps of non-diseased human organs and organ systems.
  • For HuBMAP, a high-resolution assay is one that can reliably and reproducibly assign detected biomolecules to individual cells or extracellular compartments of a tissue.
  • A high content approach is one that maximizes identification of tissue features through a combination of biomolecular depth, spatial resolution and multiplexing of complementary, multi-parameter assays.
  • A high throughput pipeline is one that maximizes the bandwidth of data production to result in any or all of the following:
  1. Accelerated speed of analysis, so that hundreds or thousands of samples can be analyzed simultaneously.
  2. Greater depth of analysis, so that hundreds or thousands of molecules can be analyzed in a single sample.
  3. Enhanced capacity for volume, so that a given set of molecules can be analyzed in all the cells within a larger tissue sample.
Using a multi-dimensional approach, including imaging, sequencing, and mass spectrometry assays, HuBMAP provides robust molecular characterization of human cells in their natural tissue context. HuBMAP also generates and shares a number of other resources to support the use of these maps, including details of experimental protocols used, validation of affinity probes, biospecimen metadata, conventions used for annotation, as well as computational tools.

HuBMAP, which made its first external awards in Fall 2018, is funded through the NIH Common Fund as a short-term (8 years), goal-driven strategic investment, with deliverables intended to catalyze research across multiple biomedical research disciplines. The NIH Common Fund supports cross-cutting programs that are expected to have exceptionally high impact. All Common Fund initiatives invite investigators to develop bold, innovative, and often risky approaches to address problems that may seem intractable in isolation or to seize new opportunities that offer the potential for rapid progress.

For a more in depth understanding, read the HuBMAP marker paper, see the course on HuBMAP data acquisition, analysis, and visualization, the Visible Human MOOC, or see this video: HuBMAP Overview. Stay in touch by subscribing to our mailing list and YouTube channel.

Who are the intended users of HuBMAP resources? HuBMAP's rich datasets and associated resources are intended for broad use by the research community, including:
  • Computational researchers exploring organizing principles of human tissues, new structural-functional relationships, and biomolecular networks
  • Biologists exploring hypotheses using publicly available HuBMAP datasets prior to or in parallel with work in their own labs
  • Experimentalists interested in using the same protocols or computational tools in their labs
  • Educators developing new teaching materials
  • Technology developers interested in developing new assays with enhanced performance
How does HuBMAP fit into the ecosystem of other single cell efforts within and outside the NIH? HuBMAP is part of a rich ecosystem of established and emerging atlasing programs supported by NIH and globally by other funding organizations, many of which are focused on specific organs or diseases.
  • HuBMAP connects with these programs to ensure data interoperability, avoid duplication of work, and leverage and synergize gained knowledge.
  • The consortium has organized a number of events to bring together these communities to discuss topics of shared interest (e.g. CCF meeting, NIH-HCA meeting) and is committed to improving coordination and collaboration among different programs.
  • In addition, many HuBMAP PIs actively participate in these efforts, helping with cross-pollination and advancing our global understanding.
HuBMAP, as its name implies, was specifically initiated to resolve the challenge of building integrated, comprehensive, high-resolution spatial maps of human tissues and organs. HuBMAP provides leadership in the ecosystem around:
  • Techniques for integrating disparate, multi-dimensional and multi-scale datasets
  • The development of a Common Coordinate Framework (CCF) for integrating data across many individuals
  • The development and validation of these assays
To further increase interoperability, HuBMAP has adopted a number of standards and processes developed by other consortia, and is working and actively involved in knowledge exchange. The consortium sees itself as an integral part of the ecosystem, sharing its strengths and actively contributing to the community.

Introduction to HuBMAP resources

We welcome all to explore the HuBMAP Consortium website and HuBMAP Data Portal.

Visitors can provide feedback through the Contact form.

The Visible Human MOOC provides an overview of HuBMAP and an introduction to data acquisition, analysis, and visualization.

Getting Started with the HuBMAP Data Portal

How can I start using the portal?

Here's a link to the HuBMAP Data Portal. HuBMAP Data Portal (and related) documentation can be found on the HuBMAP Documentation page and also the HuBMAP FAQ - this page.

How can I learn more about HuBMAP data acquisition, analysis, and visualization tools?

The Visible Human MOOC provides an overview of HuBMAP and introduction to data acquisition, analysis, and visualization.

Can I sign up to be notified of HuBMAP news and events?

Yes, you can sign up for our mailing list to keep informed on everything that is happening in HuBMAP.

Do you track my activities on your portal?

Yes, interactions with the site are recorded in server logs and on Google Analytics and are mapped to your IP address.
In that regard the HuBMAP portal is no different from the rest of the internet.

What web-browsers are supported in the portal?

All modern, mainstream browsers are supported (i.e. Chrome, Edge, Firefox, Safari, etc.).

Where should I report errors, make suggestions for new features or functionality, or ask questions about using the portal?

The HuBMAP Consortium welcomes your comments, feedback, and help in identifing errors on the HuBMAP Data Portal.

  • You can provide error reports, make suggestions, or ask questions through our contact-form.
  • For help with specific issues related to the portal, please contact the HuBMAP Helpdesk and submit a support ticket.

HuBMAP Data and Metadata Questions

Can I use HuBMAP data for my own research? If you use NIH HuBMAP data in publications or presentations we request that you include an acknowledgement of the HuBMAP Program.
  • This acknowledgement helps justify and sustain funding needed to continue providing open access to a growing set of data and tools.
  • Suggested language for such an acknowledgment is: “The results (published or shown) here are in whole or part based upon data generated by the HuBMAP Program."
Can I use your data for another project that I am working on?

Yes! The Consortium provides raw and processed data (at multiple levels) for the community to access through the HuBMAP Data Portal.

HuBMAP products are broadly available to the research community to establish the foundations for a human body map that other programs and the international community can build upon, including methods, tools, reagents, biospecimens, datasets, and software.

To acknowledge HuBMAP in your findings, the Consortium suggests language of the form: “The results (published or shown) here are in whole or part based upon data generated by the HuBMAP Program."

Can I submit my own data to HuBMAP? Why would I want to?
  • Yes, HuBMAP allows investigators to submit their own data via the HuBMAP Data Portal.
  • Why share? Having your own data on HuBMAP will ...
    • Allow other researchers access to your results. In this way, others may extend and interact with your scientific work.
    • Provide additional resources for creating cellular and molecular level anatomical maps of the healthy human.
The HuBMAP consortium encourages the scientific community to provide feedback about HuBMAP dataset metadata. This feedback helps improve the quality and usability of community data submissions.
Can I add my own tools to HuBMAP portal? Why would I want to submit my tools to HuBMAP?
  • Yes, HuBMAP seeks to host relevant tools and welcomes community input to help with feature prioritization and development for the HuBMAP Portal.
  • Adding your Tools to the HuBMAP Portal will help you get others to use your tools and provide feedback to improve the scientific impact of your work.
  • One of the first tools released was Azimuth, an app for reference-based single-cell analysis, that lets users annotate cell-types in their own data based on HuBMAP approaches.
What do I need to do to gain access to HuBMAP data? How can I download HuBMAP data? Access to data on HuBMAP's Data Portal is open to all interested viewers, without additional barriers (account creation, login, etc.).

How can I download HuBMAP data?

To download HuBMAP data from the Data Portal you will need to register as a member of HuBMAP. Note that downloads of specific datasets (e.g., raw genetic data) require NIH approval. Contact the HuBMAP Helpdesk for assitance with accessing this type of data.

How can I register as a member of HuBMAP?

1. Go to the HuBMAP Consortium website
2. Under Member Services select Member Register
3. Complete the registration form (you will need a valid institutional or eRA Commons ID)

Can I make HuBMAP data available through my resource?
  • You may use HuBMAP data for any purposes permitted by the Data Sharing Policy.
  • The CCF 3D Reference Object Library provides anatomically correct reference organs. The organs are developed by a specialist in 3D medical illustration and approved by organ experts.
  • As of December 2023, the 6th HRA release included 30 organ objects that can be freely used in teaching, research, or commercial applications.
How would I cite and reference HuBMAP and related data?

To acknowledge HuBMAP data in publications or presentations, we suggest:

  • “The results (published or shown) here are in whole or part based upon data generated by the HuBMAP Program."

The HuBMAP marker paper should be cited as:

The Visible Human reference organs are freely available via the CCF 3D Reference Object Library. Please cite as:

Can I use figures from HuBMAP in my publications?

Yes, as long as you cite the source of the figure. See the preceeding question for more details.

What metadata is included in HuBMAP? Will HuBMAP include additional metadata in future releases?
  • HuBMAP accepts donor, sample (block, section, or suspension), and assay metadata.
  • See this HuBMAP metadata resource for more information.
  • In future releases, metadata will be linked to various ontologies to make integration more efficient.
How does HuBMAP curate and validate metadata? HuBMAP data submitters can access lists of metadata schemas (and related information) and download metadata templates from HuBMAP’s Data Upload Guidelines page. Data submitters can validate their own metadata using a Validator tool.
  • Click on any assay or sample type on the lists to jump to a page for that type.
  • Select the Excel template on that page.
  • Download and complete the Excel template.
  • Validate your metadata spreadsheet (template) using the Metadata Spreadsheet Validator.
    • The Validator tool compares entries in your metadata template against specifications stored in the CEDAR repository.
    • It categorizes any errors found and provides hints on how to fix those errors.
    • If errors are detected, use the Validator tools to correct any errors.
    • When no errors are found, you may safely upload your spreadsheet.
    • Once all errors are corrected, download the corrected metadata spreadsheet (TSV format).
    • Send validated (organ or sample) metadata TSV files to the HuBMAP Helpdesk
    • A Data Curator at the HuBMAP Helpdesk will manually validate and upload the files.
  • Access Help documentation for the Validator.
  • TMCs are required to validate their metadata BEFORE uploading it to the HuBMAP Helpdesk.
Is there a “map” in HuBMAP? The Human Reference Atlas (HRA) is a comprehensive, high-resolution, three-dimensional atlas of all the cells in a healthy human body.
  • The HRA provides standard terminologies and data structures for describing:
    • specimens
    • biological structures
    • spatial positions
All linked to existing ontologies. Watch a short video introduction to the HRA.
How can I run queries? What type queries are currently supported? Can you show me example queries? You can query and interact with HuBMAP (and other) data registered in the Human Reference Atlas (HRA) using APIs.
Can I suggest types of queries I am interested in?

Contact the HRA Team with feedback or suggestions regarding APIs and queries for HuBMAP HRA data.

What controlled vocabularies or ontologies are being used by HuBMAP? Each donor metadata item uses Unified Medical Language System (UMLS) Concept Unique Identifiers (CUIs) and related SNOMEDCT_US codes.
  • This list will be expanded as clinical data transactions, not just metadata, are added for donors for which data is available.
  • Similarly, other metadata will be encoded with applicable ontologies.
  • The HuBMAP Knowledge Graph underpins all ontologies used in HuBMAP but is not yet deployed.
  • The current CCF ontology uses Uberon, Kidney Tissue Atlas Ontology (KTAO) and Cell Ontology (CL), see details

API (Application Programming Interface) Questions

Do you support an API for programmatic access? The HuBMAP portal is built using an extensible API structure that supports all component interactions.
  • APIs are being registered in SmartAPI.
  • For external access to APIs, please submit a request to the HuBMAP Helpdesk.
What are the current features of the API? The HuBMAP APIs underpin all provenance, data access, processing, translation, search, and access controls.
  • APIs also report the versions and uptime statuses of all Docker containers that comprise HuBMAP’s microservices orchestration architecture.
What are the expectations about future API features?
  • APIs are extensible and are expected to be expanded progressively.
  • The next major set of APIs will deliver the underpinning transactions needed for semantic search.
What are the licensing requirements to use HuBMAP APIs, software, and data?
How do you plan to handle error fixes and updates? Will I be notified of these?
  • You can submit a bug or request a new Data Portal feature through this form.
  • For help with specific issues related to the portal, please contact the HuBMAP Helpdesk and submit a support ticket.
  • Be sure you are up-to-date on all HuBMAP news, sign up for our mailing list.

Experimental Design & Data Analysis Questions

Where can I find information about experimental design associated with HuBMAP data? An overview of the Information on the experimental design and choice of modalities can be found within this reference:
  • Snyder, M.P., Lin, S., Posgai, A. et al. The human body at cellular resolution: the NIH Human Biomolecular Atlas Program. Nature 574, 187–192 (2019). https://doi.org/10.1038/s41586-019-1629-x (PMC6800388)
Additional information on experimental design for each modality featured in the portal can be obtained on protocols.io as listed below. Further questions can be directed to the dataset contacts detailed within the portal. Overview protocols:

University of Florida:

Vanderbilt University:

UCSD:

Where can I find information about HuBMAP experimental protocols?

All published protocols that are used in HuBMAP are available on protocols.io

How do you define raw and processed data?
  • We define raw data as the data that comes directly off of the instrument (e.g. mass spectrometer, microscopy, etc.)
  • Processed data has been transformed in some manner (e.g. normalization, background subtracted, aligned, etc.).
    • The level of processing is defined by the data state as detailed below.
    • Data states are dependent upon the modality.
    • In general, data state 0 (raw data) and state 1 (processed data) are available on the portal for downloading.
Microscopy:
Data State Description Example File Type
0 Raw image data: This is the data that comes directly off the instrument without preprocessing. (may not always be included). CZI, TIFF
1 Processed data: Can include stitching, thresholding, background subtraction, z-stack alignment, deconvolution CZI, TIFF, OME-TIFF
2 Segmentation: Computationally predicted cell (nucleus, cytoplasm) and/or structural boundaries (tubules, ventricles, etc.) CSV, TIFF
3 Annotation (Cells and Structures): Interpretation of microscopy image and/or segmentation in terms of biology (e.g. unhealthy vs healthy, cell-type, function, functional region). TIFF, PNG
Mass Spectrometry:
Data State Description Example File Type
0 Raw image data: This is the data that comes directly off the mass spectrometer without preprocessing; sometimes referred to as raw spectral data. imzML
1 Processed imaging MS data: Can include peak alignment, intensity normalization, m/z recalibration. CSV, OME-TIFF
Sequencing:
Data State Description Example File Type
0 Raw data: This is the raw sequence data (unprocessed) generated directly by the sequence instrument in files either with Phred quality scores (fastq). FASTQ
1 Aligned data: SAM files contain sequence data that has been aligned to a reference genome and includes chromosome coordinates. BAM files are compressed binary versions of SAM files. The reference genome used is hg38. SAM, BAM
In what formats is your data available?
  • Imaging based raw or processed data is available as TIFF or OME.TIFF formats
  • Segmented imaging data is generated as csv and TIFF formats
  • Annotated imaging data is TIFF, PNG, and PDF
  • Raw sequence data is provided as fastq and metadata via tsv
  • Imaging mass spectrometry raw data is provided as a .d and processed data is imzml, or a csv and a series of ome-tiffs.
Can I get the code that you used to process data?

All available code can be found on the HuBMAP github page.

Can I find an explanation for the choice of HuBMAP pipelines and algorithms?
Which samples should I consider to be technical replicates?
  • Technical replicates are repeated measurements of the same existing sample.
  • As even serial tissue sections represent distinct samples, we do not consider any images of tissues to be technical replicates.
  • Technical replicates for sequencing assays would be any sequencing libraries generated from the same sample or aliquot of cells or nuclei.
Which samples should I consider to be biological replicates?
  • Biological replicates are datasets from samples that originate from the same organ and organ donor.
  • As such, each dataset within the HuBMAP database that is provided for a given donor organ for a comparable anatomical region/structure would be a biological replicate.
How are each organ/tissue handled to maintain high reproducibility and validity for the assay?

Protocols.io detailed processing with QA/QC HuBMAP Method Development Community timeline

How have antibodies been verified? Initially, all antibodies were validated by individual groups. With later data releases, complete antibody information, including antibody clone, vendor, RRID, conjugation information, etc. became available. Additional antibody validation standards were also implemented. For the development of our antibody validation levels, we followed the antibody verification guidelines established in the following manuscripts: A proposal for validation of antibodies
  • Uhlen M, Bandrowski A, Carr S, Edwards A, Ellenberg J, Lundberg E, Rimm DL, Rodriguez H, Hiltke T, Snyder M, Yamamoto T. Nat Methods. 2016 Oct;13(10):823-7. doi: 10.1038/nmeth.3995. Epub 2016 Sep 5.PMID: 27595404
The Antibody Society’s antibody validation webinar series
  • Voskuil, J., Bandrowski, A., Begley, C. G., Bradbury, A., Chalmers, A. D., Gomes, A. V., Hardcastle, T., Lund-Johansen, F., Plückthun, A., Roncador, G., Solache, A., Taussig, M. J., Trimmer, J. S., Williams, C., & Goodman, S. L. MAbs. 2020;12(1):1794421. doi:10.1080/19420862.2020.1794421. PMID: 32748696