ABDEFIJKLMNOPQRSTUVWXYZAAABAEAFAGAHAIAJAKALAMANAOAPAQARASATAUAVAWAXAY
1
Field NameData typeDefinitionoptions: required, optional, n/aNotesCATEGORY FIELD VALUES
2
Visium (no probes)Visium (with probes)GeoMx (NGS)GeoMx (nCounter)HiFihrsTP-seq (DBiTSeq)XeniumResolveCosMxMERFISHStereo-Seqmapping area unitspot size unitspot spacing unitcapture area idpermeabilization time unitpreparation instrument vendorpreparation instrument modelpreparation instrument kit
(manufacturer; model/name; product code)
probe hybridization time unitoligo probe panel (manufacturer; model/name; product code)target retrieval incubation time unitproteinaseK incubation time unitprobe hybridization time unitsegmentation strategy
3
COMMON ASSAY FIELDSvariedEither the single-assay or multi-assay fields from the "common assay fields" metadata schema should be included here.required (multi-
assay)
required (multi-
assay)
required (single-
assay)
required (single-
assay)
required (single-
assay)
required (single-
assay)
required (single-
assay)
required (single-
assay)
required (single-
assay)
required (single-
assay)
required (multi-
assay)
mm^2um^2umA1minute10X GenomicsVisium CytAssistGeoMx Immune Cell Profiling (Human Protein Core for nCounter)minuteminuteminuteautomated segmentation
4
mapped area valuefloatFor Visium, this is the area of spots that was covered by tissue within the captured area, not the total possible captured area which is fixed. For GeoMx this would be the area of the AOI being captured. For HiFi this is the summed area of the ROIs in a single flowcell lane. For CosMx, Xenium and Resolve, this is the area of the FOV (aka ROI) region being captured. For Stereo-Seq this is the number of beads.requiredrequiredrequiredrequiredrequiredrequiredrequiredoptionaloptionaloptionalrequiredspecific to a single AOI (GeoMx), FOV (CosMx), capture area (Visium), flowcell lane surface(HiFi)um^2mm^2B1Not applicableNot applicableGeoMx Neural Cell Profiling (Human Protein Core for nCounter)manual segmentation
5
mapped area unitcategoryThe unit of measurement for the mapping area. For Visium and GeoMx this is typically um^2.requiredrequiredrequiredrequiredrequiredrequiredrequiredoptionaloptionaloptionalrequiredC1Custom
6
spot size valuefloatFor assays where spots are used to define discrete capture areas, this is the area of a spot.requiredrequiredn/an/arequiredrequiredn/an/an/an/arequiredIs this relevant to DBiTSeq?D1Not applicable
7
spot size unitcategoryThe unit for spot size value.requiredrequiredn/an/arequiredrequiredn/an/an/an/arequiredIf this is required for Xenium, Resolve, CosMx, then spot_size_value should also be required. Is this supposed to be "n/a"?Lane 1
8
number of spotsintegerNumber of capture spots within the mapped area. For Visium this would be the number of spots covered by tissue, while it's the number of spots within ROIs for HiFi.requiredrequiredn/an/arequiredrequiredn/an/an/an/arequiredLane 2
9
spot spacing valuefloatApproximate center-to-center distance between capture spots. Synonyms: Inter-Spot distance, Spot resolution, Pit sizerequiredrequiredn/an/arequiredrequiredn/an/an/an/arequiredLane 3
10
spot spacing unitcategoryUnits corresponding to inter-spot distancerequiredrequiredn/an/arequiredrequiredn/an/an/an/arequiredLane 4
11
capture area idcategoryWhich capture area on the slide was used. For Visium this would be [A1, B1, C1, D1]. For HiFi this would be the lane on the flowcell.requiredrequiredn/an/aoptionaloptionaln/an/an/an/an/aLane 5
12
permeabilization time valueintegerPermeabilization time used for this tissue section.optionaloptionaln/an/arequiredrequiredn/an/an/aoptionaloptionalLane 6
13
permeabilization time unitcategoryThe unit for the permeabilization time.optionaloptionaln/an/arequiredrequiredn/an/an/aoptionaloptionalLane 7
14
slide IDtextA unique ID denoting the slide used. This allows users the ability to determine which tissue sections were processed together on the same slide. It is recommended that data providers prefix the ID with the center name, to prevent values overlapping across centers. For GeoMx this is found in the "Initial Dataset" spreadsheet (download from within Data Analysis Suite); can also use "scan name". For HiFi-Slide, this is the ID of used illumina flow cell. HiFi-Slide lay a piece of tissue on a used illumina flow cell and then RNA released from the tissue could be captured by spots on that flow cell.n/an/arequiredrequiredrequiredoptionalrequiredrequiredrequiredrequiredn/aFor Visium this is captured in the histology assay.Lane 8
15
preparation instrument vendorcategoricalThe manufacturer of the instrument used to prepare the sample for the assay.optionalrequiredn/an/an/an/an/an/an/aoptional
16
preparation instrument modelcategoricalThe model number/name of the instrument used to prepare the sample for the assay.optionalrequiredn/an/an/an/an/an/an/aoptional
17
preparation instrument kitcategoricalThe reagent kit used with the preparation instrument.n/an/an/an/an/an/an/an/an/an/a
18
Histology AssayvariedThe histology assay metadata should be additional completed, per dataset, if required by the assay. This histology metadata should be uploaded in a separate TSV or Excel file. For example, Visium would include three TSV/Excel metadata files (Visium, RNAseq, and histology).requiredrequiredn/an/arequiredn/an/an/an/arequired
19
Sequencing AssayvariedThe appropriate sequencing assay metadata should be additional completed, per dataset, as required by the assay. This sequencing metadata should be uploaded in a separate TSV or Excel file. For example, Visium would include three TSV/Excel metadata files (Visium, RNAseq, and histology).required RNAseqrequired RNAseq (with probes)required RNAseq (with probes)n/arequired RNAseqrequired DBiT (RNA)n/an/an/an/arequired RNAseqThis would potentially be "RNAseq with probes" for GeoMx assays that use ngs as the readout method.
20
number of channelsintegerThe number of distinct color channels in the image.n/an/arequiredrequiredn/arequiredn/an/an/an/an/a
21
target retrieval incubation temperature
integerThe temperature in Celsius. This will normally be 100 degrees Celsius for RNA assays, and 80 degrees Celsius for protein assays.n/an/arequiredrequiredrequiredoptionaloptionaloptionaloptionaln/a
22
target retrieval incubation time valuefloatThe duration for which a sample is exposed to a target retrieval solution.n/an/arequiredrequiredrequiredoptionaloptionaloptionaloptionaln/a
23
target retrieval incubation time unitcategoryThe units for target retrieval incubation time value.n/an/arequiredrequiredrequiredoptionaloptionaloptionaloptionaln/a
24
proteinaseK concentrationfloatThe amount or concentration of the enzyme Proteinase K within a sample (in ug/ml).n/an/aoptionaloptionalrequiredoptionaloptionaloptionaloptionaln/a
25
proteinaseK incubation time valuefloatThe duration for which a sample is exposed to Proteinase K.n/an/aoptionaloptionalrequiredoptionaloptionaloptionaloptionaln/a
26
proteinaseK incubation time unitcategoryThe units for proteinaseK incubation time value.n/an/aoptionaloptionalrequiredoptionaloptionaloptionaloptionaln/a
27
probe hybridization time valuefloatHow many hours were the oligo-conjugated RNA or oligo-conjugated antibody probes hybridized with the sample?n/an/acaptured in the RNAseq metadatarequiredn/aoptionaloptionaloptionaloptionaln/a
28
probe hybridization time unitcategoryThe units for probe hybridization time value.n/an/arequiredn/aoptionaloptionaloptionaloptionaln/a
31
oligo probe panelcategoryThis is the probe panel used to target genes and/or proteins. In cases where there is a core panel and add-on modules, the core panel should be selected here. If additional panels are used, then they must be included in the "additional_panels_used.csv" file that's uploaded with the dataset.n/an/arequiredn/aoptionalrequiredrequiredrequiredrequiredn/a
32
is custom probes usedbooleanState ("Yes" or "No") whether custom RNA or antibody probes were used. If custom probes were used, they must be listed in the "custom_probe_set.csv" file.n/an/arequiredn/arequiredrequiredrequiredrequiredn/aAny spike ins used should be included in the probe panel file. We need to define the file structure.
33
number of panel targetsintegerHow many genes, RNA isoforms or RNA regions are targeted by probes.n/an/an/an/an/arequiredrequiredrequiredrequiredn/a
34
roi_labeltextA label for the region of interest. For Resolve and CosMx, this is the field of view (FOV) label. For GeoMx this can be found in the "Initial Dataset" spreadsheet (download from within Data Analysis Suite).n/an/arequiredrequiredoptionalrequiredoptionaloptionaloptionaln/a
35
is overlay usedbooleanState ("Yes" or "No") whether an ROI overlay was used to guide ROI selection. If an overlay is used, then the overlay details will be provided in the preparation protocols.io protocol.n/an/an/an/an/aoptionaln/an/an/an/an/aIf used, this needs to be uploaded. It is not included in the OME TIFF.
36
is roi segmentation performedbooleanWas the image segmented. For GeoMx this refers to whether segmentation was used to split ROIs (regions of interest) into AOIs (areas of interest).n/an/arequiredrequiredn/an/an/an/an/an/a
37
roi segmentation strategycategoryThe method of segmentation that was applied in a GeoMx assay. If an overlay was used the overlay image needs to be included in the dataset upload.n/an/aoptionaloptionaln/an/an/an/an/an/an/ais required if "segmentation performed" is true. If value = "overlay" then an overlay file needs to be uploaded.
38
anatomical structure labeltextThe overarching anatomical structure.n/an/aoptionaloptionaloptionaln/aoptionaloptionaloptionaloptionaln/a
39
anatomical structure IDontology IDThe ontology ID for the parent structure. Typically this would be an UBERON ID.n/an/aoptionaloptionaloptionaln/aoptionaloptionaloptionaloptionaln/a
40
targeted entity labeltextState what cell type(s) or functional tissue unit was targeted in this ROI/AOI.n/an/arequiredrequiredn/an/an/an/an/an/an/a
41
targeted entity IDontology IDThe ontology ID for the targeted entity.n/an/aoptionaloptionaln/an/an/an/an/an/an/ais required if "segmentation performed" is true.
42
segment IDtextThis is the ID for the area of interest (AOI) in a GeoMx dataset. From "Initial Dataset" spreadsheet (download from within Data Analysis Suite), e.g. 9a828e39-43d8-4051-9bcc-581a520a85d4.n/an/arequiredrequiredn/an/an/an/an/an/a
43
is technical replicatebooleanIs the sequencing reaction run in replicate, "Yes" or "No". If "Yes", FASTQ files in dataset need to be merged.n/an/arequiredrequiredn/an/an/an/an/an/an/a
44
HybCode pack lot numbertextEnter the lot number noted within the LabWorksheet.txt file (and used in downstream nCounter processing).n/an/an/arequiredn/an/an/an/an/an/an/a
45
non global filestextA semicolon separated list of non-shared files to be included in the dataset. The path assumes the files are located in the "TOP/non-global/" directory. For example, for the file is TOP/non-global/lab_processed/images/1-tissue-boundary.geojson the value of this field would be "./lab_processed/images/1-tissue-boundary.geojson". After ingest, these files will be copied to the appropriate locations within the respective dataset directory tree. This field is used for internal HuBMAP processing. Examples for GeoMx and PhenoCycler are provided in the File Locations documentation: https://docs.google.com/document/d/1n2McSs9geA9Eli4QWQaB3c9R3wo5d5U1Xd57DWQfN5Q/edit#heading=h.1u82i4axggeen/an/arequiredrequiredn/an/aoptionaloptionaloptionaloptionaloptional
46
number of additional stainsintegerthis would be minimally 2 (always include DAPI and polyT) and can include 6 more.n/an/an/an/an/an/an/an/arequiredn/a
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102