A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | AA | AB | AC | AD | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Summary: | This is a first pass of gathering and structuring key data to be used to build the CINECA WP3 minimal metadata model - current under development. | ||||||||||||||||||||||||||||
2 | Content: | The chosen sub-categories are shaped by common data categories collected by major catalogues such as Maelstrom. The chosen variables are either CINECA use case requirements, or variables that can be mapped to by the majority of CINECA cohorts, or typical metadata variables collected by major catalogues. We have applied ontology terms to variables (mostly from NCIT). We are also looking into cross referencing with existing minimal data models. | ||||||||||||||||||||||||||||
3 | Attribution: | If you use this work, please attribute CINECA and the primary contacts listed below. | ||||||||||||||||||||||||||||
4 | Primary Contacts: | Vivian Jin and Fiona Brinkman (SFU, Canada); Melanie Courtot (EBI, UK). | ||||||||||||||||||||||||||||
5 | Possible next steps: | Create application ontology and structure terms in a hierarchy for easier search. For each variable, specify values and associated ontology terms (eg. for gender - specify male/female/transgender/non-binary). | ||||||||||||||||||||||||||||
6 | ||||||||||||||||||||||||||||||
7 | 9 CINECA cohorts | stand-alone cohorts | consortiums | |||||||||||||||||||||||||||
8 | CHILD | H3Africa | ||||||||||||||||||||||||||||
9 | CARTaGENE | BIOS | ||||||||||||||||||||||||||||
10 | CLSA | EGA | ||||||||||||||||||||||||||||
11 | Estonian Biobank | |||||||||||||||||||||||||||||
12 | CoLaus/PsyCoLaus | |||||||||||||||||||||||||||||
13 | UK Biobank | |||||||||||||||||||||||||||||
14 | NOTE: use blank response if answer to question description is N/A | |||||||||||||||||||||||||||||
15 | Broad category | Sub-category | Sub-category/variable | Variable | Question Description | Expected Answer Type | Ontology label | Ontology ID | Ontology definition | Known num. of cohorts with this data (if greater zero) | Use Cases Requirement | Notes | ||||||||||||||||||
16 | basic cohort attributes | aims and objectives | What are the aims and objectives of your cohort? | Alphanumeric | Study Objective | NCIT:C93415 | The reason for performing a study in terms of the scientific questions to be answered by the analysis of data collected during the study. | 9 | ||||||||||||||||||||||
17 | timeline | What is the timeline for your cohort study? | Alphanumeric | Timeline | NCIT:C54576 | A chronological schedule of when activities or events occurred or will occur. | 9 | |||||||||||||||||||||||
18 | study design (eg. longitudinal) | What is the study design for your cohort study? | MIABIS codes (Categorical) | Study Design | NCIT:C15320 | A plan detailing how a study will be performed in order to represent the phenomenon under examination, to answer the research questions that have been asked, and defining the methods of data analysis. Study design is driven by research hypothesis being posed, study subject/population/sample available, logistics/resources: technology, support, networking, collaborative support, etc.… | 9 | |||||||||||||||||||||||
19 | population data (Population Group? NCIT:C17005) | location | What location is your cohort's population based in? | Alphanumeric | Location | NCIT:C25341 | A position, site, or point in space where something can be found. | 9 | 4.1 | |||||||||||||||||||||
20 | criteria for enrollment and recruitment procedures | What is your cohort's criteria of enrollment or recruitment procedures? | Alphanumeric | Inclusion Criteria | NCIT:C25532 | Medical and/or social characteristics which are necessary for a subject to be allowed to participate in a clinical study, as outlined in the study protocol. Meeting inclusion criteria is not a sufficient condition for entry or recruitment of a subject into the study. Characteristics limiting the eligibility of a subject for the clinical study must be considered | 6 | 4.1 | ||||||||||||||||||||||
21 | num. participants | What are the number of participants in your cohort? | Numeric | Planned Subject Number | NCIT:C49692 | The number of subjects entered in a clinical trial. | 9 | 4.1, 5 | ||||||||||||||||||||||
22 | demographic data (NCIT:C142508) | sex(es) studied in cohort | ** these are also captured in the survey category | How many of each sex are studied in your cohort? | Numeric | Sex | NCIT:C28421 | The assemblage of physical properties or qualities by which male is distinguished from female; the physical difference between male and female; the distinguishing peculiarity of male or female. | 9 | 5.3 | ||||||||||||||||||||
23 | gender(s) studied in cohort | How many of each gender are studied in your cohort? | Numeric | Gender | NCIT:C17357 | The assemblage of properties that distinguish people on the basis of the societal roles expected for the two sexes. | ||||||||||||||||||||||||
24 | age range | What is the age range of your cohort's participants? | Alphanumeric | 6 | ||||||||||||||||||||||||||
25 | data collection events | If data was collected over several events or time points, what are the events or time points? | Alphanumeric | |||||||||||||||||||||||||||
26 | biosample (NCIT:C43412) | sample type (Body fluid or substance NCIT:C13236) | urine | Have you collected this type of biosample? | Yes/No (Binary) | Urine | NCIT:C13283 | The fluid that is excreted by the kidneys. It is stored in the bladder and discharged through the urethra. | 4 | 5.1 | ||||||||||||||||||||
27 | blood | venous or arterial | Blood | NCIT:C12434 | A liquid tissue; its major function is to transport oxygen throughout the body. It also supplies the tissues with nutrients, removes waste products, and contains various components of the immune system defending the body against infection. Several hormones also travel in the blood. | 5 | ||||||||||||||||||||||||
28 | fasting or non-fasting | |||||||||||||||||||||||||||||
29 | stool | Feces | NCIT:C13234 | The material discharged from the bowel during defecation. It consists of undigested food, intestinal mucus, epithelial cells, and bacteria. | 2 | |||||||||||||||||||||||||
30 | saliva | Saliva | NCIT:C13275 | The watery fluid in the mouth made by the salivary glands. Saliva moistens food to help digestion and it helps protect the mouth against infections. | 2 | |||||||||||||||||||||||||
31 | other | |||||||||||||||||||||||||||||
32 | availability | For each biosample present, is it available/accessible for research? | DUO code(s) | Availability | NCIT:C25429 | The quality of being obtainable or accessible and ready for use or service. | TBD | 5.1 | ||||||||||||||||||||||
33 | sample size | For each biosample present, what is the sample size? | Numeric | Sample Size | ||||||||||||||||||||||||||
34 | processing method | For each biosample present, what was the processing method used? | Alphanumeric (or Categorical?) | Methodology | NCIT:C86572 | The system of methods followed in a particular discipline. | TBD | 5.1 | ||||||||||||||||||||||
35 | storage method | For each biosample present, what is the storage method? | MIABIS codes (Categorical) | Defined Specimen Storage | NCIT:C93374 | An administrative activity defined at a global library level that is an action of storing samples. | TBD | 5.1 | ||||||||||||||||||||||
36 | laboratory measures (Laboratory Procedure NCIT:C25294) | microbiology (Microbiology Test NCIT:C49188) | microbial data | Have you collected microbial data? | Yes/No (Binary) | TBD | ||||||||||||||||||||||||
37 | biosample source/anatomical location | From where (specimen source site) have you collected microbial data? | Alphanumeric (or Categorical?) | Specimen Source Site | NCIT:C159256 | A request to identify the specimen source site. | ||||||||||||||||||||||||
38 | available data format(s) | For each specimen source site, what are the available data formats? | Alphanumeric (or Categorical?) | Format | NCIT:C42761 | The organization of information according to preset specifications. | TBD | |||||||||||||||||||||||
39 | genomics (Molecular Analysis? NCIT:C19770) | data type | DNA/Genotyping | Have you collected this type of data? | Yes/No (Binary) | DNA Sequencing/Nucleic Acid Sequencing | NCIT:C153598/NCIT:C18881 | 9 | 4.1,4.2,5.2,5.3 | |||||||||||||||||||||
40 | WGS | Whole Genome Sequencing | NCIT:C101294 | 7 | 5.2,5.4 | |||||||||||||||||||||||||
41 | WES | Whole Exome Sequencing | NCIT:C101295 | 4 | 5.2,5.4 | |||||||||||||||||||||||||
42 | Sequence variants (CNV, SNP arrays) | Single Nucleotide Polymorphism Profile | NCIT:C129888 | The analysis of all of the single nucleotide polymorphisms in the genome of a biological sample. | TBD | 5.2,5.3,5.4 | ||||||||||||||||||||||||
43 | Epigenetics | Epigenetic Profile | NCIT:C129887 | The analysis of all of the epigenetic DNA modifications in the genome a biological sample. | 4 | 5.2 | ||||||||||||||||||||||||
44 | Metagenomics | Metagenomics analysis | ERO:0000657 (***ERO is obsolete) | A molecular assay that is used to analyze metagenomic data; genetic material recovered directly from environmental samples for genomic research. | TBD | 5.2 | ||||||||||||||||||||||||
45 | Microbiome markers (rRNA, etc) | |||||||||||||||||||||||||||||
46 | RNAseq/gene expression | mRNA sequencing | NCIT:C129432 | A procedure that can determine the RNA sequences for all or part of the poly-A tail-containing messenger RNA transcripts in an individual. | 6 | 4.1,5.3 | ||||||||||||||||||||||||
47 | eQTL (Cis eQTLs and/or Trans eQTLs) | Expression Quantitative Trait Locus | NCIT:C113415 | A stretch of DNA at a particular chromosomal location that is able to regulate the expression of a specific mRNA or protein. | TBD | 4.1 | ||||||||||||||||||||||||
48 | other | |||||||||||||||||||||||||||||
49 | sample size | For each data type present, what is the sample size? | Numeric | Sample Size | NCIT:C53190 | A subset of a larger population, selected for investigation to draw conclusions or make estimates about the larger population. | TBD | all? | ||||||||||||||||||||||
50 | available data format(s) | For each data type present, what are the available data formats? | Alphanumeric (or Categorical?) | Format | NCIT:C42761 | The organization of information according to preset specifications. | TBD | 4.1 | ||||||||||||||||||||||
51 | availability | For each data type present, is it available/accessible for research? | DUO code(s) | Availability | NCIT:C25429 | The quality of being obtainable or accessible and ready for use or service. | TBD | all? | ||||||||||||||||||||||
52 | processing method | For each data type present, what was the processing method used (eg. sequencer/software)? | Alphanumeric | Methodology | NCIT:C86572 | The system of methods followed in a particular discipline. | TBD | |||||||||||||||||||||||
53 | associated cell type/tissue type/biosample | For each data type present, what are the associated cell/tissue/biosample types? | Alphanumeric (or Categorical?) | Specimen Source Site | NCIT:C159256 | A request to identify the specimen source site. | TBD | |||||||||||||||||||||||
54 | associated phenotype | For each data type present, what are the associated phenotype(s)? | Alphanumeric | Trait/Phenotype | NCIT:C985496/NCIT:C16977 | Any genetically determined characteristic/The assemblage of traits or outward appearance of an individual. It is the product of interactions between genes and between genes and the environment. | ||||||||||||||||||||||||
55 | main diagnosis | For each data type present, what is the main diagnosis ? | Alphanumeric | Diagnosis | ||||||||||||||||||||||||||
56 | associated disease | For each data type present, what are the associated diseases? | Alphanumeric | Diagnosis | NCIT:C15220 | The investigation, analysis and recognition of the presence and nature of disease, condition, or injury from expressed signs and symptoms; also, the scientific determination of any kind; the concise results of such an investigation. | 5 | |||||||||||||||||||||||
57 | associated medication | For each data type present, what are the associated medications? | Alphanumeric | Medication | NCIT:C459 | TBD | 4.1,4.2,5.1,5.2,5.3 | |||||||||||||||||||||||
58 | survey administration (NCIT:C64252) | date and time-related information | Have you collected the date/time of survey assessments? | Yes/No (Binary) | Assessment Date | NCIT:C93511 | The date (and time) on which an assessment is completed. | 4 | ||||||||||||||||||||||
59 | consent/accessibility | Is there consent for your survey data to be accessible for research? | DUO code(s) | Informed consent | NCIT:C16735 | Consent by a patient to a surgical or medical procedure or participation in a clinical study after achieving an understanding of the relevant medical facts and the risks involved. | TBD | |||||||||||||||||||||||
60 | unique identifiers | Are there unique identifiers for subjects/participants? | Yes/No (Binary) | Identifier | NCIT:C25364 | One or more characters used to identify, name, or characterize the nature, properties, or contents of a thing. | 6 | all? | ||||||||||||||||||||||
61 | questionnaire/survey data (NCIT:C17176) | socio-demographic and economic characteristics | age/birthdate | If this data type is present, what are the associated variables from your cohort's survey/questionnaire? For each associated variable, what is the data type (eg. categorical/binary/numeric/alphanumeric/datetime), the data collection event or time point (if applicable), the population type (if there are different participants sharing one identifier; eg. child, mother, father)? | .csv format | Age /Birth Date | NCIT:C25150/NCIT:C68615 | 5 | 4.1, 5 | |||||||||||||||||||||
62 | biological sex | Sex | NCIT:C28421 | The assemblage of physical properties or qualities by which male is distinguished from female; the physical difference between male and female; the distinguishing peculiarity of male or female. | 8 | 4.1, 5 | ||||||||||||||||||||||||
63 | gender | Gender | NCIT:C17357 | The assemblage of properties that distinguish people on the basis of the societal roles expected for the two sexes. | 8 | 5 | ||||||||||||||||||||||||
64 | ethnicity/race | Ethnicity/Race | NCIT:C16564/NCIT:C17049 | A social group characterized by a distinctive social and cultural tradition that is maintained from generation to generation. Members share a common history and origin and a sense of identification with the group. They have similar and distinctive features in their lifestyle habits and shared experiences. They often have a common genetic heritage which may be reflected in their experience of health and disease... | 6 | 4.1, 5 | ||||||||||||||||||||||||
65 | genealogy | TBD | 4.1 | |||||||||||||||||||||||||||
66 | birthplace | 6 | 4.1? | |||||||||||||||||||||||||||
67 | residence | Residence | NCIT:C25273 | Any address at which a person dwells more than temporarily. | 6 | 4.1 | ||||||||||||||||||||||||
68 | education | Education | NCIT:C16529 | The activities of educating or instructing or teaching; activities that impart knowledge or skill. | 6 | |||||||||||||||||||||||||
69 | family and household structure | 6 | ||||||||||||||||||||||||||||
70 | lifestyle and behaviours (lifestyle NCIT:C16795) | tobacco | Tobacco Smoking History | NCIT:C29719 | A record of an individual's background in regard to smoking tobacco. This would include such factors as start date, end date (if applicable), number of cigarette smoked, attempts to quit, and others. | 6 | ||||||||||||||||||||||||
71 | alcohol | Alcohol Use History | NCIT:C81229 | A description of an individual's current and past experience with alcoholic beverage consumption. | 5 | |||||||||||||||||||||||||
72 | physical activity | Physical Activity Measurement | NCIT:C120914 | A measurement of a subject's physical activity or movement. | 6 | |||||||||||||||||||||||||
73 | sleep | Sleep | NCIT:C73425 | A natural and periodic state of rest during which consciousness of the world is suspended. | 6 | |||||||||||||||||||||||||
74 | nutrition | Nutrition | NCIT:C28294 | That which is consumed to fuel necessary life processes of an organism. | 6 | |||||||||||||||||||||||||
75 | physician/practitioner info | Admitting Physician | NCIT:C51798 | The physician responsible for the admission of a patient to a hospital or other inpatient health institution. The admitting physician evaluates patients, makes admitting decisions and assesses diagnostic and treatment plans | TBD | |||||||||||||||||||||||||
76 | diseases (Disease or Disorder NCIT:C2991) | diagnosis | blood-related disorders | Hematopoietic and Lymphoid System Disorder | NCIT:C35814 | Any deviation from the normal structure or function of the blood or lymphatic system that is manifested by a characteristic set of symptoms and signs. | 5 | 4.1,4.2,5.1,5.2,5.3,5.4 | ||||||||||||||||||||||
77 | endocrine/nutritional/metabolic disorders | Endocrine System Disorder | NCIT_C3009 | A non-neoplastic or neoplastic disorder that affects the endocrine system. Representative examples of non-neoplastic disorders include diabetes mellitus, hyperthyroidism, and adrenal gland insufficiency. Representative examples of neoplastic disorders include carcinoid tumor, neuroendocrine carcinoma, and pheochromocytoma.… | 6 | |||||||||||||||||||||||||
78 | mental and behaviour disorders | Psychiatric Disorder/Behavioural Disorder | NCIT:C2893/NCIT:C35470 | A disorder characterized by behavioral and/or psychological abnormalities, often accompanied by physical symptoms. The symptoms may cause clinically significant distress or impairment in social and occupational areas of functioning. Representative examples include anxiety disorders, cognitive disorders, mood disorders and schizophrenia.…/A specific behavioral problem that occurs in persistent patterns and characteristic clusters and that causes clinically significant impairment. | 5 | |||||||||||||||||||||||||
79 | nervous system | Nervous System Disorder | NCIT:C26835 | A non-neoplastic or neoplastic disorder that affects the brain, spinal cord, or peripheral nerves. | 5 | |||||||||||||||||||||||||
80 | digestive system | Digestive System Disorder | NCIT:C2990 | A non-neoplastic or neoplastic disorder that affects the gastrointestinal tract, anus, liver, biliary system, and pancreas. | 5 | |||||||||||||||||||||||||
81 | respiratory system | Respiratory System Disorder | NCIT:C26871 | A non-neoplastic or neoplastic disorder that affects the respiratory system. Representative examples include pneumonia, chronic obstructive pulmonary disease, pulmonary failure, lung adenoma, lung carcinoma, and tracheal carcinoma. | 5 | |||||||||||||||||||||||||
82 | circulatory system | Cardiovascular Disorder | NCIT:C2931 | 5 | ||||||||||||||||||||||||||
83 | oncological | Cancer-Related Condition | NCIT:C8278 | TBD | ||||||||||||||||||||||||||
84 | signs and symptoms (Sign or Symptom NCIT:C100104) | 5 | ||||||||||||||||||||||||||||
85 | physiological measurements | anthropometry | weight | Weight | NCIT:C25208 | The vertical force exerted by a mass as a result of gravity. | 6 | |||||||||||||||||||||||
86 | height | Height | NCIT:C25347 | The vertical measurement or distance from the base to the top of an object; the vertical dimension of extension. | ||||||||||||||||||||||||||
87 | circulation and respiration | blood pressure | Blood Pressure | NCIT:C54706 | The pressure of the circulating blood against the walls of the blood vessels. | 6 | ||||||||||||||||||||||||
88 | heart rate (HR) | Heart Rate | NCIT:C49677 | The number of heartbeats per unit of time, usually expressed as beats per minute. | ||||||||||||||||||||||||||
89 | non-pharmacological interventions | surgical interventions | Vasculature Mechanical or Surgical Intervention | NCIT_C119212 | 6 | |||||||||||||||||||||||||
90 | medication (Medication NCIT:C459) | associated disease(s) | 6 | 4.1,4.2,5.1,5.2,5.3,5.4 | ||||||||||||||||||||||||||
91 | prescription | Prescription | NCIT:C28180 | A verbal or written order given by an authorized person instructing a patient to obtain and use a medical device, prescription or undergo a procedure. | ||||||||||||||||||||||||||
92 | drug response(s) | |||||||||||||||||||||||||||||
93 | posology | Dosage Regimen | NCIT:C142516 | The specific way a therapeutic drug is to be taken, including formulation, route of administration, dose, dosing interval, and treatment duration. | ||||||||||||||||||||||||||
94 | administration method | |||||||||||||||||||||||||||||
95 | life stage/time point | Life Stage | NCIT:C89335 | A designation assigned to a particular period during a life cycle, generally defined by chronological parameters. | TBD | |||||||||||||||||||||||||
96 | general research data documents (pdf, ppt, images etc) | Research Material | NCIT:C84338 | Any item with which a scientist works. | TBD | 5.3 | ||||||||||||||||||||||||
97 | statistics | summary statistics for additional data from federated sources | Statistical Analysis Documentation | NCIT:C115732 | Records pertaining to the statistical analyses and reports of a clinical trial. | TBD | 4.1,4.2 | |||||||||||||||||||||||
98 | ||||||||||||||||||||||||||||||
99 | ||||||||||||||||||||||||||||||
100 |