Sequencing data, imaging data, clinical data, and data from cancer registries presents several challenges in terms of heterogeneity and a lack of precision. Big analytics serve several purposes including identifying complex data representations, optimization of algorithms, traceability, and predictability. Big data analytics in healthcare context solve several problems such as interpreting findings in personalized medicine and identifying care patterns.
Geography: Global; Focus Area: Healthcare data analysis for precision medicine
Biomedical research that is driven by omics approaches is the future of personalized medicine. Cirillo and Valencia (2019) discuss how cloud infrastructures will serve as foundations of data analysis for personalized medicine through the application of deep learning and machine learning techniques.
The availability of large-scale clinical and molecular data is a significant challenge for data analysis and interpretation (Cirillo & Valencia, 2019). The success of personalized medicine will depend on the efficiency of data-driven systems to generate mechanistic models for the design of clinical procedures.
International Data Sources for Personalized Medicine
The availability of data has been driven by several factors (Cirillo & Valencia, 2019):
- High throughput genome sequencing at low costs and increased access
- Availability of information extracted from imaging data and clinical records
- Massive efforts in community-based data collection through global initiatives such as Global Alliance for Genomics and Health (GA4GH), big data to knowledge (BD2K) initiative, and ELIXIR research infrastructure
- Domain-specific initiatives such as International Rare Disease Research Consortium (IRDiRC) and Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE)
- Genomic and epigenetics data from Human Genome Diversity Project (HGDP), Encyclopedia of DNA Elements (ENCODE), Global Network of Personal Genome Projects (PGP), International Human Epigenome Consortium (IHEC), and NIH Roadmap Epigenomics Mapping Consortium (Roadmap)
- Cancer registries including International Cancer Genome Consortium (ICGC) and the Cancer Genome Atlas (TCGA)
Promotion: Genomics and Personalized Medicine: What Everyone Needs to Know® Paperback – Illustrated, 24 March 2016 by Michael Snyder (Author)
Big data is multi-spectral, heterogeneous, incomplete, and imprecise. Analytics for big data require capabilities to interpret complex data representations, algorithm optimization, modeling, and increased computational power (Cirillo & Valencia, 2019). Analytics systems for unstructured data need to support prediction, traceability, and decision support, and be able to identify patterns of care. Functional characterization of medical data often proceeds at a slower pace when compared to data gathering, and increased challenges are encountered when multi-omics data has to be combined with phenotypic patient data to arrive at interpretations in personalized medicine.
Multiple Data Types Available from Big Data
High volume, sensitive, and complex patient data, such as neuroimaging and genomics data is growing by petabytes annually. This data is accompanied by metadata and quantitative measurements. Heterogeneous data requires scalable and integrative platforms. Common data types available to explore frontiers in personalized include (Cirillo & Valencia, 2019):
- Symptom descriptions (unstructured data)
- ICD codes (structured data)
- Whole genome sequencing data (WGS data)
- Whole exome sequencing data (WEX)
- Transcriptome sequencing data
- Proteome and interactome profiling data
- EHR patient data
- Imaging data
- Multi-omics data
- Data from implants and wearable devices
- Patient generated health data (PGHD)
- Real-time data from sensors and biometric devices
- Mobile-tracked data such as treatment history and lifestyle choices
Promotion: Visualizing Health and Healthcare Data: Creating Clear and Compelling Visualizations to "See how You're Doing" Paperback – Import, 17 December 2020 by K Rowell (Author)
Heterogeneity in Big Data Applications
Big data has been applied to multiple disciplines in medicine including cancer and rare disease research, biomarker and drug development, diabetes and cardiovascular disease research, and neurodegeneration studies. Deciphering distributed data collaborative effort (Cirillo & Valencia, 2019). Some examples of personalized medicine projects are personalized brain models of the Human Brain project undertaken by the European Commission. Likewise, the Human Genome project forms the basis of large-scale biomedical projects, and the International Personalized Medicine consortium (PerMed) facilitates the study of mechanisms of hematopoiesis and epigenetics. Data handling is regulated by ethical framework for anonymous processing, such as General Data Protection Regulation (EU). The effectiveness of biomedical data rests on certain characteristics beyond security, such as accessibility, ease of finding data, interoperability, and reusability (FAIR).
Complex Data Analysis
Data analysis requires high performance computing (HPC) for the extraction of knowledge from big data. Computing capabilities such as streaming data analysis, data-intensive simulations, machine learning, deep learning, and neural networks for high dimensional data, and the investigation of multi-view data through data-driven integrated workflows promote complex data analysis (Cirillo & Valencia, 2019).
Deep learning methods (convolutional / neural networks and recursive neural networks) have interesting applications in prediction and classification such as assessing disease risk, hospital outcome prediction, medical image analysis, prediction of promoter-enhancer interaction and transcription factor binding site, metagenome classification, drug design, and epileptic seizure prediction (Cirillo & Valencia, 2019).
Therefore, big data can be transformational in healthcare and biomedical research. Big data types and analytics are highly challenging and require advanced machine learning methods to arrive at effective inferences. Continued research and development in these areas will lead to innovative solutions in the field of personalized medicine.
Promotion: Big Data in Healthcare: Statistical Analysis of the Electronic Health Record Hardcover – 28 February 2020 by Farrokh Alemi (Author)
Keywords
healthcare data, data analysis, big data analytics, heterogeneity, complex data patterns, precision medicine, big data, big data frameworks, personalized medicine
References
Cirillo, D., & Valencia, A. (2019). Big data analytics for personalized medicine. Current Opinion in Biotechnology, 58, 161–167. https://doi.org/10.1016/j.copbio.2019.03.004
No comments:
Post a Comment