№2, 2017

ANALYZING BIG VOLUME MEDICAL DATA: CURRENT PROBLEMS AND PROSPECTS

Rena T. Gasımova

Big Data is becoming a promising area in the field of healthcare. It is used to improve the results of the analysis of large volumes of data sets and to reduce the costs. Increase in data volume and demand for ad hoc analysis of data created one of the biggest problems of Big Data called Big Data analysis. The article deals with actual problems of a large analysis of data generated in the field of health, and explores the main characteristics of the data. At the same time, it defines the various opportunities, advantages and characteristics of the data in the area of health, and provides a number of recommendations (pp.63-72).

Keywords: data warehouse, big data, big data analytics, biometric data, evidence-based medicine, genomic analytics, MapReduce, Hadoop
DOI : 10.25045/jpit.v08.i2.07
References
  • Laney D. 3D Data Management: Controlling Data Volume, Velocity and Variety, Technical report, META Group, Inc (now Gartner, Inc.), February 2001. http://blogs.gartner.com
  • Laura B. Madsen Data-Driven Healthcare: How Analytics and BI are Transforming the Industry, Publisher: John Wiley & Sons, Inc.,2014, 224 p.
  • Wullianallur R. Data Mining in Health Care. Healthcare Informatics: Improving Efficiency and Productivity, CRC Press, 2010, Taylor & Francis, pp.211–224. crcnetbase.com.
  • Wullianallur R., Viju R. Big data analytics in healthcare: promise and potential, Raghupathi and Raghupathi; licensee BioMed Central Ltd. Health Information Science and Systems, 2014, 2, no.3, pp.2–10. online resource,www.hissjournal.com.
  • Bill F. The taming of large data. How to extract knowledge from data arrays using deep analytics, trans. from English. Andrei Baranov, Moscow: Mann, Ivanov and Ferber, 2014, 352 p.
  • Bian J., Topaloglu U., Yu F. Towards Large-scale Twitter Mining for Drug-related Adverse Events / Proceedings of the 2012 international workshop on Smart health and wellbeing (SHB'12), New York, USA, 2012, pp.25–32.
  • Mayer-Schonberger V., Kukier K. Big data. A revolution that will change how we live, work and think, trln.from English. Inna Gaiduk, Moscow: Mann, Ivanov and Ferber, 2013, 240 p.
  • Gudivada V.N., Rao D., Raghavan V.V. NoSQL Systems for Big Data Management
    / Proceedings of the 2014 IEEE World Congress on Services (SERVICES '14), USA, 2014, pp.190–197.
  • Institute for Health Technology Transformation (IHTT): Transforming Health Care through Big Data Strategies for leveraging big data in the health care industry, 2013. http://c4fd63cb482ce6861463-bc6183f1c18e748a49b87a25911a0555.r93.cf2.rackcdn.com/ iHT2_BigData_2013.pdf
  • Ayankoya K., Calitz A., Greyling J. Intrinsic Relations between Data Science, Big Data, Business Analytics and Datafication / Proceedings of the Southern African Institute for Computer Scientist and Information Technologists Annual Conference (SAICSIT 2014), New York, USA, 2014, pp.192.
  • Gasimova R.T. Conceptual basis for the creation of a knowledge base of domain names // News of Baku University, Physics and Mathematics series, No 4, 2010, pp.95–102.
  • Gasimova R.T. Big data analytics: existing approaches, problems and solutions // Problems of Information Technology, 2016, No1, pp.75–93.
  • Clifford L. Big data: How do your data grow? // Nature, 2008, vol.455, pp. 28–29.
  • Alguliyev RM, Hacirahimova M.Sh. Big data phenomenon: problems and opportunities // Problems of Information Technology, 2014, No2, pp.3–16.
  • Andreas H. Biomedical Informatics 2014: Discovering Knowledge in Big Data, Publisher: Springer International Publishing AG., 1st. Edition, 2014, 606 p.
  • Babu S., Herodotou H. Massively Parallel Databases and MapReduce Systems, Foundations and Trends in Databases, 2013, vol.5, no.1, pp.1–104.
  • Deng X., Donghui W. Big data and predictive modeling topics in healthcare / Proceedings of the 6th ACM Conference on Bioinformatics, Computational Biology and Health Informatics (BCB '15), New York, USA, 2015, p.677.
  • Intel: Big Data Analytics, 2012, http://www.intel.com/content/dam/www/public/us/en/documents/ reports/data-insights-peer-research-report.pdf
  • Jimeng S., Chandan K. Big data analytics for healthcare / Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '13), New York, USA, 2013, pp.1525.
  • Varun C., Sukumar Sreenivas R., Schryver Jack C. Knowledge discovery from massive healthcare claims data / Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '13), New York, USA, pp.1312–1320, 2013.
  • Davenport T.H., Dwight McNeill. Analytics in Healthcare and the Life Sciences: Strategies, Implementation Methods, and Best Practices, Publisher: Pearson Education, USA, 2013, 352 p.
  • Nambiar U., Niranjan T. Data management & analytics for healthcare (DARE 2013)
    / Proceedings of the 22nd ACM international conference on Information & Knowledge Management (CIKM '13)New York, USA, 2013, pp.2565–2566.
  • Korolyuk I.P. Medical Informatics. Textbook, 2 ed, Samara: "OFORT", "SamGMU", 2012, 244 p. http://www.samsmu.ru/files/smu/chairs/radiology/med_inf.pdf
  • Yamakami T. Inter-service revisit analysis of three user groups using intra-day behavior in the mobile clickstream / Proceedings of the 2009 International Conference on Hybrid Information Technology (ICHIT '09), New York, USA,2009, 340–344.
  • Kolesnichenko O.Yu., Smorodin G.N. Big Data: Social Challenges / Abstracts of the V Sociological Grushin Conference "Big Sociology: Expanding the Data Space", Proceedings of the Conference, M: VTsIOM, 2015, pp.26–29.
  • Schmarzo B. Big Data MBA: Driving Business Strategies with Data Science, Publisher: John Wiley & Sons, Ing. 1st. Edition, 2015, 312 p.
  • Hadoop Distributed File System. http://hadoop.apache.org/docs
  • Vignesh P. Big Data Analytics with R and Hadoop, Publisher: Packt Publishing Ltd, 2013, pp.238.
  • Chuck L. Hadoop in Action, Publisher: DMK Press, 2012, 424 p.
  • Ohlhorst Frank Big Data Analytics: Turning Big Data into Big Money, Publisher: John Wiley & Sons Inc, 2013, 176 p.
  • Translational Genomics Research Institute, Arizona, USA, https://www.tgen.org/
  • DNAnexus, Providing cloud solutions for the global genomics industry, USA, https:// www.dnanexus.com
  • Human Longevity, Inc., San Diego, California, USA, http://www.humanlongevity.com/ about/j-craig-venter
  • Department of Health & Human Services (HHS) USA, http://www.hhs.gov/about/ index.html
  • Assunção M.D., Rodrigo N., Bianchi S., Netto Marco A.S., Rajkumar B. Big Data computing and clouds // Journal of Parallel and Distributed Computing, 2015, vol.79, pp.3–15.