AMP
Overview of
DATA & ANALYTICS
October 02, 2021
Session 1
MEET YOUR PRESENTER
Kiran Kumar
IIT ROORKEE 1989, E&C
BACKGROUND:
MS Comp. Sc / MBA. Industry experience in Fin Svcs and Health Care.
CURRENT ROLE:
Chief Data & Analytics Officer
FUN FACT:
Enjoy hiking and mountain biking
CONTACT:
This seminar series is arranged under the auspices of
The Roorkee Alumni Mentorship Program (AMP)
More info at: iitramp.org
Connect students and Alumni
Awesome mentors
TIMELINE
Data and Analytics Overview
Overview of Machine Learning and AI
Shagun Sodhani
SESSION 2
Analytics in Financial Services
Gursheen Kaur
SESSION 4
Real Life Case Study of Data Science Project
Himanshu Gupta
SESSION 3
SESSION 1
Each session 1 hour in duration.
Keep yourself on mute
Materials of the presentation
Asking Questions
Housekeeping Items
Sessions are being recorded and the video and slide deck will be made available.
Send questions via chat, or use the raise hand feature. You can also ask live at the end of the presentation.
Except when you need to speak, keep yourself on mute.
AGENDA
The Data Landscape
Intro
What is Big Data
Bringing It All Together
Q&A
03
01
02
04
05
Maury’s Wind and Currents Charts
Matthew Fontaine Maury (1806 – 1873)
Data
(Ship Logs)
Information
(Charts)
Consumption
(Navigation)
Once Upon a Time…
8
Non-depreciating
Always growing
Strategic
Reusable
“Data! Data Data! I can’t make bricks without clay!” - Sherlock Holmes
Data is the new Oil
Data are to this century what oil was to the last one: a driver of growth and change.
- The Economist, May 2017
Big Data?
Volume
Velocity
Variety
What do you mean by “big data”?
“In God we trust, all others bring data.” — W Edwards Deming
“Data is a precious thing and will last longer than the systems themselves.”
- Tim Berners-Lee �(inventor of the World Wide Web)
9
The Data Landscape
Foundational Technologies
Applied Technologies
Business Solutions
Data Storage
Data Movement
Data Analysis
Data Visualization
Data Lake / Warehouse / Mart
Analytical and ML Applications
Reports / Interactive Dashboards
Financial Services
Health Sciences
Manufacturing
Retail
Travel, etc.
Data analyst
Data scientist
Testing / QA
Engineer
BI developer
Data engineer
SW engineer
Academia
Research
Faculty
Career paths
10
The Data Landscape - Foundational Technologies
Store
Analyse
Blob Store
File Store
Databases
SQL
search
ML / AI
Relational
Columnar
Graph
Document
Cloud DB
Etc...
Capture and Move
ETL / ELT
Streaming
Messaging
ftp
IOT
Visualize
Static Reports
Interactive Dashboards
Conversational Analytics
alerts
Data Quality�&�Governance
Metadata Mgmt
Lineage
Data Dictionary
Data Glossary
Data Catalog
11
The Data Landscape - Sample Business Applications of AI / ML
12
Bringing it all together - Real World Evidence
Governance Layer
Data Sources
Transactional Data�(Pharmacy, lab, medical))
Mobile Apps & Wearables
Reference Data�(Drugs, Diseases)
Social�(Twitter, FB)
Streaming Layer
Data Ingestion
Data Processing
Analytics Layer
Data Lake
Raw Data
Data Hub
Data Warehouse
Machine Learning
Streaming dashboard
Dashboards
Reports
Alerts
Data Dictionary
Data Lineage
Curated Data
Testing the efficacy of new drugs after they are released to the general patient population.
Drug to drug interaction issues
Severe and unforeseen side effects
Long-term impact and efficacy
Next week - Overview of ML / AI
Shagun Sodhani
QUESTIONS