Introduction to Data Science
By
S.V.V.D.Jagadeesh
Sr. Assistant Professor
Dept of Artificial Intelligence & Data Science
LAKIREDDY BALI REDDY COLLEGE OF ENGINEERING
Saturday, December 21, 2024
Previous Class Discussions
LBRCE
IDS
At the end of this session, students will be able to:
Session Outcomes
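Big Data Eco-System
1. Distributed File Systems
2. Distributed Programming Framework
3. Data Integration Framework
4. Machine Learning Framework
5. NoSQL Databases
6. Scheduling Tools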
Big Data Eco-System
7. Benchmarking Tools
8. System Deployment
9. Service Programming
10. Security
Distributed File Systems
Distributed File Systems
■ They can store files larger than the disk of any single computer.
■ Files are automatically replicated across multiple servers for redundancy or parallel operations, and the complexity of doing so is hidden from the user.
■ The system scales easily: you are no longer bound by the memory or storage restrictions of a single server.
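The three properties above can be illustrated with a small in-memory sketch. It is a toy model under stated assumptions, not how any real distributed file system is implemented: the server names, the block size, the round-robin placement rule, and the function names `store_file` and `read_file` are all hypothetical and chosen only for illustration.

```python
def store_file(data, servers, replication=2, block_size=4):
    """Split `data` into blocks and copy each block to `replication` servers.

    Placement is a simple round-robin (an assumption for this sketch).
    Returns the per-server storage and the number of blocks written.
    """
    if replication > len(servers):
        raise ValueError("replication cannot exceed the number of servers")
    storage = {name: {} for name in servers}
    blocks = [data[i:i + block_size] for i in range(0, len(data), block_size)]
    for index, block in enumerate(blocks):
        for offset in range(replication):
            target = servers[(index + offset) % len(servers)]
            storage[target][index] = block
    return storage, len(blocks)


def read_file(storage, block_count, failed=frozenset()):
    """Reassemble the file, skipping servers listed in `failed`."""
    parts = []
    for index in range(block_count):
        for name, held in storage.items():
            if name not in failed and index in held:
                parts.append(held[index])
                break
        else:
            # No surviving server holds this block.
            raise IOError(f"block {index} is unavailable")
    return b"".join(parts)


if __name__ == "__main__":
    content = b"distributed file systems"
    nodes = ["node-a", "node-b", "node-c"]  # hypothetical server names
    storage, count = store_file(content, nodes)
    # The caller never sees where blocks live; one failed server is tolerated.
    print(read_file(storage, count, failed={"node-b"}) == content)
```

No single server holds the whole file, yet the reader gets it back intact even with one server down, because every block has a second copy elsewhere. Adding servers to the list increases total capacity without changing the calling code.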
Distributed Programming Framework
Distributed Programming Framework
Data Integration Framework
Machine Learning Framework
■ PyBrain for neural networks: Neural networks are learning algorithms that mimic the human brain in learning mechanics and complexity. They are often regarded as advanced, black-box techniques.
■ NLTK or Natural Language Toolkit: As the name suggests, its focus is working with natural language. It is an extensive library that comes bundled with a number of text corpora to help you model your own data.
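To make the neural-network idea in the PyBrain bullet less of a black box, here is a minimal sketch of a single artificial neuron (a perceptron) learning the logical AND function. It uses plain Python rather than PyBrain, and the function names, learning rate, and epoch count are assumptions chosen for illustration only.

```python
def predict(weights, bias, inputs):
    """Fire (return 1) when the weighted sum of the inputs exceeds zero."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if total > 0 else 0


def train_perceptron(samples, epochs=20, learning_rate=1.0):
    """Perceptron learning rule: nudge the weights after every wrong answer."""
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    for _ in range(epochs):
        for inputs, target in samples:
            error = target - predict(weights, bias, inputs)
            # No change when the prediction is already correct (error == 0).
            weights = [w + learning_rate * error * x
                       for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias


if __name__ == "__main__":
    # Truth table for logical AND: output is 1 only when both inputs are 1.
    and_samples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
    weights, bias = train_perceptron(and_samples)
    for inputs, target in and_samples:
        print(inputs, predict(weights, bias, inputs), target)
```

A real network stacks many such units in layers, which is where the learned weights stop being easy to interpret and the "black box" reputation comes from.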
Machine Learning Framework
■ Pylearn2: Another machine learning toolbox, but a bit less mature than scikit-learn.
■ TensorFlow: An open-source deep learning library from Google with a Python API.
Machine Learning Framework
NoSQL Databases
Types of NoSQL Databases
Types of NoSQL Databases
Types of NoSQL Databases
Scheduling Tools
Benchmarking Tools
System Deployment
Service Programming
Security
Summary