Data Science
Grade IX
Version 1.0
Chapter 4: Ethics in data science
At the end of this chapter, students will understand the following:
What is data ethics?
Data ethics refers to the principles and practices that guide the responsible and ethical use, handling, and management of data.
Data ethics ensures respect for privacy, fairness, and transparency in decision-making processes.
Key aspects of data ethics include:
The few ethical guidelines around data analysis are:
• Data governance is critical
• Protect your customer
• Do not lie
• Understand the role of data quality
• Private data and identity should remain private
• Shared private information should be treated confidentially
Need of ethical guidelines
For Example, prohibitions against falsifying, fabricating, or misrepresenting data promote the truth and minimize error.
Falsifying: Manipulating research data with the intention of giving a false impression.
Fabricating Data: A researcher creates data points or entire datasets that never existed.
Misrepresenting data: representing incorrect data.
Need of ethical guidelines
Data quality refers to the accuracy, completeness, and consistency of the data.
Data integrity means the data has been maintained in a secure, unaltered state, free from manipulation or falsification.
Need of ethical guidelines
It suggests that new analyses should only be conducted if they provide clear value or benefit to people. If an analysis has no meaningful positive impact or could potentially cause harm, it should not be pursued.
Need of ethical guidelines
Minimizing data usage: refers to the practice of collecting, storing, and analyzing only the data that is necessary for a specific purpose.
Use the least amount of data necessary to meet the desired objective, understanding that reducing data usage encourages more sustainable and less risky analysis.
Goals of ethical guidelines
The goal of ethical guidelines is to help data analysts make decisions ethically. Moreover, the ethical guidelines aim to encourage accountability by enlightening those who rely on data analysis of the standards they should expect.
Goals of ethical guidelines
The key goals are:
• Professional integrity and accountability
• The integrity of data and methods
• Follow informed-consent rules
• Respect confidentiality and privacy
Data governance framework
Data governance framework
Why do we need to govern data?
To Improve data quality through efforts to identify and fix errors in data sets.
To increase analytics accuracy and give decision-makers reliable information.
To ensure compliance with data privacy laws and other regulations
To implement and enforce policies that help prevent data errors and misuse
To avoid inconsistent data in different departments and business units
To come to an agreement on standard data definitions for a shared understanding of data
Goals of data governance
Goals of data governance