Data Governance for Ethical AI

Neil D. Lawrence

Virtual DSN AI Bootcamp

There are three types of lies: lies, damned lies and statistics

??

There are three types of lies: lies, damned lies and statistics

Benjamin Disraeli

There are three types of lies: lies, damned lies and statistics

Benjamin Disraeli 1804-1881

There are three types of lies: lies, damned lies and ‘big data’

Neil Lawrence 1972-?

Mathematical Statistics

‘Mathematical Data Science’

Embodiment Factors

bits/min billions 2,000
billion
calculations/s
~100 a billion
embodiment 20 minutes 5 billion years

Evolved Relationship with Information

New Flow of Information

Evolved Relationship

Evolved Relationship

The Gartner Hype Cycle

Cycle for ML Terms

Challenges

  1. Paradoxes of the Data Society

  2. Quantifying the Value of Data

  3. Privacy, loss of control, marginalization

Challenges

  1. Paradoxes of the Data Society

  2. Quantifying the Value of Data

  3. Privacy, loss of control, marginalization

Privacy, Loss of Control and Marginalization

  • Society is becoming harder to monitor
  • Individual is becoming easier to monitor

Digital Revolution and Inequality

  • Potential for explicit and implicit discrimination on the basis of race, religion, sexuality, health status
  • All prohibited under European law, but can pass unawares, or be implicit
  • GDPR: General Data Protection Regulation

Discrimination

  • Potential for explicit and implicit discrimination on the basis of race, religion, sexuality, health status
  • All prohibited under European law, but can pass unawares, or be implicit
  • GDPR: Good Data Practice Rules

Marginalization

  • Credit scoring, insurance, medical treatment
  • What if certain sectors of society are under-represented in our analysis?
  • What if Silicon Valley develops everything for us?

Digital Revolution and Inequality?

Data Science Africa is a bottom up initiative for capacity building in data science, machine learning and AI on the African continent

Example: Prediction of Malaria Incidence in Uganda

Martin Mubangizi Ricardo Andrade Pacecho John Quinn

Malaria Prediction in Uganda

(Andrade-Pacheco et al., 2014; Mubangizi et al., 2014)

Tororo District

Malaria Prediction in Nagongera (Sentinel Site)

Mubende District

Malaria Prediction in Uganda

GP School at Makerere

Kabarole District

Early Warning System

Early Warning Systems

Feudal Era Data Ecosystem

Information Barons threaten our Privacy

African Data Sharing Covid-19

Morine Amutorine
Jessica Montgomery Victor Ohuruogu

Amelioration

  • Work to ensure individual retains control of their own data
  • We accept privacy in our real lives, need to accept it in our digital
  • Control of persona and ability to project
  • Need better technological solutions: trust and algorithms.

Data Governance Toolkit

Society

Vulnerabilities

Enfranchisement

Individuals

Personal Data Trusts

Motivation

  1. Indsidious decision-making that has downstream instrumental effects we don’t control.
  2. A power-asymmetry between data-controllers and data-subjects
  3. A loss of personhood in the re-representation of ourselves in the digital world.
  4. The GDPR’s endeavour to curb contractual freedom cannot by itself reverse the power-asymmetry between data-controllers and data-subjects.

Analogy

  • Digital Democracy vs Digital Oligarchy Lawrence (2015a) or Information Feudalism Lawrence (2015b)
  • Data subjects, data controllers and data processors.

Data Trusts Initiative

References

Thanks!

Andrade-Pacheco, R., Mubangizi, M., Quinn, J., Lawrence, N.D., 2014. Consistent mapping of government malaria records across a changing territory delimitation. Malaria Journal 13. https://doi.org/10.1186/1475-2875-13-S1-P5
Delacroix, S., Lawrence, N.D., 2019. Bottom-up data trusts: Disturbing the ‘one size fits all’ approach to data governance. International Data Privacy Law. https://doi.org/10.1093/idpl/ipz014
Edwards, L., 2004. The problem with privacy. International Review of Law Computers & Technology 18, 263–294.
Lawrence, N.D., 2017. Living together: Mind and machine intelligence. arXiv.
Lawrence, N.D., 2016. Data trusts could allay our privacy fears.
Lawrence, N.D., 2015b. The information barons threaten our autonomy and our privacy.
Lawrence, N.D., 2015a. Beware the rise of the digital oligarchy.
Mubangizi, M., Andrade-Pacheco, R., Smith, M.T., Quinn, J., Lawrence, N.D., 2014. Malaria surveillance with multiple data sources using Gaussian process models, in: 1st International Conference on the Use of Mobile ICT in Africa.