Appendix J — Further Reading

Appendix D: Further Reading

A curated guide to essential resources for deepening your knowledge of AI in public health.

Online Courses

Foundational Machine Learning

Stanford CS229: Machine Learning - Instructor: Andrew Ng - Platform: Coursera / Stanford Online - Level: Intermediate - Duration: 11 weeks - Topics: Supervised learning, unsupervised learning, deep learning, best practices - Why take it: Gold standard ML course, mathematical rigor with practical applications - Link: https://www.coursera.org/learn/machine-learning

Fast.ai: Practical Deep Learning for Coders - Instructors: Jeremy Howard, Rachel Thomas - Platform: Fast.ai - Level: Beginner to advanced - Duration: Self-paced - Topics: Deep learning, computer vision, NLP, ethics - Why take it: Top-down approach, code-first, healthcare examples - Link: https://course.fast.ai/

MIT 6.S191: Introduction to Deep Learning - Platform: MIT OpenCourseWare - Level: Intermediate - Duration: 1 week intensive - Topics: Neural networks, CNNs, RNNs, GANs, reinforcement learning - Why take it: Comprehensive, includes labs - Link: http://introtodeeplearning.com/

Healthcare-Specific AI

AI in Healthcare Specialization (Stanford) - Platform: Coursera - Level: Intermediate - Duration: 3 months - Topics: Medical imaging, EHR data, clinical trials, deployment - Why take it: Healthcare-focused, taught by Stanford faculty - Link: https://www.coursera.org/specializations/ai-healthcare

AI for Medicine Specialization (deeplearning.ai) - Instructor: Andrew Ng et al. - Platform: Coursera - Level: Intermediate - Duration: 3 months - Courses: 1. AI for Medical Diagnosis 2. AI for Medical Prognosis 3. AI for Medical Treatment - Link: https://www.coursera.org/specializations/ai-for-medicine

MIT Critical Data: Secondary Analysis of Electronic Health Records - Platform: MIT OpenCourseWare - Level: Advanced - Duration: Self-paced - Topics: EHR data, MIMIC database, clinical prediction - Link: https://ocw.mit.edu/

Public Health and Epidemiology

Epidemiology in Public Health Practice (Johns Hopkins) - Platform: Coursera - Level: Beginner - Duration: 4 weeks - Topics: Study design, measures of disease, screening, causation - Why take it: Essential epidemiology foundation for AI applications - Link: https://www.coursera.org/specializations/epidemiology

Data Science for Public Health (Imperial College London) - Platform: Coursera - Level: Intermediate - Duration: 6 months - Topics: Data analysis, visualization, statistical modeling, machine learning in public health - Link: https://www.coursera.org/specializations/data-science-public-health

Ethics and Fairness

Data Science Ethics (University of Michigan) - Platform: Coursera - Level: Beginner to intermediate - Duration: 4 weeks - Topics: Privacy, fairness, transparency, accountability - Why take it: Critical thinking about AI ethics - Link: https://www.coursera.org/learn/data-science-ethics

Fairness in Machine Learning (MIT) - Platform: MIT OpenCourseWare - Level: Advanced - Duration: Self-paced - Topics: Fairness definitions, bias mitigation, fairness-aware ML - Link: https://stellar.mit.edu/

Books

Technical Foundations

“Pattern Recognition and Machine Learning” by Christopher Bishop - Level: Advanced - Topics: Bayesian methods, neural networks, graphical models - Best for: Mathematical foundations, reference text - Why read it: Comprehensive, rigorous, standard graduate text

“Deep Learning” by Goodfellow, Bengio, and Courville - Level: Intermediate to advanced - Topics: Neural networks, optimization, CNNs, RNNs, regularization - Best for: Deep learning theory and practice - Why read it: Authoritative, free online - Link: https://www.deeplearningbook.org/

“Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow” by Aurélien Géron - Level: Beginner to intermediate - Topics: Practical ML, neural networks, implementation - Best for: Learning by doing, practical projects - Why read it: Code-heavy, excellent examples

“Probabilistic Machine Learning” by Kevin Murphy - Level: Advanced - Topics: Probabilistic graphical models, Bayesian methods, deep learning - Best for: Rigorous treatment of ML from probabilistic perspective - Link: https://probml.github.io/pml-book/

Healthcare AI

“Machine Learning for Healthcare” edited by Ranganath, Perotte, Zitnik - Level: Advanced - Topics: Clinical prediction, imaging, EHR analysis, interpretability - Best for: Cutting-edge research in healthcare ML - Why read it: Comprehensive coverage of healthcare AI research

“Artificial Intelligence in Medicine” by Markus Wenzel - Level: Intermediate - Topics: Medical imaging, diagnosis, treatment planning, drug discovery - Best for: Overview of AI applications in medicine

“Clinical Prediction Models” by Ewout Steyerberg - Level: Intermediate to advanced - Topics: Risk prediction, model development, validation, updating - Best for: Rigorous approach to clinical prediction - Why read it: Gold standard for clinical prediction methodology

Ethics and Society

“Weapons of Math Destruction” by Cathy O’Neil - Level: General audience - Topics: Algorithmic bias, societal impact, fairness - Best for: Understanding societal implications - Why read it: Accessible, compelling examples, critical perspective

“Automating Inequality” by Virginia Eubanks - Level: General audience - Topics: Algorithms in social services, child welfare, healthcare - Best for: Case studies of algorithmic harm - Why read it: Real-world impact on vulnerable populations

“The Ethical Algorithm” by Kearns and Roth - Level: Intermediate - Topics: Fairness, privacy, game theory, algorithmic design - Best for: Technical approaches to ethical AI - Why read it: Bridges theory and practice

“Race After Technology” by Ruha Benjamin - Level: General audience - Topics: Race, technology, algorithmic bias, social justice - Best for: Critical race perspective on AI - Why read it: Essential perspective on tech and inequality

Public Health

“Modern Epidemiology” by Rothman, Greenland, Lash - Level: Advanced - Topics: Causal inference, study design, bias, confounding - Best for: Rigorous epidemiologic foundation - Why read it: Standard graduate text, essential for healthcare AI

“Infectious Disease Epidemiology” by Nelson and Williams - Level: Intermediate - Topics: Disease transmission, outbreak investigation, surveillance - Best for: Understanding infectious disease dynamics

Academic Journals

AI and Machine Learning

Top-Tier General ML: - NeurIPS (Conference on Neural Information Processing Systems) - ICML (International Conference on Machine Learning) - ICLR (International Conference on Learning Representations) - JMLR (Journal of Machine Learning Research) - Machine Learning (Springer)

Access: Many papers available on arXiv.org preprints

Healthcare AI

Clinical Journals Publishing AI Research: - The Lancet Digital Health 🎯 - Focus: Digital health technologies, AI applications - Open access - Link: https://www.thelancet.com/journals/landig

npj Digital Medicine (Nature) 🎯
- Focus: Digital health, AI, sensors, apps
- Open access
- Link: https://www.nature.com/npjdigitalmed/
Journal of the American Medical Informatics Association (JAMIA) 🎯
- Focus: Health informatics, clinical decision support, AI
- Link: https://academic.oup.com/jamia
NEJM AI (New England Journal of Medicine) 🎯
- Focus: AI in medicine, launched 2024
- High-impact clinical AI research
- Link: https://ai.nejm.org/

Specialized Healthcare AI: - Radiology: Artificial Intelligence - Focus: AI in medical imaging - Journal of Medical Internet Research - Focus: Digital health, eHealth - IEEE Journal of Biomedical and Health Informatics - Focus: Biomedical informatics, signals

Public Health

American Journal of Public Health
American Journal of Epidemiology
Epidemiology
International Journal of Epidemiology
BMC Public Health

Ethics and Fairness

ACM Conference on Fairness, Accountability, and Transparency (FAccT) 🎯
- Premier venue for algorithmic fairness
- Interdisciplinary: CS, law, social science
- Proceedings open access
AI and Ethics (Springer)
Ethics and Information Technology

Datasets and Data Repositories

Clinical Datasets

MIMIC-III / MIMIC-IV 🎯 - Content: ICU patient data (60,000+ admissions) - Includes: Vitals, labs, medications, notes, outcomes - Access: Free after credentialing - Use cases: Clinical prediction, NLP, time-series analysis - Link: https://mimic.mit.edu/

eICU Collaborative Research Database - Content: Multi-center ICU data (200,000+ admissions) - Access: Free after credentialing - Link: https://eicu-crd.mit.edu/

SEER (Surveillance, Epidemiology, and End Results) - Content: Cancer registry data - Access: Free, public - Link: https://seer.cancer.gov/

Medical Imaging

ChestX-ray14 🎯 - Content: 112,000 chest X-rays with labels - Labels: 14 pathologies - Access: Free - Link: https://nihcc.app.box.com/v/ChestXray-NIHCC

CheXpert - Content: 224,000 chest X-rays - Labels: 14 conditions - Access: Free - Link: https://stanfordmlgroup.github.io/competitions/chexpert/

RSNA Pneumonia Detection Challenge - Content: 30,000 chest X-rays - Labels: Pneumonia bounding boxes - Access: Kaggle - Link: https://www.kaggle.com/c/rsna-pneumonia-detection-challenge

The Cancer Imaging Archive (TCIA) - Content: Cancer imaging across modalities - Access: Free - Link: https://www.cancerimagingarchive.net/

Public Health / Surveillance

CDC Data - FluView: Influenza surveillance - COVID Data Tracker: COVID-19 data - WONDER: Mortality, natality data - Link: https://data.cdc.gov/

WHO Global Health Observatory - Content: Global health statistics - Topics: Mortality, disease burden, risk factors - Link: https://www.who.int/data/gho

Johns Hopkins COVID-19 Data - Content: Global COVID-19 cases, deaths, vaccinations - Updated: Daily - Link: https://github.com/CSSEGISandData/COVID-19

Synthetic / Benchmark

Synthea 🎯 - Content: Synthetic patient generator - Creates: Realistic EHR data - Use: Development, testing, education - Link: https://synthetichealth.github.io/synthea/

Tools and Software

Machine Learning Frameworks

Scikit-learn 🎯 - Language: Python - Focus: Classical ML algorithms - Best for: Tabular data, rapid prototyping - Docs: https://scikit-learn.org/

PyTorch 🎯 - Language: Python - Focus: Deep learning - Best for: Research, flexibility - Docs: https://pytorch.org/

TensorFlow / Keras 🎯 - Language: Python - Focus: Deep learning - Best for: Production deployment - Docs: https://www.tensorflow.org/

XGBoost 🎯 - Language: Python, R, others - Focus: Gradient boosting - Best for: Tabular data, competitions - Docs: https://xgboost.readthedocs.io/

Healthcare-Specific Libraries

lifelines - Focus: Survival analysis - Link: https://lifelines.readthedocs.io/

scikit-survival - Focus: Survival analysis with ML - Link: https://scikit-survival.readthedocs.io/

MNE-Python - Focus: EEG/MEG analysis - Link: https://mne.tools/

nibabel - Focus: Neuroimaging data (fMRI, MRI) - Link: https://nipy.org/nibabel/

Fairness and Explainability

Fairlearn (Microsoft) 🎯 - Focus: Fairness assessment and mitigation - Link: https://fairlearn.org/

AI Fairness 360 (IBM) 🎯 - Focus: Fairness metrics and algorithms - Link: https://aif360.mybluemix.net/

SHAP 🎯 - Focus: Model explanations - Link: https://shap.readthedocs.io/

LIME - Focus: Local explanations - Link: https://github.com/marcotcr/lime

InterpretML (Microsoft) - Focus: Interpretable models (EBM) - Link: https://interpret.ml/

Deployment and MLOps

MLflow - Focus: ML lifecycle management - Link: https://mlflow.org/

Weights & Biases - Focus: Experiment tracking, collaboration - Link: https://wandb.ai/

BentoML - Focus: Model serving - Link: https://bentoml.com/

Communities and Forums

Online Communities

Reddit: - r/MachineLearning - ML research and news - r/datascience - Data science practice - r/healthinformatics - Healthcare informatics - r/publichealth - Public health discussions

Stack Overflow: - Tags: machine-learning, healthcare, scikit-learn, tensorflow

Cross Validated (stats.stackexchange.com): - Statistical questions, study design, interpretation

Professional Organizations

American Medical Informatics Association (AMIA) - Focus: Health informatics, clinical informatics - Benefits: Conferences, networking, journals - Link: https://amia.org/

Healthcare Information and Management Systems Society (HIMSS) - Focus: Health IT, digital health - Link: https://www.himss.org/

American Public Health Association (APHA) - Focus: Public health practice and research - Link: https://www.apha.org/

Society for Medical Decision Making (SMDM) - Focus: Clinical decision making, modeling - Link: https://smdm.org/

Conferences

AI/ML in Healthcare: - MLHC (Machine Learning for Healthcare) - August - CHIL (Conference on Health, Inference, and Learning) - April - AMIA Annual Symposium - November

General ML: - NeurIPS - December - ICML - July - ICLR - May

Public Health: - APHA Annual Meeting - November - Society for Epidemiologic Research - June

Staying Current

Newsletters

The Batch (deeplearning.ai) - Weekly AI news curated by Andrew Ng - Link: https://www.deeplearning.ai/the-batch/

Import AI - Weekly AI research highlights by Jack Clark - Link: https://jack-clark.net/

AI Ethics Weekly - Weekly AI ethics news and research - Link: https://lighthouse3.com/newsletter/

Blogs

Distill.pub 🎯 - Beautiful, interactive ML explanations - High quality, peer-reviewed - Link: https://distill.pub/

Towards Data Science - Medium publication on data science - Practical tutorials and case studies

Google AI Blog - Google AI research updates - Link: https://ai.googleblog.com/

OpenAI Blog - OpenAI research and developments - Link: https://openai.com/blog/

DeepMind Blog - DeepMind research highlights - Link: https://www.deepmind.com/blog

Podcasts

The TWIML AI Podcast - Interviews with AI researchers and practitioners - Link: https://twimlai.com/

Data Skeptic - Data science, statistics, ML topics - Link: https://dataskeptic.com/

Linear Digressions - Machine learning explained accessibly - Link: https://lineardigressions.com/

Practical Skill Development

Kaggle

Healthcare Competitions: - RSNA Pneumonia Detection - Diabetic Retinopathy Detection - SIIM-ACR Pneumothorax Segmentation

Learn: Kaggle Learn micro-courses on ML, deep learning, ethics

DataCamp / Coursera Projects

Guided Projects: - Clinical data analysis - Medical image classification - Survival analysis - Time series forecasting

Advanced Topics

Causal Inference

“The Book of Why” by Judea Pearl - Accessible introduction to causality - Why correlation ≠ causation matters for AI

“Causal Inference: What If” by Hernán and Robins - Rigorous causal inference methods - Free online: https://www.hsph.harvard.edu/miguel-hernan/causal-inference-book/

Course: Introduction to Causal Inference (Brady Neal) - Free online course - Link: https://www.bradyneal.com/causal-inference-course

Survival Analysis

“Survival Analysis: A Self-Learning Text” by Kleinbaum and Klein - Accessible introduction - Clinical focus

“Modeling Survival Data” by Therneau and Grambsch - Advanced methods, R focus

Time Series

“Forecasting: Principles and Practice” by Hyndman and Athanasopoulos - Comprehensive forecasting methods - Free online: https://otexts.com/fpp3/

“Deep Learning for Time Series Forecasting” by Jason Brownlee - Practical guide to DL for time series

Privacy-Preserving ML

Differential Privacy Book (Dwork and Roth) - Foundational text on differential privacy - Link: https://www.cis.upenn.edu/~aaroth/Papers/privacybook.pdf

Federated Learning Tutorial - Distributed ML preserving privacy - Link: https://federated.withgoogle.com/

Career Development

Certifications

Google Professional ML Engineer - Cloud-based ML deployment - Link: https://cloud.google.com/certification/machine-learning-engineer

AWS Certified Machine Learning - Specialty - ML on AWS platform - Link: https://aws.amazon.com/certification/certified-machine-learning-specialty/

Clinical Informatics Board Certification (ABPM) - For physicians interested in informatics - Link: https://www.abpm.org/

Job Boards

AI/ML Healthcare Jobs: https://www.aijobs.com/
Health Informatics Jobs: https://www.himss.org/resources/jobmine
Academic Positions: https://academicjobsonline.org/

Conclusion

The field of AI in public health is rapidly evolving. Stay current by: - ✅ Following 2-3 key journals in your area - ✅ Subscribing to 1-2 newsletters - ✅ Attending 1-2 conferences annually - ✅ Practicing on real datasets - ✅ Engaging with online communities - ✅ Contributing to open-source projects

Remember: The best way to learn is by doing. Pick a project, get your hands dirty with code, and learn iteratively!

Ready to dive deeper? Start with one resource from each category that matches your current level and interests.