Apply your statistical analysis skills to analyze real-world healthcare data, perform hypothesis testing, and generate actionable insights that could improve patient outcomes.
In this comprehensive assignment, you will work as a Healthcare Data Analyst at HealthFirst Medical Center. The hospital administration needs you to analyze patient data, lab results, and treatment outcomes to identify patterns that could improve healthcare quality and patient satisfaction.
"Welcome to HealthFirst Medical Center! We've been collecting patient data for the past year, and our board is asking tough questions about treatment effectiveness and resource allocation. We need your statistical expertise to provide evidence-based answers.
Specifically, I need you to investigate:
The board meets next month, and I need a comprehensive statistical report with clear visualizations and actionable recommendations. Please use proper statistical methods and clearly state your hypotheses, test results, and conclusions.
Looking forward to your analysis!"
- Dr. Sarah Chen, Chief Medical Officer
Download the three datasets below. These contain anonymized patient data from HealthFirst Medical Center.
Patient admission records including demographics, department, charges, vitals, and outcomes.
patient_id, admission_date, age, gender, departmentadmission_type, length_of_stay, total_charges, insurance_typebp_systolic, heart_rate, satisfaction_score, readmitted_30day, discharge_dispositionLaboratory test results with values, normal ranges, and status indicators for various tests.
result_id, patient_id, test_date, test_nameresult_value, unit, normal_range_low, normal_range_highstatus (Normal/Low/High), ordering_physicianTreatment records including medications, procedures, dosages, and patient outcomes.
treatment_id, patient_id, treatment_date, treatment_typetreatment_name, dosage, frequency, durationcost, outcome (Improved/Stable/Resolved), side_effects_reporteddescriptive_stats.pnganova_length_of_stay.pngttest_satisfaction.pngchisquare_insurance.pngcorrelation_analysis.pngtreatment_analysis.png| Component | Points | Criteria |
|---|---|---|
| Descriptive Statistics | 15 | Complete summary statistics, proper interpretation, quality visualization |
| One-Way ANOVA | 20 | Correct assumptions testing, proper test execution, effect size calculated, post-hoc if needed |
| T-Test Analysis | 15 | Correct test selection, CI construction, effect size (Cohen's d), clear interpretation |
| Chi-Square Test | 15 | Proper contingency table, expected frequencies verified, CramΓ©r's V calculated |
| Correlation Analysis | 15 | Correlation matrix complete, regression assumptions checked, quality heatmap |
| Treatment Analysis | 20 | Two-way ANOVA correct, CIs properly constructed, non-parametric comparison included |
| Total | 100 |
Create a public GitHub repository with the exact name shown below, add all required files, and submit through the submission portal.
github.com/<your-username>/healthfirst-analysis
healthfirst-analysis/
βββ statistical_analysis.ipynb
βββ statistical_report.pdf
βββ visualizations/
β βββ descriptive_stats.png
β βββ anova_length_of_stay.png
β βββ ttest_satisfaction.png
β βββ chisquare_insurance.png
β βββ correlation_analysis.png
β βββ treatment_analysis.png
βββ data/
β βββ (downloaded CSV files)
βββ README.md
All files are required. Submission will fail if any file is missing.