Session: Use of Big Data in Education Prevention Research: Trials and Tribulations (Society for Prevention Research 27th Annual Meeting)

2-053 Use of Big Data in Education Prevention Research: Trials and Tribulations

Schedule:
Wednesday, May 29, 2019: 4:00 PM-5:30 PM
Seacliff B (Hyatt Regency San Francisco)
Theme: Big Data Integration
Symposium Organizer:
Rashelle Musci
Discussant:
C. Hendricks Brown
Big data has become a buzzword in prevention science, particularly as funding agencies place a greater emphasis on the use of large administrative datasets to answer key research and policy questions. The use of big data inherently requires care and attention to the storage and analysis of such data. Further, its use requires a number of ethical and methodological considerations, particularly when it comes to the use of education data where participants are typically minors. The goal of this symposium is to discuss the use of education data within prevention research, with an emphasis on describing key ethical and methodological challenges and solutions. This symposium represents the conference theme of Big Data Integration.

The first paper, “Navigating heavy (metal) data: Lead exposure and educational outcomes” discusses the process through which a harmonized education dataset was created. This ‘big data’ dataset includes highly sensitive information and therefore extreme care was taken in terms of community engagement and data security. While the final dataset will be deidentified, authors focused on creating the dataset with ongoing communication with community stakeholders. In their presentation, the authors will emphasize the importance of establishing a strong relationship with key stakeholders as a key aspect of big data within education research in prevention science.

The second paper, “Challenges in using data from a large urban school district to answer questions about the impacts of children entering kindergarten not socially-behaviorally ready to learn,” uses a large administrative dataset from an urban school district to understand the longer term impacts of a lack of school readiness on key educational outcomes. Use of this large dataset leads to interesting methodological challenges, which are not limited to these types of educational data, including data quality issues and missing data concerns. Results are presented within the context of these challenges and possible solutions are discussed.

The third paper, “Using Big Data to Inform Prevention Science in Maryland,” uses data from a large statewide repository, with a focus on the role of poverty in key outcomes among public school students including standardized test scores, school drop-out, and college enrollment. Along with discussing the analytic and ethical challenges faced when utilizing this large dataset, authors will discuss how findings from big data analyses have informed policy and practice.

Finally, a discussant will highlight commonalities among the papers, discuss implications for prevention, and moderate a discussion between the presenters and the audience.


* noted as presenting author
175
Challenges in Using Data from a Large Urban School District to Answer Questions about the Impacts of Children Entering Kindergarten Not Socially-Behaviorally Ready to Learn
Amie F. Bettencourt, Ph.D., The Johns Hopkins University; Rashelle J. Musci, Ph.D., The Johns Hopkins University; Deborah Gross, Ph.D., Johns Hopkins University
176
Navigating Heavy (metal) Data: Lead Exposure and Educational Outcomes
Rashelle Musci, Ph.D., The Johns Hopkins University; Jeffrey Grigg, PhD, The Johns Hopkins University; Heather Volk, PhD, The Johns Hopkins University; Jana C. Goins, MHS, Baltimore City Department of Health; Faith Connolly, PhD, The Johns Hopkins University
177
Using Big Data to Inform Prevention Science in Maryland
Angela Henneberger, PhD, University of Maryland at Baltimore; Bess A. Rose, Ed.D, University of Maryland at Baltimore; Boyoung Nam, BA, University of Maryland at Baltimore; Dawnsha R. Mushonga, PhD, University of Maryland at Baltimore