Play all audios:
ABSTRACT Public agencies aiming to enforce environmental regulation have limited resources to achieve their objectives. We demonstrate how machine-learning methods can inform the efficient
use of these limited resources while accounting for real-world concerns, such as gaming the system and institutional constraints. Here, we predict the likelihood of a facility failing a
water-pollution inspection and propose alternative inspection allocations that would target high-risk facilities. Implementing such a data-driven inspection allocation could detect over
seven times the expected number of violations than current practices. When we impose constraints, such as maintaining a minimum probability of inspection for all facilities and accounting
for state-level differences in inspection budgets, our reallocation regimes double the number of violations detected through inspections. Leveraging increasing amounts of electronic data can
help public agencies to enhance their regulatory effectiveness and remedy environmental harms. Although employing algorithm-based resource allocation rules requires care to avoid
manipulation and unintentional error propagation, the principled use of predictive analytics can extend the beneficial reach of limited resources. Access through your institution Buy or
subscribe This is a preview of subscription content, access via your institution ACCESS OPTIONS Access through your institution Access Nature and 54 other Nature Portfolio journals Get
Nature+, our best-value online-access subscription $29.99 / 30 days cancel any time Learn more Subscribe to this journal Receive 12 digital issues and online access to articles $119.00 per
year only $9.92 per issue Learn more Buy this article * Purchase on SpringerLink * Instant access to full article PDF Buy now Prices may be subject to local taxes which are calculated during
checkout ADDITIONAL ACCESS OPTIONS: * Log in * Learn about institutional subscriptions * Read our FAQs * Contact customer support SIMILAR CONTENT BEING VIEWED BY OTHERS DETECTION OF
UNTREATED SEWAGE DISCHARGES TO WATERCOURSES USING MACHINE LEARNING Article Open access 11 March 2021 DETERMINANTS OF EFFICIENT WATER USE AND CONSERVATION IN THE COLOMBIAN MANUFACTURING
INDUSTRY USING MACHINE LEARNING Article Open access 02 January 2024 ADDRESSING GAPS IN DATA ON DRINKING WATER QUALITY THROUGH DATA INTEGRATION AND MACHINE LEARNING: EVIDENCE FROM ETHIOPIA
Article Open access 08 September 2023 DATA AVAILABILITY The raw data used in this analysis can be downloaded from the EPA’s ECHO website (https://echo.epa.gov/). The processed datasets are
also available with code at the Stanford Digital Repository (https://purl.stanford.edu/hr919hp5420). REFERENCES * Kleinberg, J., Ludwig, J., Mullainathan, S. & Obermeyer, Z. Prediction
policy problems. _Am. Econ. Rev._ 105, 491–495 (2015). Article Google Scholar * Athey, S. Beyond prediction: using big data for policy problems. _Science_ 355, 483–485 (2017). Article CAS
Google Scholar * Mullainathan, S. & Spiess, J. Machine learning: an applied econometric approach. _J. Econ. Pers._ 31, 87–106 (2017). Article Google Scholar * Kleinberg, J.,
Lakkaraju, H., Leskovec, J., Ludwig, J. & Mullainathan, S. Human decision and machine predictions. _Q. J. Econ._ 133, 237–293 (2018). Google Scholar * Kang, J. S., Kuznetsova, P., Luca,
M. & Choi, Y. Where not to eat? Improving public policy by predicting hygiene inspections using online reviews. In _Proc. 2013 Conference on_ _Empirical Methods in Natural Language
Processing_ 1443–1448 (Association for Computational Linguistics, 2013). * Chandler, D., Levitt, S. D. & List, J. A. Predicting and preventing shootings among at-risk youth. _Am. Econ.
Rev._ 101, 288–292 (2011). Article Google Scholar * O’Neil, C. _Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy_ (Broadway Books, New York, USA,
2016). * Blumenthal-Barby, J. S. & Krieger, H. Cognitive biases and heuristics in medical decision making. _Med. Decis. Making_ 35, 539–557 (2015). Article CAS Google Scholar *
Mullainathan, S. & Obermeyer, Z. Does machine learning automate moral hazard and error? _Am. Econ. Rev._ 107, 476–480 (2017). Article Google Scholar * Lund, L. C. _Clean Water Act
National Pollutant Discharge Elimination System Compliance Monitoring Strategy_ (United States Environmental Protection Agency, 2014);
https://www.epa.gov/sites/production/files/2013-09/documents/npdescms.pdf * Friesen, L. Targeting enforcement to improve compliance with environmental regulations. _J. Environ. Econ.
Manage._ 46, 72–85 (2003). Article Google Scholar * Rivers, L., Dempsey, T., Mitchell, J. & Gibbs, C. Environmental regulation and enforcement: structures, processes and the use of
data for fraud detection. _J. Environ. Assess. Pol. Manage._ 17, 1550033 (2015). Article Google Scholar * Glicksman, R. L., Markell, D. L. & Monteleoni, C. Technological innovation,
data analytics, and environmental enforcement. _Ecol. Law. Q._ 44, 41–88 (2017). Google Scholar * _NPDES Compliance Inspection Manual_ Interim Revised Version, January 2017 (United States
Environmental Protection Agency, 2017); https://www.epa.gov/sites/production/files/2017-01/documents/npdesinspect.pdf * _National Pollutant Discharge Elimination System (NPDES) Electronic
Reporting Rule_ (United States Environmental Protection Agency, 2015); https://www.gpo.gov/fdsys/pkg/FR-2015-10-22/pdf/2015-24954.pdf * Shimshack, J. P. & Ward, M. B. Enforcement and
over-compliance. _J. Environ. Econ. Manage._ 55, 90–105 (2008). Article Google Scholar * James, G., Witten, D., Hastie, T., & Tibshirani, R. _An Introduction to Statistical Learning_
(Springer, New York, USA, 2013). Google Scholar * Hastie, T., Tibshirani, R. & Friedman, J. _The Elements of Statistical Learning_. _Data Mining, Inference, and Prediction_ 2nd edn
(Springer, New York, USA, 2009). * Zliobaite, I. Fairness-aware machine learning: a perspective. Preprint at https://arxiv.org/abs/1708.00754 (2017). * _ICIS-NPDES Download Summary and Data
Element Dictionary_ (United States Environmental Protection Agency, 2017); https://echo.epa.gov/tools/data-downloads/icis-npdes-download-summary * R Development Core Team _R: A Language and
Environment for Statistical Computing_ (R Foundation for Statistical Computing, 2017). * _State Compliance Monitoring Expectations_ (United States Environmental Protection Agency, 2015);
https://echo.epa.gov/trends/comparative-maps-dashboards/state-compliance-monitoring-expectations Download references ACKNOWLEDGEMENTS We thank S. Athey, M. Burke, F. Burlig, K. Mach, A.
D’Agostino, C. Anderson, K. Green, S. Hasan, D. Jiménez, H. Kim, A. R. Siders and A. Stock for comments. E.B. receives funding from the National Science Foundation Graduate Research
Fellowship Program (DGE-114747), M.H. from the Department of Earth System Science at Stanford University, and N.B. from the Stanford Graduate Fellowship/David and Lucile Packard Foundation.
AUTHOR INFORMATION Author notes AUTHORS AND AFFILIATIONS * Stanford University, Stanford, CA, USA M. Hino, E. Benami & N. Brooks Authors * M. Hino View author publications You can also
search for this author inPubMed Google Scholar * E. Benami View author publications You can also search for this author inPubMed Google Scholar * N. Brooks View author publications You can
also search for this author inPubMed Google Scholar CONTRIBUTIONS All three authors collaboratively designed the study, developed the methodology, assembled the data, wrote the code,
performed the analysis, interpreted the results, and wrote the manuscript. E.B. and M.H. conducted the final analysis, with substantial input from N.B. CORRESPONDING AUTHOR Correspondence to
E. Benami. ETHICS DECLARATIONS COMPETING INTERESTS The authors declare no competing interests. ADDITIONAL INFORMATION PUBLISHER’S NOTE: Springer Nature remains neutral with regard to
jurisdictional claims in published maps and institutional affiliations. SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION Supplementary Note 1, Supplementary Figures 1–6, Supplementary
Tables 1–6, Supplementary References 1–4 RIGHTS AND PERMISSIONS Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Hino, M., Benami, E. & Brooks, N. Machine learning for
environmental monitoring. _Nat Sustain_ 1, 583–588 (2018). https://doi.org/10.1038/s41893-018-0142-9 Download citation * Received: 02 March 2018 * Accepted: 23 August 2018 * Published: 01
October 2018 * Issue Date: October 2018 * DOI: https://doi.org/10.1038/s41893-018-0142-9 SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get
shareable link Sorry, a shareable link is not currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative