{"id":13,"date":"2022-08-18T23:13:37","date_gmt":"2022-08-18T23:13:37","guid":{"rendered":"https:\/\/libraries.mit.edu\/opendata\/?page_id=13"},"modified":"2025-10-20T19:50:13","modified_gmt":"2025-10-20T19:50:13","slug":"mit-prize","status":"publish","type":"page","link":"https:\/\/libraries.mit.edu\/opendata\/open-data-mit-home\/mit-prize\/","title":{"rendered":"MIT Prize for Open Data"},"content":{"rendered":"<p><span style=\"font-weight: 400\">To highlight the value of open data at MIT, and to encourage the next generation of researchers, the MIT School of Science and the MIT Libraries present the MIT Prize for Open Data.<\/span><\/p>\n<p><span style=\"font-weight: 400\"><strong>Congratulations to the recipients of the 2025 MIT Prize for Open Data!<\/strong><br \/>\nThe following winners and honorable mentions were selected from more than 60 nominees representing 30 different departments, labs, centers, and institutes across MIT. Join us as we honor them at the <a href=\"https:\/\/calendar.mit.edu\/event\/copy-of-open-data-mit-1723\">Open Data @ MIT event<\/a> held Oct. 21 at Hayden Library.<\/span><\/p>\n<h1>Winners<\/h1>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\"><strong>Lucas Attia<\/strong>, graduate student, Chemical Engineering; <strong>Jackson Burns<\/strong>, graduate student, Chemical Engineering; <strong>Patrick S. Doyle,<\/strong> Robert T Haslam (1911) Professor in Chemical Engineering; and <strong>William H. 
Green<\/strong>, Hoyt Hottel Professor in Chemical Engineering<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"http:\/\/fastsolv.mit.edu\/\"><span style=\"font-weight: 400\">Fastsolv<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/a><span style=\"font-weight: 400\">The team leveraged nearly 50,000 published experiments to develop <\/span><a href=\"https:\/\/www.nature.com\/articles\/s41467-025-62717-7\"><span style=\"font-weight: 400\">fastsolv<\/span><\/a><span style=\"font-weight: 400\">, an open-source deep learning model for organic solubility prediction. Fastsolv is freely available online and has been called by user scientists over 9,000 times since publication.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\"><strong>Timur Cinay<\/strong>, graduate student, Earth, Atmospheric, and Planetary Sciences<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/www.bco-dmo.org\/dataset\/917743\"><span style=\"font-weight: 400\">Galapagos Emissions Monitoring Station<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/a><span style=\"font-weight: 400\">First-of-its-kind continuous <\/span><a href=\"https:\/\/www.bco-dmo.org\/dataset\/917743\"><span style=\"font-weight: 400\">dataset<\/span><\/a><span style=\"font-weight: 400\"> monitoring ocean emissions of the greenhouse gas nitrous oxide, made completely free and openly available to all researchers globally.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\"><strong>Edgar Costa<\/strong>, research scientist, Mathematics<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/www.lmfdb.org\/\"><span style=\"font-weight: 400\">The L-functions and modular forms database (LMFDB)<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/a><span style=\"font-weight: 400\">The LMFDB is a database of mathematical objects arising in number theory and arithmetic geometry that 
illustrates some of the mathematical connections predicted by the Langlands program.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\"><strong>Danika Eamer<\/strong>, postdoctoral Impact Fellow, MIT Climate &amp; Sustainability Consortium; <strong>Micah Borrero<\/strong>, PhD student, Aerospace Engineering, University of Michigan; <strong>Brooke Bao<\/strong>, undergraduate student, Wellesley College\/Dartmouth College; <strong>Helena De Figueiredo Valente<\/strong>, undergraduate student, Mechanical Engineering; <strong>Amber Wu<\/strong>, undergraduate student, Computer Science, Wellesley College; <strong>Brilant Kasami<\/strong>, MCSC software consultant; and <strong>Viktoriia Tkachuk<\/strong>, UX\/UI designer<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/impactclimate.mit.edu\/geospatial-decision-support-tool\/\"><span style=\"font-weight: 400\">Geospatial Trucking Industry Decarbonization Explorer (Geo-TIDE)<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/a><span style=\"font-weight: 400\">Geo-TIDE is an open data platform that synthesizes fragmented public datasets into more than 400 curated, cloud-hosted geospatial layers for freight decarbonization planning. 
By making these high-value datasets openly available through Zenodo and Amazon Web Services, and pairing them with open-source code and documented methods, Geo-TIDE enables fleets, policymakers, and researchers to translate complex data into actionable strategies for zero-emission trucking.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\"><strong>Connor Makowski<\/strong>, research associate, Computational Analytics, Visualization &amp; Education (CAVE) Lab and <\/span><span style=\"font-weight: 400\">MIT MicroMasters SCx; <strong>Tim Russell<\/strong>, research engineer, MIT CAVE and MIT Humanitarian Supply Chain Lab; <strong>Willem Guter<\/strong>, research engineer, MIT CAVE and MIT Intelligent Logistics Systems; <strong>Austin Saragih<\/strong>, PhD candidate, MIT Center for Transportation and Logistics; <strong>Arne Heinold<\/strong>, Assistant Professor for Transportation, K\u00fchne Logistics University; and <strong>Spyridon Lekkakos<\/strong>, Professor of Supply Chain Management, MIT-Zaragoza International Logistics Program<br \/>\n<\/span><a href=\"https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=5388845\"><span style=\"font-weight: 400\">SCGraph<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/a><a href=\"https:\/\/github.com\/connor-makowski\/scgraph\"><span style=\"font-weight: 400\">SCGraph<\/span><\/a><span style=\"font-weight: 400\"> is an open-source Python package that transforms scattered open transportation datasets into clean, ready-to-use geographic networks for research and real-world analysis. 
With over 3.3k monthly downloads and adoption in multiple research projects, it shows how open data can be creatively synthesized into tools with broad impact.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\"><strong>Nada Tarkhan<\/strong>, graduate student, Architecture, and <strong>Paolo Giani<\/strong>, postdoctoral associate, Earth, Atmospheric and Planetary Sciences<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/github.com\/Nadatarkhan\/RMY.git\"><span style=\"font-weight: 400\">Extreme-Aware Meteorological Years: Open Weather Data for Climate-Resilient Building Simulations<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/a><span style=\"font-weight: 400\">This <\/span><a href=\"https:\/\/www.tandfonline.com\/doi\/full\/10.1080\/19401493.2025.2499687\"><span style=\"font-weight: 400\">project<\/span><\/a><span style=\"font-weight: 400\"> introduces open-source Representative and Future Meteorological Years (RMYs and FRMYs)\u2014novel weather file formats that embed extreme events into building simulation workflows using anomaly detection and climate model emulators. Designed for global scalability and resilience planning, they enable realistic assessments of overheating, peak loads, and future risk across diverse global locations.<\/span><\/li>\n<li><span style=\"font-weight: 400\"><strong>Jonathan Zheng<\/strong>, graduate student, Chemical Engineering<\/span><span style=\"font-weight: 400\">; <strong>Ivo Leito<\/strong>, professor of analytical chemistry, University of Tartu, Estonia; and <\/span><strong>William H. 
Green<\/strong>, Hoyt Hottel Professor in Chemical Engineering<span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/pubs.acs.org\/doi\/10.1021\/acs.jcim.4c01420\"><span style=\"font-weight: 400\">Widespread misinterpretation of pKa terminology for zwitterionic compounds and its consequences<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/a><span style=\"font-weight: 400\">Due to an unfortunate misinterpretation of chemical data, a widely used biochemical dataset, <\/span><a href=\"https:\/\/www.ebi.ac.uk\/chembl\/\"><span style=\"font-weight: 400\">ChEMBL<\/span><\/a><span style=\"font-weight: 400\">, contains many incorrect values, negatively affecting its applications, including drug design and organic chemistry. <\/span><a href=\"https:\/\/www.chemistryworld.com\/news\/incorrect-pka-values-have-slipped-into-chemical-databases-and-could-distort-drug-design\/4020661.article\"><span style=\"font-weight: 400\">This work<\/span><\/a><span style=\"font-weight: 400\"> explained the reasons for the error, examined the downstream repercussions, and made recommendations for data curation to avoid these issues in the future.<\/span><\/li>\n<\/ul>\n<h1>Honorable Mentions<\/h1>\n<ul>\n<li style=\"font-weight: 400\"><strong>Jeroen Audenaert and the Multimodal Universe Team<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/github.com\/MultimodalUniverse\/MultimodalUniverse\"><span style=\"font-weight: 400\">Multimodal Universe: Enabling Large-Scale Machine Learning with 100TBs of Astronomical Scientific Data<\/span><\/a><\/li>\n<li><strong>CAVE App team: <\/strong>Matthias Winkenbach, Tim Russell, Connor Makowski, Luis Vazquez, Willem Guter, Alice Zhao, Ella Wang<br \/>\n<a href=\"https:\/\/github.com\/MIT-CAVE\/cave_app\">CAVE App<\/a><\/li>\n<li style=\"font-weight: 400\"><strong>Yu-Chen (Janice) Chen<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><span style=\"font-weight: 400\">Reviving ALEPH: Modern, 
Validated Open Data from CERN\u2019s LEP for New QCD Tests<\/span><\/li>\n<li style=\"font-weight: 400\"><strong>Evan Collins<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/lnpdb.molcube.com\/\"><span style=\"font-weight: 400\">LNPDB (Lipid Nanoparticle Database)<\/span><\/a><\/li>\n<li style=\"font-weight: 400\"><strong>Matteo Di Bernard<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/2025.05.26.656231v1\"><span style=\"font-weight: 400\">Brieflow: An Integrated Computational Pipeline for High-Throughput Analysis of Optical Pooled Screening Data<\/span><\/a><\/li>\n<li style=\"font-weight: 400\"><strong>Lelia Hampton<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><span style=\"font-weight: 400\">Targeted urban afforestation can substantially reduce income-based heat disparities in U.S. cities (<\/span><a href=\"https:\/\/doi.org\/10.5281\/zenodo.16921611\"><span style=\"font-weight: 400\">Zenodo<\/span><\/a><span style=\"font-weight: 400\">; <\/span><a href=\"https:\/\/github.com\/LeliaPlusPlus\/TargetedHeatMitigation-ML\"><span style=\"font-weight: 400\">GitHub<\/span><\/a><span style=\"font-weight: 400\">)<\/span><\/li>\n<li style=\"font-weight: 400\"><strong>Margaret Hughes, Cassandra Overney<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/v2v-jamaicaplan.ccc-mit.org\/\"><span style=\"font-weight: 400\">Voice to Vision<\/span><\/a><\/li>\n<li style=\"font-weight: 400\"><strong>Sarah Mokhtar, Caitlin Mueller<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><a href=\"https:\/\/dataverse.harvard.edu\/previewurl.xhtml?token=57c1017c-2ff4-4b78-8f3e-4608b3ccb5ea\"><span style=\"font-weight: 400\">PRISM: A Multi-modal Dataset for Learning-based Building Performance Modeling<\/span><\/a><\/li>\n<li style=\"font-weight: 400\"><strong>William Parker<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><a 
href=\"https:\/\/github.com\/wparker781\/REACT\"><span style=\"font-weight: 400\">Space Debris as a Sensor for Earth&#8217;s Upper Atmosphere<\/span><\/a><\/li>\n<li style=\"font-weight: 400\"><strong>Ci Xue<\/strong><span style=\"font-weight: 400\"><br \/>\n<\/span><span style=\"font-weight: 400\">The GOTHAM Project: Open-Sourcing Interstellar Chemistry (<\/span><a href=\"https:\/\/greenbankobservatory.org\/portal\/gbt\/gbt-legacy-archive\/gotham-data\/\"><span style=\"font-weight: 400\">Observation dataset<\/span><\/a><span style=\"font-weight: 400\">; <\/span><a href=\"https:\/\/doi.org\/10.7910\/DVN\/QCRWV7\"><span style=\"font-weight: 400\">molecular census dataset<\/span><\/a><span style=\"font-weight: 400\">)<\/span><\/li>\n<\/ul>\n<h2><b>2025 Committee<\/b><b><br \/>\n<\/b><\/h2>\n<p><b>Committee Co-Chairs<\/b><\/p>\n<ul>\n<li>Chris Bourg, Director, MIT Libraries<\/li>\n<li>Rebecca Saxe, Associate Dean of Science, School of Science (SoS)<\/li>\n<\/ul>\n<p><b>Committee Members<\/b><\/p>\n<ul>\n<li>Awad Abdelhamid, assistant director of research, Urban Mobility and Transit Labs<\/li>\n<li>Paul Berube, research scientist, Civil and Environmental Engineering<\/li>\n<li>Jerik Cruz, graduate student, Political Science<\/li>\n<li>Yifu Ding, post-doctoral research associate, MIT Energy Initiative<\/li>\n<li>Steve Flavell, Associate Professor, Picower Institute for Learning &amp; Memory and Department of Brain and Cognitive Sciences<\/li>\n<li>Satrajit Ghosh, Director of the Open Data in Neuroscience Initiative, McGovern Institute, and Director of Data Models and Integration, ReproNim<\/li>\n<li>Rafael Jaramillo, Thomas Lord Career Development Professor, Associate Professor of Materials Science and Engineering<\/li>\n<li>Stuart Levine, Director, MIT BioMicro Center<\/li>\n<li>Peace Ossom, Director of Research Data Services, MIT Libraries<\/li>\n<li>Tom Pollard, research scientist, Laboratory for Computational Physiology<\/li>\n<li>Sadie Roosa, Collections Strategist 
for Repository Services, MIT Libraries<\/li>\n<li>Virginia Spanoudaki, Scientific Director, Preclinical Imaging and Testing, Koch Institute<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p><em>Co-sponsored by the MIT School of Science and MIT Libraries<\/em><\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft wp-image-234\" src=\"https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2024\/09\/SoS_sub-brand_lockup_two-line_cmyk_bright-red-copy-300x68.jpeg\" alt=\"MIT School of Science logo\" width=\"150\" height=\"34\" srcset=\"https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2024\/09\/SoS_sub-brand_lockup_two-line_cmyk_bright-red-copy-300x68.jpeg 300w, https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2024\/09\/SoS_sub-brand_lockup_two-line_cmyk_bright-red-copy-768x174.jpeg 768w, https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2024\/09\/SoS_sub-brand_lockup_two-line_cmyk_bright-red-copy-624x141.jpeg 624w, https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2024\/09\/SoS_sub-brand_lockup_two-line_cmyk_bright-red-copy.jpeg 1000w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/>\u00a0 \u00a0 <img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-32\" src=\"http:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2022\/08\/MIT_LIB_LOGO_RED_RGB_P-150x150.png\" alt=\"MIT Libraries logo\" width=\"100\" height=\"122\" srcset=\"https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2022\/08\/MIT_LIB_LOGO_RED_RGB_P-246x300.png 246w, https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2022\/08\/MIT_LIB_LOGO_RED_RGB_P-838x1024.png 838w, https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2022\/08\/MIT_LIB_LOGO_RED_RGB_P-768x938.png 768w, https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2022\/08\/MIT_LIB_LOGO_RED_RGB_P-624x762.png 624w, https:\/\/libraries.mit.edu\/app\/uploads\/sites\/20\/2022\/08\/MIT_LIB_LOGO_RED_RGB_P.png 924w\" sizes=\"auto, (max-width: 100px) 100vw, 100px\" 
\/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>To highlight the value of open data at MIT, and to encourage the next generation of researchers, the MIT School of Science and the MIT Libraries present the MIT Prize for Open Data. Congratulations to the recipients of the 2025 MIT Prize for Open Data! The following winners and honorable mentions were selected from more than 60 nominees representing 30 different departments, labs, centers, and institutes across MIT. Join us as we honor them at the Open Data @ MIT event held Oct. 21 at Hayden Library. Winners Lucas Attia, graduate student, Chemical Engineering; Jackson Burns, graduate student, Chemical Engineering; [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":4,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"templates\/page.php","meta":{"footnotes":""},"class_list":["post-13","page","type-page","status-publish","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/pages\/13","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/comments?post=13"}],"version-history":[{"count":127,"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/pages\/13\/revisions"}],"predecessor-version":[{"id":316,"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/pages\/13\/revisions\/316"}],"up":[{"embeddable":true,"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/pages\/4"}],"wp:attachment":[{"href":"https:\/\/libraries.mit.edu\/opendata\/wp-json\/wp\/v2\/media?parent=13"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated"
:true}]}}