Glossaries#

Tools and Technology (general)#

Binder A free, public service for running reproducible interactive computing environments. Binder is a 100% open source infrastructure that is run by members of the Jupyter community. The underlying technology behind the Binder project is BinderHub.

BinderHub The underlying technology of mybinder.org, BinderHub is an open source tool that utilizes a JupyterHub`= in order to provide live, reproducible interactive computing environments that users define on GitHub.

Conda Package, dependency and environment management for any language—Python, R, Ruby, Lua, Scala, Java, JavaScript, C/ C++, FORTRAN, and more.

Docker Docker provides the ability to package and run an application in a loosely isolated environment called a container. It is widely used for creating reproducible software environments to run code on different computers.

Git A popular version control system that is used in many open source software projects to manage their software code base.

GitHub Provider of Internet hosting for software development and distributed version control using the “git” command line tool.

Hackweek Participant-driven events that strive to create welcoming spaces to learn new things, build community and gain hands-on experience with collaboration and team science.

Project Jupyter Project Jupyter (name derived from “JUlia PYThon and R”) exists to develop open-source software, open-standards, and services for interactive computing across dozens of programming languages.

Jupyter Book Jupyter Book is an open source project for building beautiful, publication-quality books and documents from computational material.

JupyterHub A core open source tool from the Jupyter community, JupyterHub allows you to deploy an application that provides remote data science environments to multiple users. It can be deployed in the cloud, or on your own hardware.

JupyterLab JupyterLab is the next-generation web-based user interface for Project Jupyter intended to replace the JupyterNotebook interface.

Jupyter Notebook open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text.

Machine Learning Models A machine learning model is a program that can find patterns or make decisions from a previously unseen dataset.

AI Artificial intelligence, or AI, is technology that enables computers and machines to simulate human intelligence and problem-solving capabilities.

Organizations#

D4 Disasters, Demography, Disparities, and Decisions.

CSDE
Center for Studies in Demography and Ecology

NOAA National Oceanic and Atmospheric Administration

AI2ES NSF AI Institute for Research on Trustworthy AI in Weather, Climate, and Coastal Oceanography

Datasets#

SHELDUS County-level hazard data set for the U.S. and covers natural hazards such thunderstorms, hurricanes, floods, wildfires, and tornados as well as perils such as flash floods, heavy rainfall, etc.

EM-Dat EM-DAT contains data on the occurrence and impacts of over 26,000 mass disasters worldwide from 1900 to the present day. The database is compiled from various sources, including UN agencies, non-governmental organizations, reinsurance companies, research institutes, and press agencies.

FEMA Federal Emergency Management Agency

ACS The American Community Survey (ACS) releases new data every year through a variety of data tables that you can access with different data tools.

UWDC The UW Data Collaborative (UWDC) provides NIST 800-171 aligned computer infrastructure to harness innovative, but hard-to-access and highly sensitive data for the development of novel, high-quality research and evidence-driven policy making.

NWFSRDC The Northwest Federal Statistical Research Data Centers (NWFSRDC) is the Northwest FSRDC, which are partnerships between federal statistical agencies and leading research institutions. FSRDCs provide secure environments supporting qualified researchers using restricted-access data while protecting respondent confidentiality