Course unit details:
Earth and Environmental Data Science
Unit code | EART60702 |
---|---|
Credit rating | 15 |
Unit level | FHEQ level 7 – master's degree or fourth year of an integrated master's degree |
Teaching period(s) | Semester 2 |
Available as a free choice unit? | No |
Overview
In this hands-on course unit, students will develop their skills in understanding the concepts behind Earth and Environmental Data Science, with a focus on using Python programming to analyze and visualize environmental data. Students will learn data curation skills and explore topics such as air quality and climate change. Throughout the course, students will work with Python programming concepts and packages including xarray, scikit-learn, and PyTorch. Students will also have the opportunity to learn about computer clusters and high-performance computing using university resources Three projects are designed to help students improve their programming skills and gain a deeper understanding of Earth and Environmental Science concepts, with group work highly encouraged. Students are expected to manage the use of GitHub for conducting these projects and collaborating with their peers on GitHub. A scientific literature presentation is designed to help students comprehend cutting-edge data science applications in Earth and Environmental Sciences.
By the end of the course, students will have the knowledge and tools necessary to solve realistic problems in Earth and Environmental Sciences using data-driven approaches.
Aims
Provide an understanding of important aspects of Earth and Environmental Data Science which are often overlooked in typical Data Science or Earth and Environmental Sciences courses. This unit provides a firm foundation and hands-on experience in data curation, data analytics, data visualization, and computing for Earth and Environmental applications.
Learning outcomes
On the successful completion of the course, students will be able to: | Developed | Assessed | |
---|---|---|---|
ILO 1 | Describe data curation, data analytics, data visualization, and computing methods and tools using industry standards in coding and curation practices. | ||
ILO 2 | evelop the skills to find and manage Earth and Environmental data (e.g., weather and climate data) from a range of sources and process data to explore and answer questions related to environment, weather and climate. | ||
ILO 3 | Develop an awareness of open science and open source communities and how to contribute to them by creating fully transparent and reproducible open-source data science projects. | ||
ILO 4 | Perform exploratory data analysis and use visualization to enhance interpretation of data, including maps and interactive visualizations. | ||
ILO 5 | Construct complete, well-structured programs in Python and practice reproducible research. |
Syllabus
Week 1:
- Introduction to Earth and Environmental Data Science
- Introduction to JupyterLab and Git Fundamenental
- Review of Python: NumPy, Pandas, and Basic Operation of Environmental Datasets
Week 2:
- Data Analytics I: Dataframe Computation, Time and Date Functionality (e.g. NumPy and Pandas)
- High level multidimensional gridded data using Xarray
Week 3:
- Data Analytics II: Review of Statistics
- Data Visualization: Principle and Tools (e.g. Matplotlib, Seaborn)
Week 4:
- Reproducible Research (e.g. Binder, Markdown)
- Project 1 (data visualization + data analytics) Presentation
Week 5:
- Introduction to Unix
- Managing Python Environments
- Git Advanced
Week 6:
- Online Repository of Earth and Environmental Science Data
- Remote Sensing Data: Google Earth Engine and geemap
- Climate Data: Climate Change Service
Week 7:
- Supervised Learning and Automated Machine Learning
- Unsupervised Learning
- Applications of Machine Learning in Earth and Environmental Sciences
Week 8:
- Organization and Packaging of Python Projects
- Advanced Data Visualization (e.g. Cartopy)
Week 9:
- Project 2 (reproducible research) Presentation
Week 10:
- A Taste of Deep Learning
Week 11:
- High-Performance Computing and Dask for Parallel Computing
- Scientific Literature Presentation (Machine Learning for Earth and Environmental Science session I)
Week 12:
- Summary
- Scientific Literature Presentation (Machine Learning for Earth and Environmental Science session II)
Teaching and learning methods
Other Scheduled teaching and learning activities:
- Revision workshops/surgeries
- Online discussions/tutorials
- Meetings with Academic Advisers
- Field trips
Assessment methods
Method | Weight |
---|---|
Other | 35% |
Report | 30% |
Project output (not diss/n) | 35% |
Feedback methods
Assessment task | Length | How and when feedback is provided | Weighting within unit (if relevant) |
---|---|---|---|
Data Science Projects and Presentations | Project 1 and 2 Submit code, Group presentation (10 minutes) Individual reflective write up (Max 500 words) Literature review Individual presentation (10 minutes) | Feedback from the instructor and peers (each group) | Project 1 35% Project 2 35% Literature Review 30% |
Recommended reading
- An Introduction to Earth and Environmental Data Science: https://earth-env-data-science.github.io/intro.html
- The Environmental Data Science Book: https://the-environmental-ds-book.netlify.app/, GitHub Repo: https://github.com/alan-turing-institute/environmental-ds-book
- Introduction to Earth Data Science Textbook: https://www.earthdatascience.org/courses/intro-to-earth-data-science/
- Use Data for Earth and Environmental Science in Open Source Python:
https://www.earthdatascience.org/courses/use-data-open-source-python/ - Python for Probability, Statistics, and Machine Learning (3rd Edition):
https://link.springer.com/book/10.1007/978-3-030-18545-9
Teaching staff
Staff member | Role |
---|---|
Zhonghua Zheng | Unit coordinator |