Skip to content
Home » Data Digest #1

Data Digest #1

Author: Kenton

Hello all! As part of the main aim of raising awareness of data science and developing talent at Imperial College, we’re trying to launch our blog!

This is the first issue of Data Digest, which we hope to do weekly! If you have read something interesting and would like to share it, please post it (preferably along with why you enjoyed it) in our Teams ‘Learning Resources’ page and we’ll make sure to send it around.


Data Science Article of the Week

Avoiding Shortcut Solutions in Artificial Intelligence

A new method forces a machine learning model to focus on more data when learning a task, which leads to more reliable predictions.


Credits: Image: Jose-Luis Olivares, MIT, with photo from iStockphoto

Challenges of the Week

For the coding problems and datasets, feel free to use our Teams discussion to discuss solutions or obtain help!

Coding Problem of the Week

You might encounter coding problems when attending Data Science interviews. This is the problem of the week which we encourage everyone to have a go at!

Longest Substring Without Repeating Characters – LeetCode

Dataset of the Week

COP26 is going on now! We thought it might be cool to showcase some datasets related to climate.

  • Task: can you make some data visualisations of the climate?
  • Advanced: what trends can you spot (in space/ in time/ between variables)? Can you make some statistical claims?

Weather Data in the UK, by station (Met Office)

Global Weather Data (Berkeley Earth)

Image: COP26 Logo

Book Recommendation

Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition by Aurélien Géron (2019)

This is one of the best books for learning the foundations of Data Science! What I like about it is that it has exercises at the ends of each chapter which encourages you to think and actually do 🙂

Book Cover of HOML

You can find it online and also in the Imperial Library.