Author: Kenton
Hello all! As part of our main aim of raising awareness of data science and developing talent at Imperial College, we’re launching our blog!
This is the first issue of Data Digest, which we hope to publish weekly! If you have read something interesting and would like to share it, please post it (preferably along with why you enjoyed it) on our Teams ‘Learning Resources’ page and we’ll make sure to send it around.
Data Science Article of the Week
Avoiding Shortcut Solutions in Artificial Intelligence
A new method forces a machine learning model to focus on more data when learning a task, which leads to more reliable predictions.
Challenges of the Week
For the coding problems and datasets, feel free to use our Teams discussion to share solutions or get help!
Coding Problem of the Week
You might encounter coding problems when attending Data Science interviews. This is the problem of the week which we encourage everyone to have a go at!
Longest Substring Without Repeating Characters – LeetCode
Dataset of the Week
COP26 is going on now! We thought it might be cool to showcase some datasets related to climate.
- Task: can you make some data visualisations of the climate?
- Advanced: what trends can you spot (in space, in time, or between variables)? Can you make some statistical claims?
Weather Data in the UK, by station (Met Office)
Global Weather Data (Berkeley Earth)
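If you are not sure where to begin with the "Advanced" task, here is a minimal sketch of one possible starting point: fitting a straight-line trend to a yearly temperature series using ordinary least squares. The function name `linear_trend` and the numbers below are made up purely for illustration (they are not taken from the Met Office or Berkeley Earth datasets), so swap in values you have loaded from whichever dataset you choose.

```python
def linear_trend(xs, ys):
    """Ordinary least-squares fit of ys = slope * xs + intercept.

    Hypothetical helper for illustration; returns (slope, intercept).
    """
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two or more points, with xs and ys of equal length")

    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Sum of squared deviations in x, and of cross deviations between x and y
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical, so no trend can be fitted")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Placeholder values only, NOT real measurements
years = [2000, 2001, 2002, 2003, 2004]
temps = [9.0, 9.5, 10.0, 10.5, 11.0]

slope, intercept = linear_trend(years, temps)
print(f"Trend: {slope:.2f} degrees C per year")
```

A fitted slope on its own is not yet a statistical claim. A natural next step is to ask how uncertain that slope is, for example by looking at the residuals or computing a confidence interval.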
Book Recommendation
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition by Aurélien Géron (2019)
This is one of the best books for learning the foundations of Data Science! What I like about it is that it has exercises at the end of each chapter, which encourage you to think and actually do 🙂
You can find it online and also in the Imperial Library.