Data Engineering

Alex Bush

Alex Bush

A Hadoop specialist with a penchant for DevOps practices

Data Engineering posts (0)

Data Science

David Asboth

David Asboth

Data Scientist, former Software Developer, occasional content creator

Michael O'Rourke

Michael O'Rourke
Data Science posts (3)

Data Engineering

Back to Top ↑

Data Science

To accuracy… and beyond

To accuracy… and beyond

In Data Science, when you learn about how to predict the future with machine learning, you are presented with an optimisation problem. The central focus of machine learning, at least the subset called “supervised learning”, is to ask “what is the best way to predict Y using X?” where “best” has a specific meaning. When you have past examples to learn from, “best” is some sort of measure of accuracy. How well did this particular model predict Y using X, when we can check it against past examples? Whether this is a classification task, where you’re trying to put things into the right category, or regression, where you are trying to get as close to a continuous number as you can, the higher the accuracy the better your model. There are ways to guard against complacency to ensure you’re not just learning the particulars of your dataset, but ultimately you are trying to get as high an accuracy score as possible.

3 min read

30 years of change in engines

30 years of change in engines

Engine design has changed a great deal since the Model T. Forced induction in 1920s increased engine power without increasing the size; fuel injection in the 1980s increased fuel efficiency and made carburettors a thing of the past; and more recently, the introduction of hybrid engines has massively reduced CO2 emissions.

5 min read

From Laptop to Cluster: Our Journey to Big Data

From Laptop to Cluster: Our Journey to Big Data

As the world accumulates data, more and more data science teams will be faced with the problem that their data no longer fits into their conventional workflow. Creating data-driven systems with hundreds of millions of rows of data requires a different set of tools to the ones statisticians, and even software developers, have been using in recent decades. How do we at Cox Automotive envisage moving from laptop-based analyses to big data systems?

5 min read
Back to Top ↑