All posts tagged with 'data'
Using API Gateway, Lambda, SageMaker and DynamoDB to build a categorisation service in AWS
I’ve talked previously about the value of combining rules-based and machine learning approaches to categorisation. In short, rules-based approaches make it easy to do customer-level personalisation that complements a machine learning model trained to find patterns across customers. In this post I’ll talk about how we used AWS to build an expense categorisation service that combines machine learning with a rules-based approach. This service forms part of the Smart Capture… Continue reading
Combining data from different sources with SageMaker pipelines
Generating datasets for machine learning Preparing data and generating datasets is a crucial step to train a machine learning model. If you are lucky your data might come from a single .csv file. However in most cases pulling together the input features to train your machine learning model will require combining datasets from different sources. Combining data from different sources manually can be a time consuming process, prone to errors. … Continue reading
Challenge Accepted: Our Weekly Looker Challenge
We wanted to improve our stakeholders’ Looker skills Here at FreeAgent we use Looker as our business intelligence tool. It’s used by over 150 stakeholders across the organisation, of which over 100 are active on a monthly basis. To unlock Looker’s full potential, we’d like those stakeholders to be better equipped to explore the data using Looker’s range of features, rather than simply viewing a chart that somebody else has… Continue reading
The Data Science Internship Chronicles: A Starfleet-worthy Tale of Numeric Exploration
In the vast expanse of the universe, I, a humble data science intern, set out on a mission to improve a classification model. As I delved deeper into the data, I encountered anomalies and outliers that threatened to disrupt my analysis. But with the guidance of my mentors and the help of advanced data tools, I navigated through the stars and uncovered the hidden patterns that led to breakthrough insights.… Continue reading
How we structure our data teams at FreeAgent
Since joining FreeAgent back in April I’ve been both impressed and interested with how the Data organisation is structured. I’ve come from an enterprise world where you have lots of Data Engineers, a team of dedicated Data Architects and a separate Business Intelligence org. A few things that immediately struck me at FreeAgent were: No one has the title ‘Data Engineer’Data Analytics are part of the Engineering orgNo one has… Continue reading
10 tips and tricks for a good intern job application
We’ve recently completed the hiring for this year's cohort of summer interns and as a result we’ve processed hundreds of applications; just across our data teams we had over 600 this year! For many of these applicants, who tend to still be at university or have recently graduated, this is likely to be one of the first ‘professional’ roles that they are applying for. As such they might not be… Continue reading
The three skills you need to be a Data Analyst, and how to get them – Part 3
In the third blog in the series, we discuss the skills you need to become a Data Analyst, this time covering Data Evanglism. Continue reading
The three skills you need to be a Data Analyst, and how to get them – Part 2
In the second blog in the series, we discuss the skills you need to become a Data Analyst, this time covering Data Analysis. Continue reading
The three skills you need to be a Data Analyst, and how to get them – Part 1
In the first blog in the series, we discuss the skills you need to become a Data Analyst, this time covering Data Engineering. Continue reading
The legendary data warehouse application
In spring 2014, I created an internal Ruby on Rails app called data-warehouse. In this blog post I’m going to describe what it was, how it came to be and why, against what would appear to be all reasonable logic, it became the central part of our data infrastructure for about eight years. Continue reading