Grinding Gears

Tales of code crunching from the FreeAgent Engineering team

Training Hugging Face models in parallel with SageMaker

Posted by Owen Turner on 2 February 2022

In this post I'll describe how we train our Hugging Face models in a fraction of the time with a data parallel approach.

➼ Read other posts about BERT or data science or hugging face or machine learning or SageMaker

Using data to understand the effects of a four day working week

Posted by Lana Turnbull on 27 October 2021

As you may have already heard, FreeAgent implemented a four-day week throughout the summer this year as a ‘thank you’ for the employees’ contributions during the pandemic and a year of working from home. Whilst Pat George’s blog post talks about measuring the success of the four-day week with “job satisfaction, business objectives and personal stress levels”, as an analyst, I wanted to see if I could find less anecdotal…

➼ Read other posts about 4 day week or analytics or api or data or hack-days or looker or matillion or okr or UX

Fine-Tuning BERT for multiclass categorisation with Amazon SageMaker

Posted by Owen Turner on 15 September 2021

This post describes our approach to fine-tuning a BERT model for multiclass categorisation with Hugging Face and Amazon SageMaker.

➼ Read other posts about AWS or BERT or fine-tuning or hugging face or machine learning or NLP or SageMaker or transaction categorisation or transfer learning

Bank Transaction Entity Detection with AWS Comprehend

Posted by Harry Tullett on 17 August 2021

Introduction: For the past year, FreeAgent has been running a machine learning model in production that categorises customer bank transactions. This model takes transaction descriptions and transaction amounts as inputs, and attempts to predict the corresponding accounting category. This summer, I joined the data science team with the more specific goal of increasing model generalisation, which would allow it to make predictions for a larger fraction of incoming transactions. One…

➼ Read other posts about AWS or AWS Comprehend or Entity Detection or machine learning or NLP or pre-processing or SVM

Control freaks: how we built our own version control for Matillion

Posted by Lea Whitelaw on 27 April 2021

We’ve all been there - it’s a couple of days or even the night before an important deadline and one of the following happens:

  • You lose your memory stick with all your work on it and scramble to recreate it but can’t get it back to the way it was.
  • You make some more ‘advancements’ to your work and suddenly everything is broken and you can’t seem to get it back…

➼ Read other posts about analytics or data or etl or git or github or matillion or python or understanding data or version control

Shopping for data: How thinking about supermarkets might help you to manage your Looker implementation

Posted by Rob Brown on 26 March 2021

This post compares a well-run supermarket with a well-designed Looker implementation.

➼ Read other posts about data or looker

Managing Python dependencies across multiple Data Science projects with Poetry

Posted by David Edwards on 5 March 2021

Python is the programming language of choice for running analysis, building models and running machine learning services in production for the Data Science team at FreeAgent. A key reason we chose Python is the great ecosystem of packages available: NumPy, pandas, SciPy and scikit-learn, deep learning frameworks like TensorFlow and more bespoke options for specific tasks like Click for developing CLIs. This wealth of options is a great strength of…

➼ Read other posts about data science or python

Answering bigger questions with BigQuery

Posted by Jack Gladas on 1 December 2020

Over the past few weeks, we’ve configured BigQuery to enable us to combine our Google Analytics (GA) front-end data with our internal back-end data. In this post I’m going to talk about why we needed to do this, how we went about it and what we are hoping to achieve as a result. What’s the problem? Historically, two separate systems have been used at FreeAgent to track, store and analyse…

➼ Read other posts about analytics or bi or bigquery or data or GA or redshift

Empowering our practices: Take 2

Posted by Lea Whitelaw on 30 October 2020

‘How can we use our data to understand and empower our accountancy practice partners?’ This is the question that I posed in my previous blog about delivering practice insights last summer. This question was the foundation for my project last year and is the one I came back to this year. To recap how the project was left at the end of last summer: I had created three…

➼ Read other posts about accountancy practice or analytics or bi or insights or internship or looker

Can’t someone else do it? Automating Looker Housekeeping in Two Days

Posted by Rob Brown on 2 October 2020

This post discusses how, through successful planning and design, we were able to automate some Looker housekeeping tasks in less than two days.

➼ Read other posts about looker or matillion or planning or technical-debt
