• Contact

  • Newsletter

  • About us

  • Delivery options

  • Prospero Book Market Podcast

  • Between the Spreadsheets: Classifying and Fixing Dirty Data

    Between the Spreadsheets by Walsh, Susan;

    Classifying and Fixing Dirty Data

      • GET 10% OFF

      • The discount is only available for 'Alert of Favourite Topics' newsletter recipients.
      • Publisher's listprice GBP 36.99
      • The price is estimated because at the time of ordering we do not know what conversion rates will apply to HUF / product currency when the book arrives. In case HUF is weaker, the price increases slightly, in case HUF is stronger, the price goes lower slightly.

        17 671 Ft (16 830 Ft + 5% VAT)
      • Discount 10% (cc. 1 767 Ft off)
      • Discounted price 15 904 Ft (15 147 Ft + 5% VAT)

    17 671 Ft

    db

    Availability

    Not yet published.

    Why don't you give exact delivery time?

    Delivery time is estimated on our previous experiences. We give estimations only, because we order from outside Hungary, and the delivery time mainly depends on how quickly the publisher supplies the book. Faster or slower deliveries both happen, but we do our best to supply as quickly as possible.

    Product details:

    • Edition number Second Edition, New edition
    • Publisher Facet Publishing
    • Date of Publication 2 October 2025

    • ISBN 9781783307845
    • Binding Paperback
    • No. of pages216 pages
    • Size 234x156x4 mm
    • Weight 454 g
    • Language English
    • 700

    Categories

    Short description:

    Everyone talks about data quality issues, but not the consequences. From the top to the bottom of an organisation, everyone should understand the impact of dirty data and how to spot it. Being an entirely revised new edition, this book will show you how.

    More

    Long description:

    ‘Clear, concise, engaging and entertaining. Highly recommended for anyone involved with data in any capacity.' Information Professional

    Dirty data is a problem that costs businesses thousands, if not millions, every year. And with the increasing use of AI and Generative AI, it’s only getting worse. In organisations large and small across the globe you will hear talk of data quality issues. What you will rarely hear about is the consequences or best practices on how to fix it.

    Fully revised and updated throughout, this new edition of Between the Spreadsheets draws on classification expert Susan Walsh’s years of hands-on experience in data to present a fool-proof method for cleaning and classifying your data. The book covers everything from the very basics of data classification to normalisation and taxonomies, and presents the author’s proven COAT framework, helping ensure an organisation’s data is Consistent, Organised, Accurate and Trustworthy. A series of data horror stories outlines what can go wrong in managing data, and if it does, how it can be fixed as well as new advice on using GenAI and why it is so important to have clean data before using it.

    After reading this book, regardless of your level of experience, not only will you be able to work with your data more efficiently, but you will also understand the impact the work you do with it has, and how it affects the rest of the organisation. Written in an engaging and highly practical manner, Between the Spreadsheets, 2nd Edition gives readers of all levels a deep understanding of the dangers of dirty data and the confidence and skills to work more efficiently and effectively with it.



    The second edition of Between the Spreadsheets seamlessly expands a world in which author Susan Walsh is showing us not only the uncomfortable truth around dirty data, but also approaches and methods on how to get rid of dirty data in an effective and sustainable way. The new chapter on breaking myths around how GenAI can help with data cleaning is especially timely and enlightening, and the data horror stories are scary but also painfully reflective of data issues in today’s day and age. Susan’s writing style is wonderfully reflective of her fun and approachable personality, and I can only recommend anyone interested in creating and maintaining clean data to read this book!

    More

    Table of Contents:

    Introduction

    1. The Dangers of Dirty Data
    2. Supplier Normalisation
    3. Taxonomies
    4. Spend Data Classification
    5. Basic Data Cleansing
    6. Before and After: Real-Life Data Cleaning Case Studies
    7. The Myth Exposed: Data Cleaning and GenAI
    8. Other Methodologies
    9. The Dirty Data Maturity Model
    10. Data Horror Stories

    Summary

    More