AI might have captured the world鈥檚 attention for its ability to do seemingly complicated things with ease. While the technologies that underpin it 鈥 like machine learning (ML) 鈥 are becoming increasingly accessible, many ML teams when bringing these sophisticated models into production. ML product delivery is a team sport that necessitates multi-disciplinary collaboration, ranging from data science to product engineering.听
听
To help ML practitioners address such challenges, three Thoughtworkers 鈥 Ada Leung, David Tan and David Colls 鈥 wrote Effective Machine Learning Teams. Published by O鈥橰eilly in March 2024, it鈥檚 an essential guide that explains how ML teams 鈥 regardless of their size or organizational scale 鈥 can reliably and rapidly develop and deliver ML products to meet customer needs.听
听
Given the relevance of the book to today鈥檚 software industry, thanks to an increasing focus on machine learning and AI-driven products and services, we wanted to hear directly from the authors about the subject and the process of bringing their book to life.听
听
With each of them bringing a unique perspective to writing the book, their answers illustrate the range of their experiences and thinking. Find out what they had to say below鈥
What鈥檚 so hard about building machine learning products?
听
Richard Gall: What's the hardest thing about building ML products?
听
Ada Leung: Two things I think are particularly important are breaking down organizational silos and shifting feedback left. While we鈥檝e seen investments towards building ML capabilities, we are also seeing challenges validating 鈥 through feedback 鈥 that we are indeed solving a customer problem and that the solution is right for people. And, to do it successfully, it really does require us to apply systems thinking; we need to look at how knowledge is structured across the organization.
听
Dave Colls: Four hard things: Building the right thing, building it right, building it in a way that is right for people, and off-by-one errors.
听
David Tan: It鈥檚 hard to pick one! If I had to, I鈥檇 say it鈥檚 the silos that some organizations build around data scientists and ML engineers. I鈥檝e seen a small ML team spend months building a pricing ML model for another team only to have them shelve it 鈥 essentially thrown away by the other team. If only they had shifted that feedback left, with the techniques that we discuss in our book 鈥 product discovery, the right team shape and leadership to foster trust, hypothesis-driven experiments 鈥 they would have saved months of effort and morale and ultimately delivered a better outcome for everyone involved.听
听
Another hard thing is the rapidly changing landscape in AI and Generative AI 鈥 e.g. in large language models (LLMs), MLOps platforms, dependency management tools, just to name a few. Before the paint has time to dry 鈥 or best practices have time to diffuse and graduate into normative practices 鈥 teams just have to run with it and figure it out along the way.听
Why this book, now?
听
RG: Why did you write the book?
听
Ada: We wanted to show how principles and practices adopted in software engineering can be equally successfully applied to building ML products.
听
David: In various projects and workshops, we鈥檝e had the chance to work with data scientists, ML engineers and their leaders to build ML systems and improve how teams deliver ML systems. Through first-hand experience of these challenges in building ML systems, we鈥檝e seen ML engineering best practices help teams reduce toil and improve flow. We鈥檝e shared these ideas in various hands-on workshops, client engagements, blog posts and, for me, on my lowly . We received positive feedback through these channels and thought hey, we鈥檙e onto something here about how 鈥渨e鈥 (as an industry) do ML, what鈥檚 broken and how we could do better.
听
Dave: There was a gap to fill. ML is no longer just about technology, if it ever was. ML is a multidisciplinary endeavor that needs a multidisciplinary approach and team-first thinking. I hope we provided this.
听
RG: Tell us what this book is about 鈥 in five words or less鈥
听
Dave: Better results through better teams.
听
David: Shipping ML products reliably, rapidly.
听
Ada: Principles and practices recipe book.
AI in the engineering process
听
RG: Can AI help the engineering process? If so, how? And what are the limits?
听
Dave: Computer-aided approaches have been used in varied engineering disciplines for many decades. AI can help software engineering in different ways: it can help engineers learn technology and plan complex initiatives as well as do coding work, and it can help teams collectively manage their knowledge. The limits currently are in AI鈥檚 ability to reason about novel and unique business scenarios (as opposed to pattern-match common historical scenarios). People are going to need to drive those elements of the creative engineering process for a little while longer.
听
David: As we know, AI-assisted development is handy for specific problems 鈥 context-aware code completion, helping you write SQL, writing user stories, for example. We also know that it鈥檚 fallible and we still need the human 鈥減ilot鈥 to assess the correctness and suitability of AI-generated content. But the main point I鈥檇 like to make is that these specific tasks are just one of many parts in the product engineering process. Product engineering is fundamentally a social activity that encompasses information flow between humans: user testing, feature prioritization, system design, and so on. As an industry, we need to educate and remind each other that AI is a helpful tool and not a replacement for these other essential jobs in building digital products.
听
Ada: Again, what Dave and David said! Successful product engineering includes a human-led creative process, and is very much a social activity. On a human level, having the choice to utilize AI in the engineering process means we are opening up our deep-thinking time on the actual problem at hand. Think about hypothesis-driven experiments, for instance, which can surface outcomes faster because our test and learn (in other words, the feedback loop) is shorter.听
Successful product engineering includes a human-led creative process, and is very much a social activity. On a human level, having the choice to utilize AI in the engineering process means we are opening up our deep-thinking time on the actual problem at hand.
Successful product engineering includes a human-led creative process, and is very much a social activity. On a human level, having the choice to utilize AI in the engineering process means we are opening up our deep-thinking time on the actual problem at hand.
The importance and value of technology books
听
RG: What do you value in technology books? How would you characterize a really great tech book?
听
David: I really appreciate it when authors share their first-hand practical experience and not just theoretical knowledge. Sometimes when I鈥檓 faced with a new technology or a new problem, I鈥檓 thankful to find a resource where someone shares their empirical knowledge, so that I can stand on the shoulders of giants and see steady paths that they鈥檝e trod.听
听
Dave: Now I think about it, I wish you鈥檇 posed this question before we started! I think great tech books have a really simple central idea, but also tremendous depth that keeps you reading beyond the first chapter, gleaning immediately applicable practical advice and new perspectives each time you return to the content.
听
Ada: Visual aids, relatable analogies (but not too many) that compliment the theory. I find myself reaching for tech books when I鈥檝e spent time learning through doing and experiencing first-hand some of the challenges 鈥 of course I鈥檓 always in a team so have the support of the team to 鈥榝ail safely鈥.
听
RG: Are there any books that informed your approach to Effective Machine Learning Teams?
听
David: It鈥檚 not a book, but the seminal papers 鈥溾 and 鈥溾 by D. Sculley et al. provided the language and a rubric for thinking about ML systems as a multi-disciplinary undertaking.听
听
was another good one that explains what Lean actually entails, and how practitioners can apply it in their business. I鈥檝e also been deeply influenced by , which feature timeless 鈥 but often neglected! 鈥 advice, such as: cease dependence on inspection to achieve quality, break down barriers between staff areas, and drive out fear. Many of the practices in our book are extensions and stories of applying Lean in the context of building ML systems.
听
Dave: Team Topologies was a big influence for me, as evident in Chapter 11 of Effective ML Teams, which I鈥檝e dubbed the 鈥淭eam Topologies expansion pack for ML鈥. We do refer to many, many other books and technology thinkers throughout, but some we don鈥檛 explicitly reference but are nonetheless big influences for me, and that will broaden any technologist鈥檚 perspective, are: Don Norman鈥檚 The Design of Everyday Things, Kathryn Schulz鈥檚 Being Wrong and Yvon Chouinard鈥檚 Let My People Go Surfing.
听
Ada: For establishing foundational concepts around breaking down work, Jeff Patton鈥檚 has been a good resource. Lean Enterprise and The Design of Everyday Things were also books that have influenced my practice. Marty Cagan鈥檚 series of books that talk about product management and leadership are also great reads; Empowered, Inspired听and Transformed.
听
Something to acknowledge 鈥 and that we touch on lightly 鈥 is the discomfort we experience when navigating ambiguity especially during the early stages of delivery. Bren茅 Brown鈥檚 Daring Greatly and her article have been helpful in separating between the unknowns that come with the problem itself and how we handle ourselves.
Thanks Dave, Ada and David for taking the time to answer my questions. You can have a sneak peek of Effective Machine Learning Teams听here听or听.
Disclaimer: The statements and opinions expressed in this article are those of the author(s) and do not necessarily reflect the positions of 黑料门.