Название: Software Engineering for Data Scientists: From Notebooks to Scalable Systems (Final)
Автор: Catherine Nelson
Издательство: O’Reilly Media, Inc.
Год: 2024
Страниц: 258
Язык: английский
Формат: pdf (true), epub (true)
Размер: 11.7 MB
Data Science happens in code. The ability to write reproducible, robust, scaleable code is key to a data science project's success—and is absolutely essential for those working with production code. This practical book bridges the gap between data science and software engineering,and clearly explains how to apply the best practices from software engineering to Data Science. All the code examples in this book are written in Python, and many of the chapters describe Python-specific tools. In recent years, Python has become the most popular programming language for Data Science. Python has an extremely solid set of open source libraries for data science, with good backing and a healthy community of maintainers. Large trend-setting companies have chosen Python for their main ML frameworks, including TensorFlow (Google) and PyTorch (Meta). Because of this, Python appears to be especially popular among data scientists working on production Machine Learning code, where good coding skills are particularly important. In my experience, the Python community has been friendly and welcoming, with many excellent events that have helped me improve my skills. It’s my preferred programming language, so it was an easy choice for this book. This book is aimed at data scientists, but people working in closely related fields such as data analysts, Machine Learning (ML) engineers, and data engineers will also find it useful.