Category: Databases & Big Data

Data Exploration and Preparation with BigQuery 0

Data Exploration and Preparation with BigQuery

eBook Details: Paperback: 264 pages Publisher: WOW! eBook (November 29, 2023) Language: English ISBN-10: 1805125265 ISBN-13: 978-1805125266 eBook Description: Data Exploration and Preparation with BigQuery: Learn to understand and prepare data using BigQuery to make your data accurate, reliable, and ready for analysis and modeling Data professionals encounter a multitude of challenges such as handling large volumes of data, dealing with data silos, and the lack of appropriate tools. Datasets often arrive in different conditions and formats demanding considerable time from analysts engineers, and scientists to process and uncover insights. The complexity of the data life cycle often hinders teams and organizations...

Practical Machine Learning on Databricks 0

Practical Machine Learning on Databricks

eBook Details: Paperback: 244 pages Publisher: WOW! eBook (November 24, 2023) Language: English ISBN-10: 1801812039 ISBN-13: 978-1801812030 eBook Description: Practical Machine Learning on Databricks: Take your machine learning skills to the next level by mastering databricks and building robust ML pipeline solutions for future MLOps innovations Unleash the potential of databricks for end-to-end machine learning with this comprehensive guide, tailored for experienced data scientists and developers transitioning from DIY or other cloud platforms. Building on a strong foundation in Python, Practical Machine Learning on Databricks serves as your roadmap from development to production, covering all intermediary steps using the databricks platform. You’ll...

Kafka Troubleshooting in Production 0

Kafka Troubleshooting in Production

eBook Details: Paperback: 236 pages Publisher: WOW! eBook (December 14, 2023) Language: English ISBN-10: 1484294890 ISBN-13: 978-1484294895 eBook Description: Kafka Troubleshooting in Production: Stabilizing Kafka Clusters in the Cloud and On-premises This book provides Kafka administrators, site reliability engineers, and DataOps and DevOps practitioners with a list of real production issues that can occur in Kafka clusters and how to solve them. The production issues covered are assembled into a comprehensive troubleshooting guide for those engineers who are responsible for the stability and performance of Kafka clusters in production, whether those clusters are deployed in the cloud or on-premises. This Kafka Troubleshooting...

Alteryx Designer: The Definitive Guide 0

Alteryx Designer: The Definitive Guide

eBook Details: Paperback: 526 pages Publisher: WOW! eBook (December 19, 2023) Language: English ISBN-10: 1098107527 ISBN-13: 978-1098107529 eBook Description: Alteryx Designer: The Definitive Guide: Simplify and Automate Your Analytics Analytics projects are frequently long, drawn-out affairs, requiring multiple teams and skills to clean, join, and eventually turn data into analysis for timely decision-making. Alteryx Designer changes all of that. With this low-code, self-service, drag-and-drop workflow platform, new and experienced data and business analysts can deliver results in hours instead of weeks. Ready to work with data quickly and efficiently? This Alteryx Designer: The Definitive Guide gets you started. Learn the fundamentals of...

Distributed Machine Learning with PySpark: Migrating Effortlessly from Pandas and Scikit-Learn 0

Distributed Machine Learning with PySpark

eBook Details: Paperback: 510 pages Publisher: WOW! eBook (December 8, 2023) Language: English ISBN-10: 1484297504 ISBN-13: 978-1484297506 eBook Description: Distributed Machine Learning with PySpark: Migrating Effortlessly from Pandas and Scikit-Learn Migrate from pandas and scikit-learn to PySpark to handle vast amounts of data and achieve faster data processing time. This book will show you how to make this transition by adapting your skills and leveraging the similarities in syntax, functionality, and interoperability between these tools. Distributed Machine Learning with PySpark offers a roadmap to data scientists considering transitioning from small data libraries (pandas/scikit-learn) to big data processing and machine learning with PySpark....