Managed Data Analysis in the Google Cloud
Buch, Englisch, 525 Seiten, Format (B × H): 178 mm x 254 mm, Gewicht: 1047 g
ISBN: 978-1-4842-6185-9
Verlag: Apress
BigQuery is a managed cloud platform from Google that provides enterprise data warehousing and reporting capabilities. Part I of this book shows you how to design and provision a data warehouse in the BigQuery platform. Part II teaches you how to load and stream your operational data into the warehouse to make it ready for analysis and reporting. Parts III and IV cover querying and maintaining, helping you keep your information relevant with other Google Cloud Platform services and advanced BigQuery. Part V takes reporting to the next level by showing you how to create dashboards to provide at-a-glance visual representations of your business situation. Part VI provides an introduction to data science with BigQuery, covering machine learning and Jupyter notebooks.
What You Will Learn
- Design a data warehouse for your project or organization
- Load data from a variety of external and internal sources
- Integrate other Google Cloud Platform services for more complex workflows
- Maintain and scale your data warehouse as your organization grows
- Analyze, report, and create dashboards on the information in the warehouse
- Become familiar with machine learning techniques using BigQuery ML
Who This Book Is For
Developers who want to provide business users with fast, reliable, and insightful analysis from operational data, and data analysts interested in a cloud-based solution that avoids the pain of provisioning their own servers.
Zielgruppe
Professional/practitioner
Autoren/Hrsg.
Weitere Infos & Material
Part I. Building a Warehouse.- 1. Settling into BigQuery.- 2. Starting Your Warehouse Project.- 3. All My Data.- 4. Managing BigQuery Costs.- Part II. Filling the Warehouse.- 5. Loading Data Into the Warehouse.- 6. Streaming Data Into the Warehouse.- 7. Dataflow.- Part III. Using the Warehouse.- 8. Care and Feeding of Your Warehouse.- 9. Querying the Warehouse.- 10. Scheduling Jobs.- 11. Serverless Functions with GCP.- 12. Cloud Logging.- Part IV. Maintaining the Warehouse.- 13. Advanced BigQuery.- 14. Data Governance.- 15. Adapting to Long-Term Change.- Part V. Reporting On and Visualizing Your Data.- 16. Reporting.- 17. Dashboards and Visualization.- 18. Google Data Studio.- Part VI. Enhancing Your Data's Potential.- 19. BigQuery ML.- 20. Jupyter Notebooks and Public Datasets.- 21. Conclusion.- 22. Appendix A: Cloud Shell and Cloud SDK.- 23. Appendix B: Sample Project Charter.