
Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python
Paperback
Publisher Price: $79.99
ISBN13: 9781492072942
Publisher: Oreilly Media
Published: Jun 16 2020
Pages: 360
Weight: 1.30
Height: 0.90 Width: 7.00 Depth: 9.10
Language: English
Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this popular guide adds comprehensive examples in Python, provides practical guidance on applying statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what's important and what's not.
Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you're familiar with the R or Python programming languages and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.
With this book, you'll learn:
- Why exploratory data analysis is a key preliminary step in data science
- How random sampling can reduce bias and yield a higher-quality dataset, even with big data
- How the principles of experimental design yield definitive answers to questions
- How to use regression to estimate outcomes and detect anomalies
- Key classification techniques for predicting which categories a record belongs to
- Statistical machine learning methods that learn from data
- Unsupervised learning methods for extracting meaning from unlabeled data
Also in
Databases
Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
Kleppmann, Martin
Paperback
Fundamentals of Data Engineering: Plan and Build Robust Data Systems
Reis, Joe
Housley, Matt
Paperback
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
Shields, Walter
Hardcover
The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling
Kimball, Ralph
Ross, Margy
Paperback
Fusion Strategy: How Real-Time Data and AI Will Power the Industrial Future
Venkatraman, Venkat
Govindarajan, Vijay
Hardcover
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
Shields, Walter
Paperback
Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing
Kohavi, Ron
Tang, Diane
Xu, Ya
Paperback
The Definitive Guide to Dax: Business Intelligence for Microsoft Power Bi, SQL Server Analysis Services, and Excel
Russo, Marco
Ferrari, Alberto
Paperback
Introduction to Statistics: An Intuitive Guide for Analyzing Data and Unlocking Discoveries
Frost, Jim
Paperback
Product Operations: How successful companies build better products at scale
Perri, Melissa
Tilles, Denise
Paperback
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition
Hastie, Trevor
Friedman, Jerome
Tibshirani, Robert
Hardcover
Data Analytics & Visualization All-In-One for Dummies
Hyman, Jack A.
McFedries, Paul
Massaron, Luca
Paperback
SQL Cookbook: Query Solutions and Techniques for All SQL Users
Molinaro, Anthony
Graaf, Robert de
Paperback
Hands-On Salesforce Data Cloud: Implementing and Managing a Real-Time Customer Data Platform
Avila, Joyce Kay
Paperback
Statistical Tableau: How to Use Statistical Models and Decision Science in Tableau
Lang, Ethan
Paperback
Apache Iceberg: The Definitive Guide: Data Lakehouse Functionality, Performance, and Scalability on the Data Lake
Merced, Alex
Shiran, Tomer
Hughes, Jason
Paperback
Becoming a Data Head: How to Think, Speak, and Understand Data Science, Statistics, and Machine Learning
Gutman, Alex J.
Goldmeier, Jordan
Paperback
SQL for Data Analysis: Advanced Techniques for Transforming Data Into Insights
Tanimura, Cathy
Paperback
Designing Data Governance from the Ground Up: Six Steps to Build a Data-Driven Culture
Maffeo, Lauren
Paperback
Mathletics: How Gamblers, Managers, and Fans Use Mathematics in Sports, Second Edition
Pelechrinis, Konstantinos
Winston, Wayne L.
Nestler, Scott
Paperback
Databricks Certified Data Engineer Associate Study Guide: In-Depth Guidance and Practice
Alhussein, Derar
Paperback
Numerical Python: Scientific Computing and Data Science Applications with Numpy, Scipy and Matplotlib
Johansson, Robert
Paperback
R in Action, Third Edition: Data Analysis and Graphics with R and Tidyverse
Kabacoff, Robert I.
Paperback
Analytics the Right Way: A Business Leader's Guide to Putting Data to Productive Use
Wilson, Tim
Sutherland, Joe
Paperback
Non-Invasive Data Governance: The Path of Least Resistance and Greatest Success
Seiner, Robert
Paperback
Data Modeling with Microsoft Power BI: Self-Service and Enterprise Data Warehouse with Power BI
Ehrenmueller-Jensen, Markus
Paperback
Agile Data Warehouse Design: Collaborative Dimensional Modeling, from Whiteboard to Star Schema
Corr, Lawrence
Stagnitto, Jim
Paperback
Football Analytics with Python & R: Learning Data Science Through the Lens of Sports
Eager, Eric A.
Erickson, Richard a.
Paperback
High Performance PostgreSQL for Rails: Reliable, Scalable, Maintainable Database Applications
Atkinson, Andrew
Paperback
Exam Ref Dp-600 Implementing Analytics Solutions Using Microsoft Fabric
Maslyuk, Daniil
Winter, Johnny
Resl, Stěpán
Paperback
Blockchain: The Comprehensive Guide to Blockchain Development, Ethereum, Solidity, and Smart Contracts
Fertig, Tobias
Schütz, Andreas
Paperback
Data and Reality: A Timeless Perspective on Perceiving and Managing Information in Our Imprecise World, 3rd Edition
Kent, William
Paperback
PostgreSQL 16 Administration Cookbook: Solve real-world Database Administration challenges with 180+ practical recipes and best practices
Mejías, Boriss
Angelakos, Jimmy
Ciolli, Gianni
Paperback
Turning Data into Wisdom: How We Can Collaborate with Data to Change Ourselves, Our Organizations, and Even the World
Hanegan, Kevin
Paperback
Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us about Who We Really Are
Stephens-Davidowitz, Seth
Paperback
SQL Server 2022 Query Performance Tuning: Troubleshoot and Optimize Query Performance
Fritchey, Grant
Paperback
Collect, Combine, and Transform Data Using Power Query in Power Bi and Excel
Raviv, Gil
Maslyuk, Daniil
Paperback
Excel 2021: Everything you need to know about Excel to go from Beginner to Expert
Wright, Nora E.
Paperback
Alteryx Designer: The Definitive Guide: Simplify and Automate Your Analytics
Burkhow, Joshua
Paperback
Databricks Data Intelligence Platform: Unlocking the Genai Revolution
Yip, Jason
Gupta, Nikhil
Paperback
Winning with Data Science: A Handbook for Business Leaders
Swaminathan, Akshay
Friedman, Howard Steven
Hardcover
Observability Engineering: Achieving Production Excellence
Majors, Charity
Fong-Jones, Liz
Miranda, George
Paperback
Streaming Databases: Unifying Batch and Stream Processing
Debusmann, Ralph Matthias
Dulay, Hubert
Paperback
Apache Airflow Best Practices: A practical guide to orchestrating data workflow with Apache Airflow
Storey, Dylan
Intorf, Dylan
Doorn, Kendrick Van
Paperback
Data Governance: The Definitive Guide: People, Processes, and Tools to Operationalize Data Trustworthiness
Eryurek, Evren
Gilad, Uri
Lakshmanan, Valliappa
Paperback
Aerospike: Up and Running: Developing on a Modern Operational Database for Globally Distributed Apps
Srinivasan, V.
Faulkes, Tim
Autin, Albert
Paperback
Practical Time Series Analysis: Prediction with Statistics and Machine Learning
Nielsen, Aileen
Paperback
Hands-On MySQL Administration: Managing MySQL on Premises and in the Cloud
Ayyalusamy, Jeyaram
Aravindan, Arunjith
Paperback
Analytics Engineering with SQL and Dbt: Building Meaningful Data Models at Scale
Machado, Rui Pedro
Russa, Helder
Paperback
Implementing Data Mesh: Design, Build, and Implement Data Contracts, Data Products, and Data Mesh
Perrin, Jean-Georges
Broda, Eric
Paperback
Value-Driven Data: Identifying, Communicating and Delivering Effective Business Solutions with Data
Odaro, Edosa
Paperback
Cockroachdb: The Definitive Guide: Distributed Data at Scale
Seldess, Jesse
Darnell, Ben
Harrison, Guy
Paperback
The Data Storyteller's Handbook: How to create business impact using data storytelling
Greenbrook, Kat
Paperback
High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark
Karau, Holden
Warren, Rachel
Paperback
Data Strategy: How to Profit from a World of Big Data, Analytics and Artificial Intelligence
Marr, Bernard
Paperback
Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale
Palino, Todd
Sivaram, Rajini
Shapira, Gwen
Paperback
Learn Microsoft Fabric: A practical guide to performing data analytics in the era of artificial intelligence
Ali, Arshad
Schacht, Bradley
Paperback
Optimizing DAX: Improving DAX performance in Microsoft Power BI and Analysis Services
Ferrari, Alberto
Russo, Marco
Paperback
Data Engineering Design Patterns: Recipes for Solving the Most Common Data Engineering Problems
Konieczny, Bartosz
Paperback
Big Data in Der Mobilität: Akteure, Geschäftsmodelle Und Nutzenpotenziale Für Die Welt Von Morgen
Knorre, Susanne
Müller-Peters, Horst
Gatzert, Nadine
Paperback
Data Engineering Best Practices: Architect robust and cost-effective data solutions in the cloud era
Larochelle, David
Schiller, Richard J.
Paperback
Data Governance Handbook: A practical approach to building trust in data
Batchelder, Wendy S.
Paperback
Object-Role Modeling Fundamentals: A Practical Guide to Data Modeling with ORM
Halpin, Terry
Paperback
Blueprints for Text Analytics Using Python: Machine Learning-Based Solutions for Common Real World (Nlp) Applications
Albrecht, Jens
Ramachandran, Sidharth
Winkler, Christian
Paperback
Delta Lake: The Definitive Guide: Modern Data Lakehouse Architectures with Data Lakes
Haines, Scott
Lee, Denny
Wentling, Tristen
Paperback
Build a Robo-Advisor with Python (from Scratch): Automate Your Financial and Investment Decisions
Reider, Rob
Michalka, Alex
Paperback
Business 101 for the Data Professional: What You Need to Know to Succeed in Business
Morrow, Jordan
Paperback
SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis
Teate, Renee M. P.
Paperback
Mongodb: The Definitive Guide: Powerful and Scalable Data Storage
Chodorow, Kristina
Bradshaw, Shannon
Brazil, Eoin
Paperback
Azure Data Factory by Example: Practical Implementation for Data Engineers
Swinbank, Richard
Paperback
PostgreSQL Query Optimization: The Ultimate Guide to Building Efficient Queries
Dombrovskaya, Henrietta
Bailliekova, Anna
Database Expert
Paperback
Data Analytics with Hadoop: An Introduction for Data Scientists
Bengfort, Benjamin
Kim, Jenny
Paperback
Text as Data: A New Framework for Machine Learning and the Social Sciences
Roberts, Margaret E.
Stewart, Brandon M.
Grimmer, Justin
Paperback
Snowflake: The Definitive Guide: Architecting, Designing, and Deploying on the Snowflake Data Cloud
Avila, Joyce Kay
Paperback
High Performance Python: Practical Performant Programming for Humans
Gorelick, Micha
Ozsvald, Ian
Paperback
Practical Lakehouse Architecture: Designing and Implementing Modern Data Platforms at Scale
Thalpati, Gaurav Ashok
Paperback
Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
Massung, Sean
Zhai, Chengxiang
Paperback
Predictive Analytics for the Modern Enterprise: A Practitioner's Guide to Designing and Implementing Solutions
Ali, Nooruddin Abbas
Paperback
Learn PostgreSQL - Second Edition: Use, manage and build secure and scalable databases with PostgreSQL 16
Ferrari, Luca
Pirozzi, Enrico
Paperback
Database Management for Business Leaders: Building and Using Data Solutions That Work for You
Ruddell, Larry
Paperback
The Decision Maker's Handbook to Data Science: AI and Data Science for Non-Technical Executives, Managers, and Founders
Kampakis, Stylianos
Paperback
The Enterprise Data Catalog: Improve Data Discovery, Ensure Data Governance, and Enable Innovation
Olesen-Bagneux, Ole
Paperback
Applied Unsupervised Learning with Python
Jones, Aaron
Kruger, Christopher
Johnston, Benjamin
Paperback
Practical Natural Language Processing: A Comprehensive Guide to Building Real-World Nlp Systems
Majumder, Bodhisattwa
Vajjala, Sowmya
Gupta, Anuj
Paperback
SAP S/4hana Financial Accounting Certification Guide: Application Associate Exam
Pougkas, Stefanos
Paperback
Aprende SQL en un fin de semana: El curso definitivo para crear y consultar bases de datos
Padial Solier, Antonio
Paperback
Practical Serverless Applications with AWS: Harnessing the Power of Serverless Cloud Applications
Basha, Shaik Inthiyaz
Prakash, Apoorva
Paperback
Product Analytics: Applied Data Science Techniques for Actionable Consumer Insights
Rodrigues, Joanne
Paperback
Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management
Linoff, Gordon S.
Berry, Michael J. a.
Paperback
Pandas Cookbook - Third Edition: Practical recipes for scientific computing, time series, and exploratory data analysis using Python
Ayd, William
Harrison, Matthew
Paperback
Data-Driven Talent Management: Using Analytics to Improve Employee Experience
Saling, Kristin
Paperback
Data Quality Fundamentals: A Practitioner's Guide to Building Trustworthy Data Pipelines
Vorwerck, Molly
Moses, Barr
Gavish, Lior
Paperback
Azure SQL Revealed: The Next-Generation Cloud Database with AI and Microsoft Fabric
Ward, Bob
Paperback
Data and Analytics Strategy for Business: Unlock Data Assets and Increase Innovation with a Results-Driven Data Strategy
Asplen-Taylor, Simon
Paperback
MICROSOFT EXCEL & ACCESS For Beginners and Pros. 2024: A Complete Guide to Master Excel and Access 365 for All Users
Sherer, Charles
Paperback
Principles of Data Science: Mathematical techniques and theory to succeed in data-driven industries
Ozdemir, Sinan
Paperback
Cracking the Data Engineering Interview: Land your dream job with the help of resume-building tips, over 100 mock questions, and a unique portfolio
Ransome, Taamir
Bryan, Kedeisha
Paperback
Essential Data Analytics, Data Science, and AI: A Practical Guide for a Data-Driven World
Attobrah, Maxine
Paperback
Mastering Python for Bioinformatics: How to Write Flexible, Documented, Tested Python Code for Research Computing
Youens-Clark, Ken
Paperback
Data Analytics Made Easy: Analyze and present data to make informed decisions without writing any code
Mauro, Andrea de
Paperback
Architecting Data and Machine Learning Platforms: Enable Analytics and Ai-Driven Innovation in the Cloud
Tekiner, Firat
Lakshmanan, Valliappa
Tranquillin, Marco
Paperback