Mastering Feature EngineeringFeature engineering is essential to applied machine learning, but using domain knowledge to strengthen your predictive models can be difficult and expensive. To help fill the information gap on feature engineering, this complete hands-on guide teaches beginning-to-intermediate data scientists how to work with this widely practiced but little discussed topic.
Author Alice Zheng explains common practices and mathematical principles to help engineer features for new data and tasks. If you understand basic machine learning concepts like supervised and unsupervised learning, you're ready to get started. Not only will you learn how to implement feature engineering in a systematic and principled way, you'll also learn how to practice better data science. ...
Classic Computer Science Problems in SwiftDon't just learn another language. Become a better programmer instead. Today's awesome iOS apps stand on the shoulders of classic algorithms, coding techniques, and engineering principles. Master these core skills in Swift, and you'll be ready for AI, data-centric programming, machine learning, and the other development challenges that will define the next decade.
Classic Computer Science Problems in Swift deepens your Swift language skills by exploring foundational coding techniques and algorithms. As you work through examples in search, clustering, graphs, and more, you'll remember important things you've forgotten and discover classic solutions to your "new" problems. You'll appreciate author David Kopec's amazing ability to connect the core disciplines of computer science to the real-world concerns of apps, data, performance, and even nailing your next job interview! ...
Streaming DataAs humans, we're constantly filtering and deciphering the information streaming toward us. In the same way, streaming data applications can accomplish amazing tasks like reading live location data to recommend nearby services, tracking faults with machinery in real time, and sending digital receipts before your customers leave the shop. Recent advances in streaming data technology and techniques make it possible for any developer to build these applications if they have the right mindset. This book will let you join them.
Streaming Data is an idea-rich tutorial that teaches you to think about efficiently interacting with fast-flowing data. Through relevant examples and illustrated use cases, you'll explore designs for applications that read, analyze, share, and store streaming data. Along the way, you'll discover the roles of key technologies like Spark, Storm, Kafka, Flink, RabbitMQ, and more. This book offers the perfect balance between big-picture thinking and implementation deta ...
Practical SQLPractical SQL is an approachable and fast-paced guide to SQL (Structured Query Language), the standard programming language for defining, organizing, and exploring data in relational databases. The book focuses on using SQL to find the story your data tells, with the popular open-source database PostgreSQL and the pgAdmin interface as its primary tools.
You'll first cover the fundamentals of databases and the SQL language, then build skills by analyzing data from the U.S. Census and other federal and state government agencies. With exercises and real-world examples in each chapter, this book will teach even those who have never programmed before all the tools necessary to build powerful databases and access information quickly and efficiently.
Create databases and related tables using your own data; Define the right data types for your information; Aggregate, sort, and filter data to find patterns; Use basic math and advanced statistical functions; Identify errors in data and cle ...
Exam Ref 70-778 Analyzing and Visualizing Data by Using Microsoft Power BIPrepare for Microsoft Exam 70-778 - and help demonstrate your real-world mastery of Power BI data analysis and visualization. Designed for experienced BI professionals and data analysts ready to advance their status, Exam Ref focuses on the critical thinking and decision-making acumen needed for success at the MCSA level.
Focus on the expertise measured by these objectives: Consume and transform data by using Power BI Desktop; Model and visualize data; Configure dashboards, reports, and apps in the Power BI Service.
This Microsoft Exam Ref: Organizes its coverage by exam objectives; Features strategic, what-if scenarios to challenge you; Assumes you have experience consuming and transforming data, modeling and visualizing data, and configuring dashboards using Excel and Power BI. ...
Exam Ref 70-768 Developing SQL Data ModelsPrepare for Microsoft Exam 70-768 - and help demonstrate your real-world mastery of Business Intelligence (BI) solutions development with SQL Server 2016 Analysis Services (SSAS), including modeling and queries. Designed for experienced IT professionals ready to advance their status, Exam Ref focuses on the critical thinking and decision-making acumen needed for success at the MCSA level.
Focus on the expertise measured by these objectives: Design a multidimensional BI semantic model; Design a tabular BI semantic model; Develop queries using Multidimensional Expressions (MDX) and Data Analysis Expressions (DAX); Configure and maintain SSAS.
This Microsoft Exam Ref: Organizes its coverage by exam objectives; Features strategic, what-if scenarios to challenge you; Assumes you are a database or BI professional with experience creating models, writing MDX or DAX queries, and using SSAS. ...
Exam Ref 70-761 Querying Data with Transact-SQLPrepare for Microsoft Exam 70-761 - and help demonstrate your real-world mastery of SQL Server 2016 Transact-SQL data management, queries, and database programming. Designed for experienced IT professionals ready to advance their status, Exam Ref focuses on the critical-thinking and decision-making acumen needed for success at the MCSA level.
Focus on the expertise measured by these objectives: Filter, sort, join, aggregate, and modify data; Use subqueries, table expressions, grouping sets, and pivoting; Query temporal and non-relational data, and output XML or JSON; Create views, user-defined functions, and stored procedures; Implement error handling, transactions, data types, and nulls.
This Microsoft Exam Ref: Organizes its coverage by exam objectives; Features strategic, what-if scenarios to challenge you; Assumes you have experience working with SQL Server as a database administrator, system engineer, or developer; Includes downloadable sample database and code for SQL Serve ...
Exam Ref 70-767 Implementing a SQL Data WarehousePrepare for Microsoft Exam 70-767 - and help demonstrate your real-world mastery of skills for managing data warehouses. This exam is intended for Extract, Transform, Load (ETL) data warehouse developers who create business intelligence (BI) solutions. Their responsibilities include data cleansing as well as ETL and data warehouse implementation. The reader should have experience installing and implementing a Master Data Services (MDS) model, using MDS tools, and creating a Master Data Manager database and web application. The reader should understand how to design and implement ETL control flow elements and work with a SQL Service Integration Services package.
Focus on the expertise measured by these objectives: Design, and implement, and maintain a data warehouse; Extract, transform, and load data; Build data quality solutions.
This Microsoft Exam Ref: Organizes its coverage by exam objectives; Features strategic, what-if scenarios to challenge you; Assumes you have working kno ...
Exam Ref 70-779 Analyzing and Visualizing Data with Microsoft ExcelPrepare for Microsoft Exam 70-779 - and help demonstrate your real-world mastery of Microsoft Excel data analysis and visualization. Designed for BI professionals, data analysts, and others who analyze business data with Excel, this Exam Ref focuses on the critical thinking and decision-making acumen needed for success at the MCSA level.
Focus on the expertise measured by these objectives: Consume and transform data by using Microsoft Excel; Model data, from building and optimizing data models through creating performance KPIs, actual and target calculations, and hierarchies; Visualize data, including creating and managing PivotTables and PivotCharts, and interacting with PowerBI.
This Microsoft Exam Ref: Organizes its coverage by exam objectives; Features strategic, what-if scenarios to challenge you; Assumes you have a strong understanding of how to use Microsoft Excel to perform data analysis. ...
Learning SparkData in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java, and Scala. This edition includes new information on Spark SQL, Spark Streaming, setup, and Maven coordinates.
Written by the developers of Spark, this book will have data scientists and engineers up and running in no time. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning.
Quickly dive into Spark capabilities such as distributed datasets, in-memory caching, and the interactive shell; Leverage Spark's powerful built-in libraries, including Spark SQL, Spark Streaming, and MLlib; Use one programming paradigm instead of mixing and matching tools like Hiv ...