Practical Data Science CookbookStarting with the basics, this book will cover how to set up your numerical programming environment, introduce you to the data science pipeline (an iterative process by which data science projects are completed), and guide you through several data projects in a step-by-step format. By sequentially working through the steps in each chapter, you will quickly familiarize yourself with the process and learn how to apply it to a variety of situations with examples in the two most popular programming languages for data analysis - R and Python. ...
Practical Zendesk Administration, 2nd EditionImplementing the Zendesk customer service software as part of your company's operations can be time-consuming, but with the best practices and advice in this hands-on guide, you can shorten the procedure considerably. You'll learn the purpose, benefits, and pitfalls of each Zendesk feature, with examples of how to configure it to match your company's needs and processes.
Written by written by two experienced Zendesk product trainers, this book is distilled from years of working in the trenches, launching customer implementations, and answering thousands of questions from participants throughout the world. With it, you'll be able to determine the best way to put Zendesk's vast potential to work for your company. ...
Pro Apache Hadoop, 2nd EditionPro Apache Hadoop, Second Edition brings you up to speed on Hadoop – the framework of big data. Revised to cover Hadoop 2.0, the book covers the very latest developments such as YARN (aka MapReduce 2.0), new HDFS high-availability features, and increased scalability in the form of HDFS Federations. All the old content has been revised too, giving the latest on the ins and outs of MapReduce, cluster design, the Hadoop Distributed File System, and more.
This book covers everything you need to build your first Hadoop cluster and begin analyzing and deriving value from your business and scientific data. Learn to solve big-data problems the MapReduce way, by breaking a big problem into chunks and creating small-scale solutions that can be flung across thousands upon thousands of nodes to analyze large data volumes in a short amount of wall-clock time. Learn how to let Hadoop take care of distributing and parallelizing your software - you just focus on the code; Hadoop takes care of the ...
Ansible: Up and Running, 2nd EditionAmong the many configuration management tools available, Ansible has some distinct advantages - it's minimal in nature, you don't need to install anything on your nodes, and it has an easy learning curve. With this updated second edition, you'll learn how to be productive with this tool quickly, whether you're a developer deploying code to production or a system administrator looking for a better automation solution.
Authors Lorin Hochstein and René Moser show you how to write playbooks (Ansible's configuration management scripts), manage remote servers, and explore the tool's real power: built-in declarative modules. You'll discover that Ansible has the functionality you need - and the simplicity you desire.
Manage Windows machines, and automate network device configuration; Manage your fleet from your web browser with Ansible Tower; Understand how Ansible differs from other configuration management systems; Use the YAML file format to write your own playbooks; Work with a comp ...
Building Microservices with ASP.NET CoreAt a time when nearly every vertical, regardless of domain, seems to need software running in the cloud to make money, microservices provide the agility and drastically reduced time to market you require. This hands-on guide shows you how to create, test, compile, and deploy microservices, using the ASP.NET Core free and open-source framework. Along the way, you'll pick up good, practical habits for building powerful and robust services.
Building microservices isn't about learning a specific framework or programming language; it's about building applications that thrive in elastically scaling environments that don't have host affinity, and that can start and stop at a moment's notice. This practical book guides you through the process.
Learn test-driven and API-first development concepts; Communicate with other services by creating and consuming backing services such as databases and queues; Build a microservice that depends on an external data source; Learn about event sourcing, ...
Kubernetes: Up and RunningLegend has it that Google deploys over two billion application containers a week. How's that possible? Google revealed the secret through a project called Kubernetes, an open source cluster orchestrator (based on its internal Borg system) that radically simplifies the task of building, deploying, and maintaining scalable distributed systems in the cloud. This practical guide shows you how Kubernetes and container technology can help you achieve new levels of velocity, agility, reliability, and efficiency.
Authors Kelsey Hightower, Brendan Burns, and Joe Beda - who've worked on Kubernetes at Google and other organizatons - explain how this system fits into the lifecycle of a distributed application. You will learn how to use tools and APIs to automate scalable distributed systems, whether it is for online services, machine-learning applications, or a cluster of Raspberry Pi computers.
Explore the distributed system challenges that Kubernetes addresses; Dive into containerized appl ...
Deep Learning with Applications Using PythonBuild deep learning applications, such as computer vision, speech recognition, and chatbots, using frameworks such as TensorFlow and Keras. This book helps you to ramp up your practical know-how in a short period of time and focuses you on the domain, models, and algorithms required for deep learning applications. Deep Learning with Applications Using Python covers topics such as chatbots, natural language processing, and face and object recognition. The goal is to equip you with the concepts, techniques, and algorithm implementations needed to create programs capable of performing deep learning.
This book covers intermediate and advanced levels of deep learning, including convolutional neural networks, recurrent neural networks, and multilayer perceptrons. It also discusses popular APIs such as IBM Watson, Microsoft Azure, and scikit-learn.
Work with various deep learning frameworks such as TensorFlow, Keras, and scikit-learn; Build face recognition and face detection capabilit ...
The Customer-Driven PlaybookDespite the wide acceptance of Lean approaches and customer-development strategies, many product teams still have difficulty putting these principles into meaningful action. That's where The Customer-Driven Playbook comes in. This practical guide provides a complete end-to-end process that will help you understand customers, identify their problems, conceptualize new ideas, and create fantastic products they'll love.
To build successful products, you need to continually test your assumptions about your customers and the products you build. This book shows team leads, researchers, designers, and managers how to use the Hypothesis Progression Framework (HPF) to formulate, experiment with, and make sense of critical customer and product assumptions at every stage. With helpful tips, real-world examples, and complete guides, you'll quickly learn how to turn Lean theory into action.
Collect and formulate your assumptions into hypotheses that can be tested to unlock meaningful insights ...
Text Mining with RMuch of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you'll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext and other tidy tools in R can make text analysis easier and more effective.
The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You'll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media.
Learn how to apply the tidy text format to NLP; Use sentiment analysis to mine the emotional content of text; Identify a document's most important terms with frequency ...
Advanced Analytics with Spark, 2nd EditionIn the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this edition acts as an introduction to these techniques and other best practices in Spark programming.
You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques - including classification, clustering, collaborative filtering, and anomaly detection - to fields such as genomics, security, and finance.
If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find the book's patterns useful for working on your own data applications.
Familiarize yourself with the Spark programming model; Become comfortable within the Spark ec ...