Pro Salesforce Analytics CloudThis book explains Salesforce Analytics Cloud and provides a holistic view of different analytical capabilities and how they fit into the overall information architecture. It features real-world industry use cases and demonstrates how Salesforce's Analytics Cloud solves business challenges and brings real value to the organization.
The Salesforce Analytics Cloud represents the rethinking of analytics for the business user. The Analytics Cloud is a cloud-based platform designed for the business user to have access to analytics "on the go," providing answers to questions instantly on any device. This mobile-ready capability of the Salesforce Analytics Cloud means users can immediately collaborate and share insights with team members right inside Salesforce.
Pro Salesforce Analytics Cloud provides actionable guidance on creating analytical capabilities using Salesforce Analytics Cloud. ...
Pro Spark StreamingLearn the right cutting-edge skills and knowledge to leverage Spark Streaming to implement a wide array of real-time, streaming applications. This book walks you through end-to-end real-time application development using real-world applications, data, and code. Taking an application-first approach, each chapter introduces use cases from a specific industry and uses publicly available datasets from that domain to unravel the intricacies of production-grade design and implementation. The domains covered in the book include social media, the sharing economy, finance, online advertising, telecommunication, and IoT.
In the last few years, Spark has become synonymous with big data processing. DStreams enhance the underlying Spark processing engine to support streaming analysis with a novel micro-batch processing model. Pro Spark Streaming by Zubair Nabi will enable you to become a specialist of latency sensitive applications by leveraging the key features of DStreams, micro-batch processi ...
Elasticsearch: The Definitive GuideWhether you need full-text search or real-time analytics of structured data - or both - the Elasticsearch distributed search engine is an ideal way to put your data to work. This practical guide not only shows you how to search, analyze, and explore data with Elasticsearch, but also helps you deal with the complexities of human language, geolocation, and relationships.
If you're a newcomer to both search and distributed systems, you'll quickly learn how to integrate Elasticsearch into your application. More experienced users will pick up lots of advanced techniques. Throughout the book, you'll follow a problem-based approach to learn why, when, and how to use Elasticsearch features. ...
Practical Business Analytics Using SASPractical Business Analytics Using SAS: A Hands-on Guide shows SAS users and businesspeople how to analyze data effectively in real-life business scenarios.
The book begins with an introduction to analytics analytical tools, and SAS programming. The authors—both SAS, statistics, analytics and big data experts—first show how SAS is used in business, and then how to get started programming in SAS by importing data and learning how to manipulate it. Besides illustrating SAS basic functions, you will see how each function can be used to get the information you need to improve business performance. Each chapter offers hands-on exercises drawn from real business situations.
The book then provides an overview of statistics, as well as instruction on exploring data, preparing it for analysis, and testing hypotheses. You will learn how to use SAS to perform analytics and model using both basic and advanced techniques like multiple regression, logistic regre ...
Customer Analytics For DummiesEnsuring your customers are having positive experiences with your company at all levels, including initial brand awareness and loyalty, is crucial to the success of your business. Customer Analytics For Dummies shows you how to measure each stage of the customer journey and use the right analytics to understand customer behavior and make key business decisions.
Customer Analytics For Dummies gets you up to speed on what you should be testing. You'll also find current information on how to leverage A/B testing, social media's role in the post-purchasing analytics usability metrics, prediction and statistics, and much more to effectively manage the customer experience. Written by a highly visible expert in the area of customer analytics this guide will have you up and running on putting customer analytics into practice at your own business in no time. ...
Fast Data Processing with Spark, 2nd EditionSpark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (GraphX), and real-time analysis (Spark Streaming), it can be interactively used to quickly process and query big datasets.
Fast Data Processing with Spark - Second Edition covers how to write distributed programs with Spark. The book will guide you through every step required to write effective distributed programs from setting up your cluster and interactively exploring the API to developing analytics applications and tuning them for your purposes. ...
Learning Apache MahoutIn the past few years the generation of data and our capability to store and process it has grown exponentially. There is a need for scalable analytics frameworks and people with the right skills to get the information needed from this Big Data. Apache Mahout is one of the first and most prominent Big Data machine learning platforms. It implements machine learning algorithms on top of distributed processing platforms such as Hadoop and Spark.
Starting with the basics of Mahout and machine learning, you will explore prominent algorithms and their implementation in Mahout development. You will learn about Mahout building blocks, addressing feature extraction, reduction and the curse of dimensionality, delving into classification use cases with the random forest and Naïve Bayes classifier and item and user-based recommendation. You will then work with clustering Mahout using the K-means algorithm and implement Mahout without MapReduce. Finish with a flourish by exploring end-to ...
Real-time Analytics with Storm and CassandraThis book will teach you how to use Storm for real-time data processing and to make your applications highly available with no downtime using Cassandra.
The book starts off with the basics of Storm and its components along with setting up the environment for the execution of a Storm topology in local and distributed mode. Moving on, you will explore the Storm and Zookeeper configurations, understand the Storm UI, set up Storm clusters, and monitor Storm clusters using various tools. You will then add NoSQL persistence to Storm and set up a Cassandra cluster. You will do all this while being guided by the best practices for Storm and Cassandra applications. Next, you will learn about data partitioning and consistent hashing in Cassandra through examples and also see high availability features and replication in Cassandra. Finally, you'll learn about different methods that you can use to manage and maintain Cassandra and Storm. ...
Advanced Analytics with SparkIn this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example.
You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques - classification, collaborative filtering, and anomaly detection among others - to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find these patterns useful for working on your own data applications. ...
Python Data AnalyticsPython Data Analytics will help you tackle the world of data acquisition and analysis using the power of the Python language. At the heart of this book lies the coverage of pandas, an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
Author Fabio Nelli expertly shows the strength of the Python programming language when applied to processing, managing and retrieving information. Inside, you will see how intuitive and flexible it is to discover and communicate meaningful patterns of data using Python scripts, reporting systems, and data export. This book examines how to go about obtaining, processing, storing, managing and analyzing data using the Python programming language.
You will use Python and other open source tools to wrangle data and tease out interesting and important trends in that data that will allow you to predict future patterns. Whether you are dealing with sales ...
Derivatives Analytics with PythonDerivatives Analytics with Python shows you how to implement market-consistent valuation and hedging approaches using advanced financial models, efficient numerical techniques, and the powerful capabilities of the Python programming language. This unique guide offers detailed explanations of all theory, methods, and processes, giving you the background and tools necessary to value stock index options from a sound foundation. You'll find and use self-contained Python scripts and modules and learn how to apply Python to advanced data and derivatives analytics as you benefit from the 5,000+ lines of code that are provided to help you reproduce the results and graphics presented. Coverage includes market data analysis, risk-neutral valuation, Monte Carlo simulation, model calibration, valuation, and dynamic hedging, with models that exhibit stochastic volatility, jump components, stochastic short rates, and more. The companion website features all code and IPython Notebooks f ...