R in ActionR in Action is the first book to present both the R system and the use cases that make it such a compelling package for business developers. The book begins by introducing the R language, including the development environment. Focusing on practical solutions, the book also offers a crash course in practical statistics and covers elegant methods for dealing with messy and incomplete data using features of R.
R is a powerful language for statistical computing and graphics that can handle virtually any data-crunching task. It runs on all important platforms and provides thousands of useful specialized modules and utilities. This makes R a great way to get meaningful information from mountains of raw data. ...
The Definitive Guide to MongoDB, 3rd EditionThe Definitive Guide to MongoDB, Third Edition, is updated for MongoDB 3 and includes all of the latest MongoDB features, including the aggregation framework introduced in version 2.2 and hashed indexes in version 2.4. The Third Edition also now includes Node.js along with Python.
MongoDB is the most popular of the "Big Data" NoSQL database technologies, and it's still growing. David Hows from 10gen, along with experienced MongoDB authors Peter Membrey and Eelco Plugge, provide their expertise and experience in teaching you everything you need to know to become a MongoDB pro. ...
Client-Side Data StorageOne of the most useful features of today's modern browsers is the ability to store data right on the user's computer or mobile device. Even as more people move toward the cloud, client-side storage can still save web developers a lot of time and money, if you do it right. This hands-on guide demonstrates several storage APIs in action. You'll learn how and when to use them, their plusses and minuses, and steps for implementing one or more of them in your application.
Ideal for experienced web developers familiar with JavaScript, this book also introduces several open source libraries that make storage APIs easier to work with. ...
Panel Data Analysis using EViewsA comprehensive and accessible guide to panel data analysis using EViews software.
This book explores the use of EViews software in creating panel data analysis using appropriate empirical models and real datasets. Guidance is given on developing alternative descriptive statistical summaries for evaluation and providing policy analysis based on pool panel data. Various alternative models based on panel data are explored, including univariate general linear models, fixed effect models and causal models, and guidance on the advantages and disadvantages of each one is given. ...
Learning Informatica PowerCenter 9.xInformatica PowerCenter provides the perfect platform to utilize and leverage business data. It allows you to easily, conveniently, and efficiently work on different types of data.
This book covers functionality such as creating/importing source and target, identifying errors, and debugging your mapping through a series of comprehensive tutorials. Besides learning about types of Slowly Changing Dimensions (SCDs), you will learn to create and link workflows. As you progress, exhaustive knowledge of transformations and techniques to create folders, migrate code, and optimize system performance will be explored in detail.
The step-by-step approach and adoption of real-time scenarios will guide you through effectively accessing all core functionalities offered by Informatica PowerCenter. ...
VMware vRealize Operations Performance and Capacity ManagementVMware vRealize Operations is a suite of products that automates operations management using patented analytics and an integrated approach to performance, capacity, and configuration management. vRealize Operations Manager is the most important component of this suite that helps administrators to maintain and troubleshoot their VMware environment as well as their physical environment.
This book takes you through the fundamental differences between a Software-Defined Data Center and a classic physical data center, and how these differences impact both architecture and operations. From a strategic point of view, you will come across the most common challenges associated with performance management in a Software-Defined Data Center. Furthermore, you will learn all the key counters in vSphere and vRealize Operations, understand their dependencies, and acquaint yourself with practical solutions to configure them for a healthy virtual environment. ...
IPython Interactive Computing and Visualization CookbookIPython is at the heart of the Python scientific stack. With its widely acclaimed web-based notebook, IPython is today an ideal gateway to data analysis and numerical computing in Python.
IPython Interactive Computing and Visualization Cookbook contains many ready-to-use focused recipes for high-performance scientific computing and data analysis. The first part covers programming techniques, including code quality and reproducibility; code optimization; high-performance computing through dynamic compilation, parallel computing, and graphics card programming. The second part tackles data science, statistics, machine learning, signal and image processing, dynamical systems, and pure and applied mathematics. ...
Fast Data Processing with SparkSpark is a framework for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and inbuilt tools for interactive query analysis (Shark), large-scale graph processing and analysis (Bagel), and real-time analysis (Spark Streaming), it can be interactively used to quickly process and query big data sets.
Fast Data Processing with Spark covers how to write distributed map reduce style programs with Spark. The book will guide you through every step required to write effective distributed programs from setting up your cluster and interactively exploring the API, to deploying your job to the cluster, and tuning it for your purposes. ...
Learning SciPy for Numerical and Scientific ComputingIt's essential to incorporate workflow data and code from various sources in order to create fast and effective algorithms to solve complex problems in science and engineering. Data is coming at us faster, dirtier, and at an ever increasing rate. There is no need to employ difficult-to-maintain code, or expensive mathematical engines to solve your numerical computations anymore. SciPy guarantees fast, accurate, and easy-to-code solutions to your numerical and scientific computing applications.
Learning SciPy for Numerical and Scientific Computing unveils secrets to some of the most critical mathematical and scientific computing problems and will play an instrumental role in supporting your research. The book will teach you how to quickly and efficiently use different modules and routines from the SciPy library to cover the vast scope of numerical mathematics with its simplistic practical approach that's easy to follow. ...
Graphing Data with RIt's much easier to grasp complex data relationships with a graph than by scanning numbers in a spreadsheet. This introductory guide shows you how to use the R language to create a variety of useful graphs for visualizing and analyzing complex data for science, business, media, and many other fields. You'll learn methods for highlighting important relationships and trends, reducing data to simpler forms, and emphasizing key numbers at a glance.
Anyone who wants to analyze data will find something useful here - even if you don't have a background in mathematics, statistics, or computer programming. If you want to examine data related to your work, this book is the ideal way to start. ...