Data Analysis Using SQL and Excel
Useful business analysis requires you to effectively transform data into actionable information. This book helps you use SQL and Excel to extract business information from relational databases and use that data to define business dimensions, store transactions about customers, produce results, and more. Each chapter explains when and why to perform a particular type of business analysis in order to obtain useful results, how to design and perform the analysis using SQL and Excel, and what the results should look like. ...
Parallel R
It's tough to argue with R as a high-quality, cross-platform, open source statistical software product - unless you're in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets. You'll learn the basics of Snow, Multicore, Parallel, and some Hadoop-related tools, including how to find them, how to use them, when they work well, and when they don't.
With these packages, you can overcome R's single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R's memory barrier. ...
Ethics of Big Data
What are your organization's policies for generating and using huge datasets full of personal information? This book examines ethical questions raised by the big data phenomenon, and explains why enterprises need to reconsider business decisions concerning privacy and identity. Authors Kord Davis and Doug Patterson provide methods and techniques to help your business engage in a transparent and productive ethical inquiry into your current data practices. ...
A Developer's Guide to Data Modeling for SQL Server
Effective data modeling is essential to ensuring that your databases will perform well, scale well, and evolve to meet changing requirements. However, if you're modeling databases to run on Microsoft SQL Server 2008 or 2005, theoretical or platform-agnostic data modeling knowledge isn't enough: models that don't reflect SQL Server's unique real-world strengths and weaknesses often lead to disastrous performance. ...
Big Data Now
This collection represents the full spectrum of data-related content we've published on O'Reilly Radar over the last year. Mike Loukides kicked things off in June 2010 with "What is data science?" and from there we've pursued the various threads and themes that naturally emerged. Now, roughly a year later, we can look back over all we've covered and identify a number of core data areas. ...
Big Data Now: 2012 Edition
The Big Data Now anthology is relevant to anyone who creates, collects or relies upon data. It's not just a technical book or just a business guide. Data is ubiquitous and it doesn't pay much attention to borders, so we've calibrated our coverage to follow it wherever it goes.
In the first edition of Big Data Now, the O'Reilly team tracked the birth and early development of data tools and data science. Now, with this second edition, we're seeing what happens when big data grows up: how it's being applied, where it's playing a role, and the consequences - good and bad alike - of data's ascendance. ...
MongoDB: The Definitive Guide, 2nd Edition
Manage the huMONGOus amount of data collected through your web application with MongoDB. This authoritative introduction, written by a core contributor to the project, shows you the many advantages of using document-oriented databases, and demonstrates how this reliable, high-performance system allows for almost infinite horizontal scalability.
This updated second edition provides guidance for database developers, advanced configuration for system administrators, and an overview of the concepts and use cases for other people on your project. Ideal for NoSQL newcomers and experienced MongoDB users alike, this guide provides numerous real-world schema design examples. ...
Data Structures and Algorithms in Python
Based on the authors' market leading data structures books in Java and C++, this textbook offers a comprehensive, definitive introduction to data structures in Python by respected authors. Data Structures and Algorithms in Python is the first mainstream object-oriented book available for the Python data structures course. Designed to provide a comprehensive introduction to data structures and algorithms, including their design, analysis, and implementation, the text will maintain the same general structure as Data Structures and Algorithms in Java and Data Structures and Algorithms in C++. ...
Data Mining and Business Analytics with R
Collecting, analyzing, and extracting valuable information from a large amount of data requires easily accessible, robust, computational and analytical tools. Data Mining and Business Analytics with R utilizes the open source software R for the analysis, exploration, and simplification of large high-dimensional data sets. As a result, readers are provided with the needed guidance to model and interpret complicated data and become adept at building powerful models for prediction and classification.
Highlighting both underlying concepts and practical computational skills, Data Mining and Business Analytics with R begins with coverage of standard linear regression and the importance of parsimony in statistical modeling. ...
Access 2007 VBA Bible
Learn how to tap the full potential of Access 2007. Transfer Access data seamlessly between Microsoft Office applications - and that's just for starters. In this all-new, comprehensive guide by well-known Access expert Helen Feddema, you'll learn to write Visual Basic code that automates Access database tasks, creates standalone scripts, extracts and merges data, and allows you to put together powerful solutions. Whether you're a beginner or a power user, this is the book you need to succeed with Access 2007. ...