IT eBooks
Download, Read, Use
Learning MySQL and MariaDB

If you're a programmer new to databases - or just new to MySQL and its community-driven variant, MariaDB - you've found the perfect introduction. This hands-on guide provides an easy, step-by-step approach to installing, using, and maintaining these popular relational database engines. Author Russell Dyer, Curriculum Manager at MariaDB and former editor of the MySQL Knowledge Base, takes you through database design and the basics of data management and manipulation, using real-world examples and many practical tips. Exercises and review questions help you practice what you've just learned. ...
Learning Apache Mahout

In the past few years, the generation of data and our capability to store and process it have grown exponentially. There is a need for scalable analytics frameworks and people with the right skills to get the information needed from this Big Data. Apache Mahout is one of the first and most prominent Big Data machine learning platforms. It implements machine learning algorithms on top of distributed processing platforms such as Hadoop and Spark. Starting with the basics of Mahout and machine learning, you will explore prominent algorithms and their implementation in Mahout development. You will learn about Mahout building blocks, addressing feature extraction, reduction and the curse of dimensionality, delving into classification use cases with the random forest and Naïve Bayes classifier and item and user-based recommendation. You will then work with clustering in Mahout using the K-means algorithm and implement Mahout without MapReduce. Finish with a flourish by exploring end-to-end us ...
Real-time Analytics with Storm and Cassandra

This book will teach you how to use Storm for real-time data processing and to make your applications highly available with no downtime using Cassandra. The book starts off with the basics of Storm and its components along with setting up the environment for the execution of a Storm topology in local and distributed mode. Moving on, you will explore the Storm and Zookeeper configurations, understand the Storm UI, set up Storm clusters, and monitor Storm clusters using various tools. You will then add NoSQL persistence to Storm and set up a Cassandra cluster. You will do all this while being guided by the best practices for Storm and Cassandra applications. Next, you will learn about data partitioning and consistent hashing in Cassandra through examples and also see high availability features and replication in Cassandra. Finally, you'll learn about different methods that you can use to manage and maintain Cassandra and Storm. ...
Mastering pandas

Python is a groundbreaking language for its simplicity and succinctness, allowing the user to achieve a great deal with a few lines of code, especially compared to other programming languages. The pandas library brings these features of Python into the data analysis realm by providing expressiveness, simplicity, and powerful capabilities for the task of data analysis. By mastering pandas, users will be able to do complex data analysis in a short period of time, as well as illustrate their findings using the rich visualization capabilities of related tools such as IPython and matplotlib. This book is an in-depth guide to the use of pandas for data analysis, for either the seasoned data analysis practitioner or the novice user. It provides a basic introduction to the pandas framework, and takes users through the installation of the library and the IPython interactive environment. Thereafter, you will learn basic as well as advanced features, such as MultiIndexing, modifying data structures, ...
Python Data Analysis

Dive deeper into data analysis with the flexibility of Python and learn how its extensive range of scientific and mathematical libraries can be used to solve some of the toughest challenges in data analysis. Build your confidence and expertise and develop valuable skills in high demand in a world driven by Big Data with this expert data analysis book. This data science tutorial will help you learn how to effectively retrieve, clean, manipulate, and visualize data and establish a successful data analysis workflow. Apply the impressive functionality of Python's data mining tools and scientific and numerical libraries to a range of the most important tasks within data analysis and data science, and develop strategies and ideas to take control of your own data analysis projects. Get to grips with statistical analysis using NumPy and SciPy, visualize data with Matplotlib, and uncover sophisticated insights through predictive analytics and machine learning with SciKit-Learn. You will also le ...
Learning pandas

This learner's guide will help you understand how to use the features of pandas for interactive data manipulation and analysis. This book is your ideal guide to learning about pandas, all the way from installing it to creating one- and two-dimensional indexed data structures, indexing and slicing-and-dicing that data to derive results, loading data from local and Internet-based resources, and finally creating effective visualizations to form quick insights. You start with an overview of pandas and NumPy and then dive into the details of pandas, covering pandas' Series and DataFrame objects, before ending with a quick review of using pandas for several problems in finance. With the knowledge you gain from this book, you will be able to quickly begin your journey into the exciting world of data science and analysis. ...
Big Data Forensics: Learning Hadoop Investigations

Big Data forensics is an important type of digital investigation that involves the identification, collection, and analysis of large-scale Big Data systems. Hadoop is one of the most popular Big Data solutions, and forensically investigating a Hadoop cluster requires specialized tools and techniques. With the explosion of Big Data, forensic investigators need to be prepared to analyze the petabytes of data stored in Hadoop clusters. Understanding Hadoop's operational structure and performing forensic analysis with court-accepted tools and best practices will help you conduct a successful investigation. Discover how to perform a complete forensic investigation of large-scale Hadoop clusters using the same tools and techniques employed by forensic experts. This book begins by taking you through the process of forensic investigation and the pitfalls to avoid. It will walk you through Hadoop's internals and architecture, and you will discover what types of information Hadoop stores and ...
Beginning Big Data with Power BI and Excel 2013

In Beginning Big Data with Power BI and Excel 2013, you will learn to solve business problems by tapping the power of Microsoft's Excel and Power BI to import data from NoSQL and SQL databases and other sources, create relational data models, and analyze business problems through sophisticated dashboards and data-driven maps. While Beginning Big Data with Power BI and Excel 2013 covers prominent tools such as Hadoop and the NoSQL databases, it recognizes that most small and medium-sized businesses don't have the Big Data processing needs of a Netflix, Target, or Facebook. Instead, it shows how to import data and use the self-service analytics available in Excel with Power BI. As you'll see through the book's numerous case examples, these tools - which you already know how to use - can perform many of the same functions as the higher-end Apache tools many people believe are required to carry out Big Data projects. ...
The Visual Organization

The era of Big Data has arrived, and most organizations are woefully unprepared. Slowly, many are discovering that stalwarts like Excel spreadsheets, KPIs, standard reports, and even traditional business intelligence tools aren't sufficient. These old standbys can't begin to handle today's increasing streams, volumes, and types of data. In The Visual Organization, award-winning author and technology expert Phil Simon looks at how an increasing number of organizations are embracing new dataviz tools and, more important, a new mind-set based upon data discovery and exploration. Simon adroitly shows how Amazon, Apple, Facebook, Google, Twitter, and other tech heavyweights use powerful data visualization tools to garner fascinating insights into their businesses. But make no mistake: these companies are hardly alone. Organizations of all types, industries, and sizes are representing their data in new and amazing ways. As a result, they are asking better questions and making better business ...
Network Programmability and Automation

Like sysadmins before them, network engineers are finding that they cannot do their work manually anymore. As the field faces new protocols, technologies, delivery models, and a pressing need for businesses to be more agile and flexible, network automation is becoming essential. This practical guide shows network engineers how to use a range of technologies and tools - including Linux, Python, JSON, and XML - to automate their systems through code. Network programming and automation will help you simplify tasks involved in configuring, managing, and operating network equipment, topologies, services, and connectivity. Through the course of the book, you'll learn the basic skills and tools you need to make this critical transition. Python programming basics: data types, conditionals, loops, functions, classes, and modules; Linux fundamentals to provide the foundation you need on your network automation journey; Data formats and models: JSON, XML, YAML, and YANG for networking; Jinja te ...
Reproduction of site books is authorized only for informative purposes and strictly for personal, private use.
IT eBooks Group © 2011-2025