IT eBooks
Download, Read, Use
Web Scraping with Python, 2nd Edition
Web Scraping with Python, 2nd Edition

If programming is magic then web scraping is surely a form of wizardry. By writing a simple automated program, you can query web servers, request data, and parse it to extract the information you need. The expanded edition of this practical book not only introduces you web scraping, but also serves as a comprehensive guide to scraping almost every type of data from the modern web. Part I focuses on web scraping mechanics: using Python to request information from a web server, performing basic handling of the server's response, and interacting with sites in an automated fashion. Part II explores a variety of more specific tools and applications to fit any web scraping scenario you're likely to encounter. Parse complicated HTML pages; Develop crawlers with the Scrapy framework; Learn methods to store data you scrape; Read and extract data from documents; Clean and normalize badly formatted data; Read and write natural languages; Crawl through forms and logins; Scrape JavaScript and ...
Better Business Decisions from Data
Better Business Decisions from Data

Everyone encounters statistics on a daily basis. They are used in proposals, reports, requests, and advertisements, among others, to support assertions, opinions, and theories. Unless you're a trained statistician, it can be bewildering. What are the numbers really saying or not saying? Better Business Decisions from Data: Statistical Analysis for Professional Success provides the answers to these questions and more. It will show you how to use statistical data to improve small, every-day management judgments as well as major business decisions with potentially serious consequences. With the arrival of Big Data, statistical processing has taken on a new level of importance. Kenny lays a foundation for understanding the importance and value of Big Data, and then he shows how mined data can help you see your business in a new light and uncover opportunity. ...
Big Data Made Easy
Big Data Made Easy

Many corporations are finding that the size of their data sets are outgrowing the capability of their systems to store and process them. The data is becoming too big to manage and use with traditional tools. The solution: implementing a big data system. The problem is that the Internet offers IT pros wading into big data many versions of the truth and some outright falsehoods born of ignorance. What is needed is a book just like this one: a wide-ranging but easily understood set of instructions to explain where to get Hadoop tools, what they can do, how to install them, how to configure them, how to integrate them, and how to use them successfully. And you need an expert who has worked in this area for a decade—someone just like author and big data expert Mike Frampton. ...
Data Push Apps with HTML5 SSE
Data Push Apps with HTML5 SSE

Make sure your website or web application users get content updates right now with minimal latency. This concise guide shows you how to push new data from the server to clients with HTML5 Server-Sent Events (SSE), an exceptional technology that doesn't require constant polling or user interaction. You'll learn how to build a real-world SSE application from start to finish that solves a demanding domain problem. You'll also discover how to increase that application's desktop and mobile browser support from 60% to 99%, using different fallback solutions. If you're familiar with HTML, HTTP, and basic JavaScript, you're ready to get started. ...
Data Scientists at Work
Data Scientists at Work

Data Scientists at Work is a collection of interviews with sixteen of the world's most influential and innovative data scientists from across the spectrum of this hot new profession. "Data scientist is the sexiest job in the 21st century," according to the Harvard Business Review. By 2018, the United States will experience a shortage of 190,000 skilled data scientists, according to a McKinsey report. Each of these data scientists shares how he or she tailors the torrent-taming techniques of big data, data visualization, search, and statistics to specific jobs by dint of ingenuity, imagination, patience, and passion. Data Scientists at Work parts the curtain on the interviewees' earliest data projects, how they became data scientists, their discoveries and surprises in working with data, their thoughts on the past, present, and future of the profession, their experiences of team collaboration within their organizations, and the insights they have gained as they get their ha ...
Data Visualization For Dummies
Data Visualization For Dummies

Big data is big news! Every company, industry, not-for-profit, and government agency wants and needs to analyze and leverage datasets that can quickly become ponderously large. Data visualization software enables different industries to present information in ways that are memorable and relevant to their mission. This full-color guide introduces you to a variety of ways to handle and synthesize data in much more interesting ways than mere columns and rows of numbers. Learn meaningful ways to show trending and relationships, how to convey complex data in a clear, concise diagram, ways to create eye-catching visualizations, and much more! ...
Practical Web Scraping for Data Science
Practical Web Scraping for Data Science

This book provides a complete and modern guide to web scraping, using Python as the programming language, without glossing over important details or best practices. Written with a data science audience in mind, the book explores both scraping and the larger context of web technologies in which it operates, to ensure full understanding. The authors recommend web scraping as a powerful tool for any data scientist's arsenal, as many data science projects start by obtaining an appropriate data set. Starting with a brief overview on scraping and real-life use cases, the authors explore the core concepts of HTTP, HTML, and CSS to provide a solid foundation. Along with a quick Python primer, they cover Selenium for JavaScript-heavy sites, and web crawling in detail. The book finishes with a recap of best practices and a collection of examples that bring together everything you've learned and illustrate various data science use cases. Leverage well-established best practices and commonly ...
Practical Data Science with R
Practical Data Science with R

Business analysts and developers are increasingly collecting, curating, analyzing, and reporting on crucial business data. The R language and its associated tools provide a straightforward way to tackle day-to-day data science tasks without a lot of academic theory or advanced mathematics. Practical Data Science with R shows you how to apply the R programming language and useful statistical techniques to everyday business situations. Using examples from marketing, business intelligence, and decision support, it shows you how to design experiments (such as A/B tests), build predictive models, and present results to audiences of all levels. ...
Advanced Analytics with Spark, 2nd Edition
Advanced Analytics with Spark, 2nd Edition

In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this edition acts as an introduction to these techniques and other best practices in Spark programming. You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques - including classification, clustering, collaborative filtering, and anomaly detection - to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find the book's patterns useful for working on your own data applications. Familiarize yourself with the Spark programming model; Become comfortable within the Spark ec ...
Mastering Feature Engineering
Mastering Feature Engineering

Feature engineering is essential to applied machine learning, but using domain knowledge to strengthen your predictive models can be difficult and expensive. To help fill the information gap on feature engineering, this complete hands-on guide teaches beginning-to-intermediate data scientists how to work with this widely practiced but little discussed topic. Author Alice Zheng explains common practices and mathematical principles to help engineer features for new data and tasks. If you understand basic machine learning concepts like supervised and unsupervised learning, you're ready to get started. Not only will you learn how to implement feature engineering in a systematic and principled way, you'll also learn how to practice better data science. ...
← Prev       Next →
Reproduction of site books is authorized only for informative purposes and strictly for personal, private use.
Only Direct Download
IT eBooks Group © 2011-2025