Foundations for Architecting Data Solutions
While many companies ponder implementation details such as distributed processing engines and algorithms for data analysis, this practical book takes a much wider view of big data development, starting with initial planning and moving diligently toward execution. Authors Ted Malaska and Jonathan Seidman guide you through the major components necessary to start, architect, and develop successful big data projects.
Everyone from CIOs and COOs to lead architects and developers will explore a variety of big data architectures and applications, from massive data pipelines to web-scale applications. Each chapter addresses a piece of the software development life cycle and identifies patterns to maximize long-term success throughout the life of your project.
Start the planning process by considering the key data project types; Use guidelines to evaluate and select data management solutions; Reduce risk related to technology, your team, and vague requirements; Explore system interface de ...
Learning Kali Linux
With more than 600 security tools in its arsenal, the Kali Linux distribution can be overwhelming. Experienced and aspiring security professionals alike may find it challenging to select the most appropriate tool for conducting a given test. This practical book covers Kali's expansive security capabilities and helps you identify the tools you need to conduct a wide range of security tests and penetration tests. You'll also explore the vulnerabilities that make those tests necessary.
Author Ric Messier takes you through the foundations of Kali Linux and explains methods for conducting tests on networks, web applications, wireless security, password vulnerability, and more. You'll discover different techniques for extending Kali tools and creating your own toolset.
Learn tools for stress testing network stacks and applications; Perform network reconnaissance to determine what's available to attackers; Execute penetration tests using automated exploit tools such as Metasploit; Use c ...
Getting Started with Kudu
Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to choose solutions using the least common denominator - either fast analytics at the cost of slow data ingestion or fast data ingestion at the cost of slow analytics. There is an answer to this problem. With the Apache Kudu column-oriented data store, you can easily perform fast analytics on fast data. This practical guide shows you how.
Begun as an internal project at Cloudera, Kudu is an open source solution compatible with many data processing frameworks in the Hadoop environment. In this book, current and former solutions professionals from Cloudera provide use cases, examples, best practices, and sample code to help you get up to speed with Kudu.
Explore Kudu's high-level design, including how it spreads data across servers; Fully administer a Kudu cluster, enable security, and add or remove nodes; Learn Kudu's client-side APIs, including how to integrate Apache Impala, ...
Prometheus: Up & Running
Get up to speed with Prometheus, the metrics-based monitoring system used by tens of thousands of organizations in production. This practical guide provides application developers, sysadmins, and DevOps practitioners with a hands-on introduction to the most important aspects of Prometheus, including dashboarding and alerting, direct code instrumentation, and metric collection from third-party systems with exporters.
This open source system has gained popularity over the past few years for good reason. With its simple yet powerful data model and query language, Prometheus does one thing, and it does it well. Author and Prometheus developer Brian Brazil guides you through Prometheus setup, the Node exporter, and the Alertmanager, then demonstrates how to use them for application and infrastructure monitoring.
Know where and how much to apply instrumentation to your application code; Identify metrics with labels using unique key-value pairs; Get an introduction to Grafana, a popular ...
Head First PMP, 4th Edition
Head First PMP teaches you the latest principles and certification objectives in The PMBOK Guide in a unique and inspiring way. This updated fourth edition takes you beyond specific questions and answers with a unique visual format that helps you grasp the big picture of project management. By putting PMP concepts into context, you'll be able to understand, remember, and apply them, not just on the exam, but on the job. No wonder so many people have used Head First PMP as their sole source for passing the PMP exam.
Learn PMP's underlying concepts to help you understand the PMBOK principles and pass the certification exam with flying colors; Get 100% coverage of the latest principles and certification objectives in The PMBOK Guide, Fifth Edition; Make use of a thorough and effective preparation guide with hundreds of practice questions and exam strategies; Explore the material through puzzles, games, problems, and exercises that make learning easy and entertaining. ...
This Is Service Design Methods
In this book, you'll find 54 hands-on descriptions that help you DO the key methods used in service design. These methods include instructions, guidelines, and tips-and-tricks for activities within research, ideation, prototyping, and facilitation. This is the print version of the method companion to the book This Is Service Design Doing (#TiSDD). It includes the same content that you can find free on the book website, tisdd.com, but nicely revisualized and presented in a professional bound format.
Caveat: While methods are the building blocks of a service design process, owning a pile of bricks does not make you an architect or even a bricklayer. Success in doing service design certainly requires a mastery of these methods. However, you must also be able to combine them into a process that fits the context and needs of your organization and guide people through this new way of working.
This book only contains the building blocks - the methods. It doesn't detail how to assemble t ...
Learning PHP, MySQL & JavaScript, 5th Edition
Build interactive, data-driven websites with the potent combination of open source technologies and web standards, even if you have only basic HTML knowledge. In this update to this popular hands-on guide, you'll tackle dynamic web programming with the latest versions of today's core technologies: PHP, MySQL, JavaScript, CSS, HTML5, and key jQuery libraries.
Web designers will learn how to use these technologies together and pick up valuable web programming practices along the way—including how to optimize websites for mobile devices. At the end of the book, you'll put everything together to build a fully functional social networking site suitable for both desktop and mobile browsers.
Explore MySQL, from database structure to complex queries; Use the MySQLi extension, PHP's improved MySQL interface; Create dynamic PHP web pages that tailor themselves to the user; Manage cookies and sessions and maintain a high level of security; Enhance the JavaScript language with jQuery and j ...
Visualizing Streaming Data
While tools for analyzing streaming and real-time data are gaining adoption, the ability to visualize these data types has yet to catch up. Dashboards are good at conveying daily or weekly data trends at a glance, though capturing snapshots when data is transforming from moment to moment is more difficult - but not impossible.
With this practical guide, application designers, data scientists, and system administrators will explore ways to create visualizations that bring context and a sense of time to streaming text data. Author Anthony Aragues guides you through the concepts and tools you need to build visualizations for analyzing data as it arrives.
Determine your company's goals for visualizing streaming data; Identify key data sources and learn how to stream them; Learn practical methods for processing streaming data; Build a client application for interacting with events, logs, and records; Explore common components for visualizing streaming data; Consider analysis concepts ...
Designing Data-Intensive Applications
Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including relational databases, NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords?
In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain the same. With this book, software engineers and architects will learn how to apply those ideas in practice, and how to make full use of data in modern applications.
Peer under the hood of the systems you already use, and learn how to use and operate them more effectively; Make informed decisions b ...
Terraform: Up and Running
Terraform has emerged as a key player in the DevOps world for defining, launching, and managing infrastructure as code (IAC) across a variety of cloud and virtualization platforms, including AWS, Google Cloud, and Azure. This hands-on book is the fastest way to get up and running with Terraform.
Gruntwork co-founder Yevgeniy (Jim) Brikman walks you through dozens of code examples that demonstrate how to use Terraform's simple, declarative programming language to deploy and manage infrastructure with just a few commands. Whether you're a novice developer, aspiring DevOps engineer, or veteran sysadmin, this book will take you from Terraform basics to running a full tech stack capable of supporting a massive amount of traffic and a large team of developers.
Compare Terraform to other IAC tools, such as Chef, Puppet, Ansible, and Salt Stack; Use Terraform to deploy server clusters, load balancers, and databases; Learn how Terraform manages the state of your infrastructure and how it ...
Python Data Science Handbook
For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all - IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools.
Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing different types of data; and using data to build statistical or machine learning models. Quite simply, this is the must-have reference for scientific computing in Python.
IPython and Jupyter: provide computational environments for data scientists using Python; NumPy: includes the ndarray for efficient storage and manipulation of dense data arrays in Python; Pandas: features the DataFrame for efficient stora ...