更新时间:2021-06-10 18:59:37
coverpage
Title Page
About Packt
Why subscribe?
Packt.com
Contributors
About the authors
About the reviewers
Packt is searching for authors like you
Preface
Who this book is for
What this book covers
To get the most out of this book
Download the example code files
Download the color images
Conventions used
Get in touch
Reviews
Introduction to Big Data and Data Management
The concept of big data
Interesting insights regarding big data
Characteristics of big data
Sources and types of big data
Challenges of big data
Introduction to big data modeling
Uses of models
Introduction to managing big data
Importance and implications of big data modeling and management
Benefits of big data management
Challenges in big data management
Setting up big data modeling platforms
Getting started on Windows
Getting started on macOS
Summary
Further reading
Data Modeling and Management Platforms
Big data management
Data ingestion
Data storage
Data quality
Data operations
Data scalability and security
Big data management services
Data cleansing
Data integration
Big data management vendors
Big data storage and data models
Storage models
Block-based storage
File-based storage
Object-based storage
Data models
Relational stores (SQLs)
Scalable relational systems
Database as a Service (DaaS)
NoSQL stores
Document stores
Key-value stores
Extensible-record stores
Big data programming models
MapReduce
MapReduce functionality
Hadoop
Features of Hadoop frameworks
Yet Another Resource Negotiator
Functional programming
Spark
Reasons to choose Apache Spark
Flink
Advantages of Flink
SQL data models
Hive Query Langauge (HQL)
Cassandra Query Language (CQL)
Spark SQL
Apache Drill
Getting started with Python and R
Python on macOS
Python on Windows
R on macOS
R on Windows
Defining Data Models
Data model structures
Structured data
Unstructured data
Sources of unstructured data
Comparing structured and unstructured data
Subsetting
Union
Projection
Join
Data constraints
Types of constraints
Value constraints
Uniqueness constraints
Cardinality constraints
Type constraints
Domain constraints