Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
Health & Fitness
Technology
About Us
Contact Us
Copyright
© 2024 PodJoint
Loading...
0:00 / 0:00
Podjoint Logo
US
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts115/v4/88/26/a6/8826a66b-63d8-fcc1-6859-7af44fd24ce3/mza_143065371144946236.jpeg/600x600bb.jpg
Big Data Systems (WT 2020/21) - tele-TASK
Prof. Dr. Tilmann Rabl
47 episodes
18 hours ago
The amount of data that can be generated and stored in academic and industrial projects and applications is increasing rapidly. Big data analytics technologies have established themselves as a solution for big data challenges to the scalability problems of traditional database systems. The vast amounts of new data that is collected, however, usually is not as easily analyzed as curated, structured data in a data warehouse is. Typically, these data are noisy, of varying format and velocity, and need to be analyzed with techniques from statistics and machine learning rather than pure SQL-like aggregations and drill-downs. Moreover, the results of the analyses frequently are models that are used for decision making and prediction. The complete process of big data analysis is described as a pipeline, which includes data recording, cleaning, integration, modeling, and interpretation. In this lecture, we will discuss big data systems, i.e., infrastructures that are used to handle all steps in typical big data processing pipelines. We will learn about data center infrastructure and scale-out software systems. The software discussed will cover the full big data stack, i.e., distributed file systems, Map Reduce, key value stores, stream processing, graph processing, ML systems.
Show more...
Courses
Education
RSS
All content for Big Data Systems (WT 2020/21) - tele-TASK is the property of Prof. Dr. Tilmann Rabl and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
The amount of data that can be generated and stored in academic and industrial projects and applications is increasing rapidly. Big data analytics technologies have established themselves as a solution for big data challenges to the scalability problems of traditional database systems. The vast amounts of new data that is collected, however, usually is not as easily analyzed as curated, structured data in a data warehouse is. Typically, these data are noisy, of varying format and velocity, and need to be analyzed with techniques from statistics and machine learning rather than pure SQL-like aggregations and drill-downs. Moreover, the results of the analyses frequently are models that are used for decision making and prediction. The complete process of big data analysis is described as a pipeline, which includes data recording, cleaning, integration, modeling, and interpretation. In this lecture, we will discuss big data systems, i.e., infrastructures that are used to handle all steps in typical big data processing pipelines. We will learn about data center infrastructure and scale-out software systems. The software discussed will cover the full big data stack, i.e., distributed file systems, Map Reduce, key value stores, stream processing, graph processing, ML systems.
Show more...
Courses
Education
Episodes (20/47)
Big Data Systems (WT 2020/21) - tele-TASK
Fair Benchmarking
4 years ago
12 minutes

Big Data Systems (WT 2020/21) - tele-TASK
BigBench / TPCx-BB - Big Data Benchmark
4 years ago
13 minutes 48 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Sort Benchmarks
4 years ago
16 minutes 25 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Benchmarks
4 years ago
9 minutes 28 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Some Statistics
4 years ago
47 minutes 45 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Measurements & Metrics
4 years ago
14 minutes 22 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Back of the Envelope Calculation
4 years ago
12 minutes 6 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Introduction
4 years ago
8 minutes 49 seconds

Big Data Systems (WT 2020/21) - tele-TASK
A Brief Introduction to RDMAs
4 years ago
22 minutes

Big Data Systems (WT 2020/21) - tele-TASK
Intro to Persistent Memory II
4 years ago
20 minutes 25 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Intro to Persistent Memory I
4 years ago
10 minutes 23 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Data Processing on GPUs
4 years ago
37 minutes 26 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Modern Hardware I
4 years ago
19 minutes 31 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Data-Parallel Parameter Server
4 years ago
22 minutes 5 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Execution Strategies
4 years ago
44 minutes 8 seconds

Big Data Systems (WT 2020/21) - tele-TASK
SystemML
4 years ago
28 minutes 52 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Language Abstraction & System Architectures
4 years ago
18 minutes 46 seconds

Big Data Systems (WT 2020/21) - tele-TASK
ML System Stack
4 years ago
28 minutes 2 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Machine Learning Models
4 years ago
15 minutes 58 seconds

Big Data Systems (WT 2020/21) - tele-TASK
Introduction
4 years ago
18 minutes 29 seconds

Big Data Systems (WT 2020/21) - tele-TASK
The amount of data that can be generated and stored in academic and industrial projects and applications is increasing rapidly. Big data analytics technologies have established themselves as a solution for big data challenges to the scalability problems of traditional database systems. The vast amounts of new data that is collected, however, usually is not as easily analyzed as curated, structured data in a data warehouse is. Typically, these data are noisy, of varying format and velocity, and need to be analyzed with techniques from statistics and machine learning rather than pure SQL-like aggregations and drill-downs. Moreover, the results of the analyses frequently are models that are used for decision making and prediction. The complete process of big data analysis is described as a pipeline, which includes data recording, cleaning, integration, modeling, and interpretation. In this lecture, we will discuss big data systems, i.e., infrastructures that are used to handle all steps in typical big data processing pipelines. We will learn about data center infrastructure and scale-out software systems. The software discussed will cover the full big data stack, i.e., distributed file systems, Map Reduce, key value stores, stream processing, graph processing, ML systems.