【Spark】What is the difference between Input and Shuffle Read

TaiKuLaHa, 2023-11-03


When tuning Spark, a reasonable target is to keep each task's input + shuffle read volume at roughly 300-500 MB.
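The rule of thumb above can be turned into a quick partition-count estimate. The following is a minimal sketch, not a Spark API: the function name `suggest_partitions` and the 400 MiB default (the midpoint of the range) are assumptions made for illustration, and "M" is read here as MiB.

```python
import math

MIB = 1024 * 1024  # assumption: "300-500M" is interpreted as MiB


def suggest_partitions(total_bytes, target_mib=400):
    """Hypothetical helper: number of tasks so each one reads about target_mib MiB.

    total_bytes is the stage's total input + shuffle read volume in bytes.
    """
    if total_bytes <= 0:
        return 1
    return max(1, math.ceil(total_bytes / (target_mib * MIB)))


if __name__ == "__main__":
    # A stage that reads 200 GiB in total (input + shuffle read).
    print(suggest_partitions(200 * 1024 * MIB))
```

The result could then be used as a starting value for a setting such as `spark.sql.shuffle.partitions`, and adjusted after checking the actual per-task numbers in the Spark UI.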

The Spark UI is documented here: https://spark.apache.org/docs/3.0.1/web-ui.html

The relevant paragraph reads:

Input: Bytes read from storage in this stage
Output: Bytes written in storage in this stage
Shuffle read: Total shuffle bytes and records read, includes both data read locally and data read from remote executors
Shuffle write: Bytes and records written to disk in order to be read by a shuffle in a future stage
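Given these definitions, a stage's Input and Shuffle read totals from the Spark UI can be checked against the 300-500 MB per-task target. This is a rough sketch under stated assumptions: the helper names are hypothetical, the byte counts are typed in by hand from the UI, and an average hides skew, so a stage that passes may still contain oversized tasks.

```python
MIB = 1024 * 1024  # assumption: sizes are compared in MiB


def per_task_mib(input_bytes, shuffle_read_bytes, num_tasks):
    """Average input + shuffle read per task, in MiB (hypothetical helper)."""
    if num_tasks <= 0:
        raise ValueError("num_tasks must be positive")
    return (input_bytes + shuffle_read_bytes) / num_tasks / MIB


def in_target_band(input_bytes, shuffle_read_bytes, num_tasks,
                   low_mib=300, high_mib=500):
    """True if the per-task average falls inside the target range."""
    avg = per_task_mib(input_bytes, shuffle_read_bytes, num_tasks)
    return low_mib <= avg <= high_mib


if __name__ == "__main__":
    shuffle_read = 100 * 1024 * MIB  # 100 GiB of shuffle read, no storage input
    print(in_target_band(0, shuffle_read, 200))  # 512 MiB per task on average
    print(in_target_band(0, shuffle_read, 256))  # 400 MiB per task on average
```

A stage that reads from storage and from a shuffle at the same time counts both toward the per-task volume, which is why the two values are summed.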
