YARN
Yet Another Resource Negotiator
YARN Architecture
Launching an application
yarn jar application.jar DriverClass input output
List running jobs
yarn application -list
yarn top
See application logs
yarn logs -applicationId applicationId
Kill an application
yarn application -kill applicationId
Fair Scheduler
- Resources will be shared with the rest of users using the YARN fair share scheduler
- Dominant Resource Fairness: both CPU and memory considered
- Jobs should be composed of lots of short running tasks so they share resources nicely with other jobs
- Long running tasks that monopolize resources during large times can be preempted to allow other applications to run
Fair Scheduler Queues
- root.users.[username]: default queue, one per user
- interactive: Jupyter Notebooks and interactive jobs
- urgent: limited resources that can be used for urgent jobs
Web Job Browser
You can access the HUE Job Browser from the
WebUI: