Azure Batch Notes
Define the Azure compute resources to execute your applications in parallel or at scale without
manually configuring or managing infrastructure
Schedule compute-intensive tasks and dynamically add or remove compute resources based on
your requirements
Azure Batch:
Parallel workloads
1. Upload the input files and the applications that process those files to an Azure Storage account
• The application files can include scripts or applications that process the data
• The input files can be any data that the application processes, such as financial modeling data,
or video files to be transcoded
2. Create a Batch pool of compute nodes in a Batch account, a job to run the workload on the pool,
and tasks in the job
Batch pool of compute nodes - Pool nodes are the VMs that execute your tasks through a job.
Specify properties such as the number and size of the nodes, a Windows or Linux VM image,
and an application to install when the nodes join the pool. Manage the cost and size of the pool
by using low-priority VMs or automatically scaling the number of nodes as the workload
changes.
Jobs - When you add tasks to a job, the Batch service automatically schedules the tasks for
execution on the compute nodes in the pool.
Tasks - Each task uses the application that you uploaded to process the input files.
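The relationship between pool, job, and tasks can be sketched with a small in-memory model. This is a toy illustration only, not the Azure Batch SDK: the class names (Pool, Job, Task), the VM size value, and the round-robin assignment are all assumptions made for the example. The real Batch service decides task placement itself, and its policy is not necessarily round-robin.

```python
from dataclasses import dataclass, field
from itertools import cycle
from typing import Dict, List, Optional


@dataclass
class Task:
    task_id: str
    command_line: str               # what a node would run for this task
    node_id: Optional[str] = None   # set once the task has been scheduled


@dataclass
class Pool:
    pool_id: str
    vm_size: str       # example value only; any VM size name would do here
    node_count: int

    def node_ids(self) -> List[str]:
        return [f"{self.pool_id}-node-{i}" for i in range(self.node_count)]


@dataclass
class Job:
    job_id: str
    pool: Pool
    tasks: List[Task] = field(default_factory=list)

    def add_task(self, task: Task) -> None:
        self.tasks.append(task)

    def schedule(self) -> Dict[str, List[str]]:
        """Assign tasks to pool nodes round-robin (a stand-in policy).

        Returns a mapping of node id to the task ids placed on that node.
        """
        nodes = self.pool.node_ids()
        assignment: Dict[str, List[str]] = {node_id: [] for node_id in nodes}
        for task, node_id in zip(self.tasks, cycle(nodes)):
            task.node_id = node_id
            assignment[node_id].append(task.task_id)
        return assignment


# Hypothetical workload: five input files processed on a two-node pool.
pool = Pool(pool_id="demo-pool", vm_size="Standard_D2s_v3", node_count=2)
job = Job(job_id="transcode-job", pool=pool)
for i in range(5):
    job.add_task(Task(task_id=f"task-{i}", command_line=f"process input-{i}.dat"))

assignment = job.schedule()
for node_id, task_ids in assignment.items():
    print(node_id, task_ids)
```

The point of the model is the ownership chain: tasks belong to a job, the job is bound to one pool, and the pool's nodes are where the tasks actually run.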
• As the tasks run, query Batch to monitor the progress of the job and its tasks
• As the tasks complete, they can upload their result data to Azure Storage. Files can also be
retrieved directly from the file system on the compute node.
When monitoring detects that the tasks in the job have completed, the client application or service can
download the output data for further processing.
Billing: