You are designing a Dataflow pipeline for a batch processing job. You want to mitigate multiple zonal failures at job submission time. What should you do?
A is correct because when you specify only a region rather than a specific zone, Dataflow automatically selects the best available zone in that region at job submission time, which adds a level of fault tolerance against zonal outages.
B is not correct because retrying the job submission (without explicitly specifying a zone) can help mitigate zonal failures, but it is not optimal: it adds delay and overhead.
C is not correct because, although this approach would mitigate some zonal failures, duplicated pipelines add cost and resource-management overhead. Also, spinning up pipelines in two different zones does not fully mitigate failure: both of those zones could be down while other zones in the region are still available.
D is not correct because setting the staging location to a regional bucket does not address the risk of worker failures during zonal outages.
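As an illustration of the approach described for option A, here is a minimal sketch of how the job options might be assembled so that only a region is set and no worker zone is pinned. The helper `build_dataflow_args` and the project, bucket, and region values are hypothetical; the flag names (`--runner`, `--project`, `--region`, `--temp_location`, `--worker_zone`) are assumed to match the Apache Beam Python SDK's Dataflow pipeline options.

```python
from typing import List, Optional


def build_dataflow_args(
    project: str,
    region: str,
    temp_location: str,
    worker_zone: Optional[str] = None,
) -> List[str]:
    """Build command-line style options for a Dataflow batch job.

    Hypothetical helper for illustration only. Leaving worker_zone as None
    means no zone is pinned, so the service is free to pick a zone in the
    region when the job is submitted.
    """
    args = [
        "--runner=DataflowRunner",
        f"--project={project}",
        f"--region={region}",
        f"--temp_location={temp_location}",
    ]
    # Pinning a zone ties the job to that zone's availability,
    # so the flag is only added when explicitly requested.
    if worker_zone is not None:
        args.append(f"--worker_zone={worker_zone}")
    return args


# Placeholder values (assumptions, not taken from the question).
regional_args = build_dataflow_args(
    project="my-project",
    region="us-central1",
    temp_location="gs://my-bucket/tmp",
)
print(regional_args)
```

In a real pipeline, a list like `regional_args` would typically be passed to Apache Beam's `PipelineOptions`; that step is omitted here so the sketch runs with the standard library alone.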