Spring Sale 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exams65

ExamsBrite Dumps

Google Cloud Certified - Professional Cloud DevOps Engineer Exam Question and Answers

Google Cloud Certified - Professional Cloud DevOps Engineer Exam

Last Update Feb 28, 2026
Total Questions : 201

We are offering FREE Professional-Cloud-DevOps-Engineer Google exam questions. All you do is to just go and sign up. Give your details, prepare Professional-Cloud-DevOps-Engineer free exam questions and then go for complete pool of Google Cloud Certified - Professional Cloud DevOps Engineer Exam test questions that will help you more.

Professional-Cloud-DevOps-Engineer pdf

Professional-Cloud-DevOps-Engineer PDF

$36.75  $104.99
Professional-Cloud-DevOps-Engineer Engine

Professional-Cloud-DevOps-Engineer Testing Engine

$43.75  $124.99
Professional-Cloud-DevOps-Engineer PDF + Engine

Professional-Cloud-DevOps-Engineer PDF + Testing Engine

$57.75  $164.99
Questions 1

You are designing a continuous delivery (CD) strategy for a new serverless application. The application is packaged as a container image, stored in Artifact Registry, and deployed to Cloud Run. Your design requires a staging environment, a fully-managed Google Cloud service, mandatory manual approval for production deployments, and a phased rollout to production. Your solution should minimize administrative overhead. What should you do?

Options:

A.  

Use Cloud Deploy to define a single delivery pipeline that promotes a release between a staging target and a production target. Configure the production target to require approval and to automatically execute a phased rollout that incrementally shifts traffic.

B.  

Use a Cloud Build trigger to initiate a GitOps workflow. Configure the trigger to update a manifest in a Git repository, which a controller on a GKE Autopilot cluster then synchronizes to manage a phased traffic rollout to the new revision.

C.  

Use Cloud Build to create a multi-stage pipeline. Configure the trigger to require approval before starting the build. Use the deploy command with the --traffic flag to incrementally shift traffic to the new revision in production.

D.  

Define two separate Cloud Deploy pipelines. Configure the first pipeline to deploy to staging, and configure the second pipeline to trigger and execute a phased, canary rollout to the production Cloud Run service.

Discussion 0
Questions 2

You use a multiple step Cloud Build pipeline to build and deploy your application to Google Kubernetes Engine (GKE). You want to integrate with a third-party monitoring platform by performing a HTTP POST of the build information to a webhook. You want to minimize the development effort. What should you do?

Options:

A.  

Add logic to each Cloud Build step to HTTP POST the build information to a webhook.

B.  

Add a new step at the end of the pipeline in Cloud Build to HTTP POST the build information to a webhook.

C.  

Use Stackdriver Logging to create a logs-based metric from the Cloud Buitd logs. Create an Alert with a Webhook notification type.

D.  

Create a Cloud Pub/Sub push subscription to the Cloud Build cloud-builds PubSub topic to HTTP POST the build information to a webhook.

Discussion 0
Questions 3

You are responsible for creating and modifying the Terraform templates that define your Infrastructure. Because two new engineers will also be working on the same code, you need to define a process and adopt a tool that will prevent you from overwriting each other's code. You also want to ensure that you capture all updates in the latest version. What should you do?

Options:

A.  

• Store your code in a Git-based version control system.• Establish a process that allows developers to merge their own changes at the end of each day.• Package and upload code lo a versioned Cloud Storage bucket as the latest master version.

B.  

• Store your code in a Git-based version control system.• Establish a process that includes code reviews by peers and unit testing to ensure integrity and functionality before integration of code.• Establish a process where the fully integrated code in the repository becomes the latest master version.

C.  

• Store your code as text files in Google Drive in a defined folder structure that organizes the files.• At the end of each day. confirm that all changes have been captured in the files within the folder structure.• Rename the folder structure with a predefined naming convention that increments the version.

D.  

• Store your code as text files in Google Drive in a defined folder structure that organizes the files.• At the end of each day, confirm that all changes have been captured in the files within the folder structure and create a new .zip archive with a predefined naming convention.• Upload the .zip archive to a versioned Cloud Storage bucket and accept it as the latest version.

Discussion 0
Questions 4

You use Terraform to manage an application deployed to a Google Cloud environment The application runs on instances deployed by a managed instance group The Terraform code is deployed by using aCI/CD pipeline When you change the machine type on the instance template used by the managed instance group, the pipeline fails at the terraform apply stage with the following error message

You need to update the instance template and minimize disruption to the application and the number of pipeline runs What should you do?

Options:

A.  

Delete the managed instance group and recreate it after updating the instance template

B.  

Add a new instance template update the managed instance group to use the new instance template and delete the old instance template

C.  

Remove the managed instance group from the Terraform state file update the instance template and reimport the managed instance group.

D.  

Set the create_bef ore_destroy meta-argument to true in the lifecycle block on the instance template

Discussion 0
Questions 5

You support a trading application written in Python and hosted on App Engine flexible environment. You want to customize the error information being sent to Stackdriver Error Reporting. What should you do?

Options:

A.  

Install the Stackdriver Error Reporting library for Python, and then run your code on a Compute Engine VM.

B.  

Install the Stackdriver Error Reporting library for Python, and then run your code on Google Kubernetes Engine.

C.  

Install the Stackdriver Error Reporting library for Python, and then run your code on App Engine flexible environment.

D.  

Use the Stackdriver Error Reporting API to write errors from your application to ReportedErrorEvent, and then generate log entries with properly formatted error messages in Stackdriver Logging.

Discussion 0
Questions 6

You need to introduce postmortems into your organization. You want to ensure that the postmortem process is well received. What should you do?

Choose 2 answers

Options:

A.  

Create a designated team that is responsible for conducting all postmortems.

B.  

Encourage new employees to conduct postmortems to learn through practice.

C.  

Ensure that writing effective postmortems is a rewarded and celebrated practice.

D.  

Encourage your senior leadership to acknowledge and participate in postmortems.

E.  

Provide your organization with a forum to critique previous postmortems.

Discussion 0
Questions 7

Your application runs on Google Cloud Platform (GCP). You need to implement Jenkins for deploying application releases to GCP. You want to streamline the release process, lower operational toil, and keep user data secure. What should you do?

Options:

A.  

Implement Jenkins on local workstations.

B.  

Implement Jenkins on Kubernetes on-premises

C.  

Implement Jenkins on Google Cloud Functions.

D.  

Implement Jenkins on Compute Engine virtual machines.

Discussion 0
Questions 8

Your team of Infrastructure DevOps Engineers is growing, and you are starting to use Terraform to manage infrastructure. You need a way to implement code versioning and to share code with other team members. What should you do?

Options:

A.  

Store the Terraform code in a version-control system. Establish procedures for pushing new versions and merging with the master.

B.  

Store the Terraform code in a network shared folder with child folders for each version release. Ensure that everyone works on different files.

C.  

Store the Terraform code in a Cloud Storage bucket using object versioning. Give access to the bucket to every team member so they can download the files.

D.  

Store the Terraform code in a shared Google Drive folder so it syncs automatically to every team member’s computer. Organize files with a naming convention that identifies each new version.

Discussion 0
Questions 9

Your company follows Site Reliability Engineering principles. You are writing a postmortem for an incident, triggered by a software change, that severely affected users. You want to prevent severe incidents from happening in the future. What should you do?

Options:

A.  

Identify engineers responsible for the incident and escalate to their senior management.

B.  

Ensure that test cases that catch errors of this type are run successfully before new software releases.

C.  

Follow up with the employees who reviewed the changes and prescribe practices they should follow in the future.

D.  

Design a policy that will require on-call teams to immediately call engineers and management to discuss a plan of action if an incident occurs.

Discussion 0
Questions 10

You are creating a CI/CD pipeline in Cloud Build to build an application container image The application code is stored in GitHub Your company requires thai production image builds are only run against the main branch and that the change control team approves all pushes to the main branch You want the image build to be as automated as possible What should you do?

Choose 2 answers

Options:

A.  

Create a trigger on the Cloud Build job Set the repository event setting to Pull request'

B.  

Add the owners file to the Included files filter on the trigger

C.  

Create a trigger on the Cloud Build job Set the repository event setting to Push to a branch

D.  

Configure a branch protection rule for the main branch on the repository

E.  

Enable the Approval option on the trigger

Discussion 0
Questions 11

You have deployed a fleet Of Compute Engine instances in Google Cloud. You need to ensure that monitoring metrics and logs for the instances are visible in Cloud Logging and Cloud Monitoring by your company's operations and cyber

security teams. You need to grant the required roles for the Compute Engine service account by using Identity and Access Management (IAM) while following the principle of least privilege. What should you do?

Options:

A.  

Grant the logging.editor and monitoring.metricwriter roles to the Compute Engine service accounts.

B.  

Grant the Logging. admin and monitoring . editor roles to the Compute Engine service accounts.

C.  

Grant the logging. logwriter and monitoring. editor roles to the Compute Engine service accounts.

D.  

Grant the logging. logWriter and monitoring. metricWriter roles to the Compute Engine service accounts.

Discussion 0
Questions 12

You are running an application on Compute Engine and collecting logs through Stackdriver. You discover that some personally identifiable information (Pll) is leaking into certain log entry fields. All Pll entries begin with the text userinfo. You want to capture these log entries in a secure location for later review and prevent them from leaking to Stackdriver Logging. What should you do?

Options:

A.  

Create a basic log filter matching userinfo, and then configure a log export in the Stackdriver console with Cloud Storage as a sink.

B.  

Use a Fluentd filter plugin with the Stackdriver Agent to remove log entries containing userinfo, and then copy the entries to a Cloud Storage bucket.

C.  

Create an advanced log filter matching userinfo, configure a log export in the Stackdriver console with Cloud Storage as a sink, and then configure a tog exclusion with userinfo as a filter.

D.  

Use a Fluentd filter plugin with the Stackdriver Agent to remove log entries containing userinfo, create an advanced log filter matching userinfo, and then configure a log export in the Stackdriver console with Cloud Storage as a sink.

Discussion 0
Questions 13

Your team is running microservices in Google Kubernetes Engine (GKE) You want to detect consumption of an error budget to protect customers and define release policies What should you do?

Options:

A.  

Create SLIs from metrics Enable Alert Policies if the services do not pass

B.  

Use the metrics from Anthos Service Mesh to measure the health of the microservices

C.  

Create a SLO Create an Alert Policy on select_slo_bum_rate

D.  

Create a SLO and configure uptime checks for your services Enable Alert Policies if the services do not pass

Discussion 0
Questions 14

You are implementing a CI'CD pipeline for your application in your company s multi-cloud environment Your application is deployed by using custom Compute Engine images and the equivalent in other cloud providers You need to implement a solution that will enable you to build and deploy the images to your current environment and is adaptable to future changes Which solution stack should you use'?

Options:

A.  

Cloud Build with Packer

B.  

Cloud Build with Google Cloud Deploy

C.  

Google Kubernetes Engine with Google Cloud Deploy

D.  

Cloud Build with kpt

Discussion 0
Questions 15

You are writing a postmortem for an incident that severely affected users. You want to prevent similar incidents in the future. Which two of the following sections should you include in the postmortem? (Choose two.)

Options:

A.  

An explanation of the root cause of the incident

B.  

A list of employees responsible for causing the incident

C.  

A list of action items to prevent a recurrence of the incident

D.  

Your opinion of the incident’s severity compared to past incidents

E.  

Copies of the design documents for all the services impacted by the incident

Discussion 0
Questions 16

You are using Stackdriver to monitor applications hosted on Google Cloud Platform (GCP). You recently deployed a new application, but its logs are not appearing on the Stackdriver dashboard.

You need to troubleshoot the issue. What should you do?

Options:

A.  

Confirm that the Stackdriver agent has been installed in the hosting virtual machine.

B.  

Confirm that your account has the proper permissions to use the Stackdriver dashboard.

C.  

Confirm that port 25 has been opened in the firewall to allow messages through to Stackdriver.

D.  

Confirm that the application is using the required client library and the service account key has proper permissions.

Discussion 0
Questions 17

Your company experiences bugs, outages, and slowness in its production systems. Developers use the production environment for new feature development and bug fixes. Configuration and experiments are done in the production environment, causing outages for users. Testers use the production environmentfor load testing, which often slows the production systems. You need to redesign the environment to reduce the number of bugs and outages in production and to enable testers to load test new features. What should you do?

Options:

A.  

Create an automated testing script in production to detect failures as soon as they occur.

B.  

Create a development environment with smaller server capacity and give access only to developers and testers.

C.  

Secure the production environment to ensure that developers can't change it and set up one controlled update per year.

D.  

Create a development environment for writing code and a test environment for configurations, experiments, and load testing.

Discussion 0
Questions 18

You work for a global organization and run a service with an availability target of 99% with limited engineering resources. For the current calendar month you noticed that the service has 99 5% availability. You must ensure that your service meets the defined availability goals and can react to business changes including the upcoming launch of new features You also need to reduce technical debt while minimizing operational costs You want to follow Google-recommended practices What should you do?

Options:

A.  

Add N+1 redundancy to your service by adding additional compute resources to the service

B.  

Identify, measure and eliminate toil by automating repetitive tasks

C.  

Define an error budget for your service level availability and minimize the remaining error budget

D.  

Allocate available engineers to the feature backlog while you ensure that the sen/ice remains within the availability target

Discussion 0
Questions 19

You are creating Cloud Logging sinks to export log entries from Cloud Logging to BigQuery for future analysis Your organization has a Google Cloud folder named Dev that contains development projects and a folder named Prod that contains production projects Log entries for development projects must be exported to dev_dataset. and log entries for production projects must be exported to prod_datasetYou need to minimize the number of log sinks created and you want to ensure that the log sinks apply to future projects What should you do?

Options:

A.  

Create a single aggregated log sink at the organization level.

B.  

Create a log sink in each project

C.  

Create two aggregated log sinks at the organization level, and filter by project ID

D.  

Create an aggregated Iog sink in the Dev and Prod folders

Discussion 0
Questions 20

You support a production service that runs on a single Compute Engine instance. You regularly need to spend time on recreating the service by deleting the crashing instance and creating a new instance based on the relevant image. You want to reduce the time spent performing manual operations while following Site Reliability Engineering principles. What should you do?

Options:

A.  

File a bug with the development team so they can find the root cause of the crashing instance.

B.  

Create a Managed Instance Group with a single instance and use health checks to determine the system status.

C.  

Add a Load Balancer in front of the Compute Engine instance and use health checks to determine the system status.

D.  

Create a Stackdriver Monitoring dashboard with SMS alerts to be able to start recreating the crashed instance promptly after it has crashed.

Discussion 0
Questions 21

You are performing a semiannual capacity planning exercise for your flagship service. You expect a service user growth rate of 10% month-over-month over the next six months. Your service is fully containerized and runs on Google Cloud Platform (GCP). using a Google Kubernetes Engine (GKE) Standard regional cluster on three zones with cluster autoscaler enabled. You currently consume about 30% of your total deployed CPU capacity, and you require resilience against the failure of a zone. You want to ensure that your users experience minimal negative impact as a result of this growth or as a result of zone failure, while avoiding unnecessary costs. How should you prepare to handle the predicted growth?

Options:

A.  

Verity the maximum node pool size, enable a horizontal pod autoscaler, and then perform a load test to verity your expected resource needs.

B.  

Because you are deployed on GKE and are using a cluster autoscaler. your GKE cluster will scale automatically, regardless of growth rate.

C.  

Because you are at only 30% utilization, you have significant headroom and you won't need to add any additional capacity for this rate of growth.

D.  

Proactively add 60% more node capacity to account for six months of 10% growth rate, and then perform a load test to make sure you have enough capacity.

Discussion 0
Questions 22

You support a Node.js application running on Google Kubernetes Engine (GKE) in production. The application makes several HTTP requests to dependent applications. You want to anticipate which dependent applications might cause performance issues. What should you do?

Options:

A.  

Instrument all applications with Stackdriver Profiler.

B.  

Instrument all applications with Stackdriver Trace and review inter-service HTTP requests.

C.  

Use Stackdriver Debugger to review the execution of logic within each application to instrument all applications.

D.  

Modify the Node.js application to log HTTP request and response times to dependent applications. Use Stackdriver Logging to find dependent applications that are performing poorly.

Discussion 0
Questions 23

Your organization is running multiple Google Kubernetes Engine (GKE) clusters in a project. You need to design a highly-available solution to collect and query both domain-specific workload metrics and GKE default metrics across all clusters, while minimizing operational overhead. What should you do?

Options:

A.  

Use Prometheus Operator to install Prometheus in every cluster and scrape the metrics. Ensure that a Thanos sidecar is enabled on every Prometheus instance. Configure Thanos in the central cluster. Query the central Thanos instance.

B.  

Use Prometheus Operator to install Prometheus in every cluster and scrape the metrics. Configure remote-write to one central Prometheus. Query the central Prometheus instance.

C.  

Enable managed collection on every GKE cluster. Query the metrics in Cloud Monitoring.

D.  

Enable managed collection on every GKE cluster. Query the metrics in BigQuery.

Discussion 0
Questions 24

You deploy a new release of an internal application during a weekend maintenance window when there is minimal user traffic. After the window ends, you learn that one of the new features isn't working as expected in the production environment. After an extended outage, you roll back the new release and deploy a fix. You want to modify your release process to reduce the mean time to recovery so you can avoid extended outages in the future. What should you do?

Choose 2 answers

Options:

A.  

Before merging new code, require 2 different peers to review the code changes.

B.  

Adopt the blue/green deployment strategy when releasing new code via a CD server.

C.  

Integrate a code linting tool to validate coding standards before any code is accepted into the repository.

D.  

Require developers to run automated integration tests on their local development environments before release.

E.  

Configure a CI server.Add a suite of unit tests to your code and have your CI server run them on commit and verify any changes.

Discussion 0
Questions 25

You are running a real-time gaming application on Compute Engine that has a production and testing environment. Each environment has their own Virtual Private Cloud (VPC) network. The application frontend and backend servers are located on different subnets in the environment's VPC. You suspect there is a malicious process communicating intermittently in your production frontend servers. You want to ensure that network traffic is captured for analysis. What should you do?

Options:

A.  

Enable VPC Flow Logs on the production VPC network frontend and backend subnets only with a sample volume scale of 0.5.

B.  

Enable VPC Flow Logs on the production VPC network frontend and backend subnets only with a sample volume scale of 1.0.

C.  

Enable VPC Flow Logs on the testing and production VPC network frontend and backend subnets with a volume scale of 0.5. Apply changes intesting before production.

D.  

Enable VPC Flow Logs on the testing and production VPC network frontend and backend subnets with a volume scale of 1.0. Apply changes in testing before production.

Discussion 0
Questions 26

You are designing a new Google Cloud organization for a client. Your client is concerned with the risks associated with long-lived credentials created in Google Cloud. You need to design a solution to completely eliminate the risks associated with the use of JSON service account keys while minimizing operational overhead. What should you do?

Options:

A.  

Use custom versions of predefined roles to exclude all iam.serviceAccountKeys. * service account role permissions.

B.  

Apply the constraints/iam.disableserviceAccountKeycreation constraint to the organization.

C.  

Apply the constraints/iam. disableServiceAccountKeyUp10ad constraint to the organization.

D.  

Grant the roles/ iam.serviceAccountKeyAdmin IAM role to organization administrators only.

Discussion 0
Questions 27

You are ready to deploy a new feature of a web-based application to production. You want to use Google Kubernetes Engine (GKE) to perform a phased rollout to half of the web server pods.

What should you do?

Options:

A.  

Use a partitioned rolling update.

B.  

Use Node taints with NoExecute.

C.  

Use a replica set in the deployment specification.

D.  

Use a stateful set with parallel pod management policy.

Discussion 0
Questions 28

You support a high-traffic web application and want to ensure that the home page loads in a timely manner. As a first step, you decide to implement a Service Level Indicator (SLI) to represent home page request latency with an acceptable page load time set to 100 ms. What is the Google-recommended way of calculating this SLI?

Options:

A.  

Buckelize Ihe request latencies into ranges, and then compute the percentile at 100 ms.

B.  

Bucketize the request latencies into ranges, and then compute the median and 90th percentiles.

C.  

Count the number of home page requests that load in under 100 ms, and then divide by the total number of home page requests.

D.  

Count the number of home page requests that load in under 100 ms. and then divide by the total number of all web application requests.

Discussion 0
Questions 29

You are creating and assigning action items in a postmodern for an outage. The outage is over, but you need to address the root causes. You want to ensure that your team handles the action items quickly and efficiently. How should you assign owners and collaborators to action items?

Options:

A.  

Assign one owner for each action item and any necessary collaborators.

B.  

Assign multiple owners for each item to guarantee that the team addresses items quickly

C.  

Assign collaborators but no individual owners to the items to keep the postmortem blameless.

D.  

Assign the team lead as the owner for all action items because they are in charge of the SRE team.

Discussion 0
Questions 30

You are designing a new multi-tenant Google Kubernetes Engine (GKE) cluster for a customer. Your customer is concerned with the risks associated with long-lived credentials use. The customer requires that each GKE workload has the minimum Identity and Access Management (IAM) permissions set following the principle of least privilege (PoLP). You need to design an IAM impersonation solution while following Google-recommended practices. What should you do?

Options:

A.  

Create a Google service account.

Create a Kubernetes service account in a Workload Identity-enabled cluster.

Link the Google service account with the Kubernetes service account by using the roles/iam.workloadIdentityUser role and iam.gke.io/gcp-service-account annotation.

Map the Kubernetes service account to the workload.

Repeat for each workload.

B.  

Create a Google service account.

Create a node pool, and set the Google service account as the default identity.

Ensure that workloads can only run on the designated node pool by using node selectors, taints, and tolerations.

Repeat for each workload.

C.  

Create a Google service account.

Create a service account key for the Google service account.

Create a Kubernetes secret with a service account key.

Ensure that workload mounts the secret and set the GOOGLE_APPLICATION_CREDENTIALS environment variable to point at the mount path.

Repeat for each workload.

D.  

Create a Google service account.

Create a node pool without taints, and set the Google service account as the default identity.

Grant IAM permissions to the Google service account.

Discussion 0
Questions 31

Your applications performance in Google Cloud has degraded since the last release You suspect that downstream dependencies might be causing some requests to take longer to complete You need to investigate the issue with your application to determine the cause What should you do?

Options:

A.  

Configure Error Reporting in your application

B.  

Configure Google Cloud Managed Service for Prometheus in your application

C.  

Configure Cloud Profiler in your application

D.  

Configure Cloud Trace in your application

Discussion 0
Questions 32

Your company is developing applications that are deployed on Google Kubernetes Engine (GKE). Each team manages a different application. You need to create the development and production environments for each team, while minimizing costs. Different teams should not be able to access other teams’ environments. What should you do?

Options:

A.  

Create one GCP Project per team. In each project, create a cluster for Development and one for Production. Grant the teams IAM access to their respective clusters.

B.  

Create one GCP Project per team. In each project, create a cluster with a Kubernetes namespace for Development and one for Production. Grant the teams IAM access to their respective clusters.

C.  

Create a Development and a Production GKE cluster in separate projects. In each cluster, create a Kubernetes namespace per team, and then configure Identity Aware Proxy so that each team can only access its own namespace.

D.  

Create a Development and a Production GKE cluster in separate projects. In each cluster, create a Kubernetes namespace per team, and then configure Kubernetes Role-based access control (RBAC) so that each team can only access its own namespace.

Discussion 0
Questions 33

You support an application running on App Engine. The application is used globally and accessed from various device types. You want to know the number of connections. You are using Stackdriver Monitoring for App Engine. What metric should you use?

Options:

A.  

flex/connections/current

B.  

tcp_ssl_proxy/new_connections

C.  

tcp_ssl_proxy/open_connections

D.  

flex/instance/connections/current

Discussion 0
Questions 34

You are running an application in a virtual machine (VM) using a custom Debian image. The image has the Stackdriver Logging agent installed. The VM has the cloud-platform scope. The application is logging information via syslog. You want to use Stackdriver Logging in the Google Cloud Platform Console to visualize the logs. You notice that syslog is not showing up in the "All logs" dropdown list of the Logs Viewer. What is the first thing you should do?

Options:

A.  

Look for the agent's test log entry in the Logs Viewer.

B.  

Install the most recent version of the Stackdriver agent.

C.  

Verify the VM service account access scope includes the monitoring.write scope.

D.  

SSH to the VM and execute the following commands on your VM: ps ax I grep fluentd

Discussion 0
Questions 35

You recently noticed that one Of your services has exceeded the error budget for the current rolling window period. Your company's product team is about to launch a new feature. You want to follow Site Reliability Engineering (SRE) practices.

What should you do?

Options:

A.  

Notify the team that their error budget is used up. Negotiate with the team for a launch freeze or tolerate a slightly worse user experience.

B.  

Look through other metrics related to the product and find SLOs with remaining error budget. Reallocate the error budgets and allow the feature launch.

C.  

Escalate the situation and request additional error budget.

D.  

Notify the team about the lack of error budget and ensure that all their tests are successful so the launch will not further risk the error budget.

Discussion 0
Questions 36

You support a multi-region web service running on Google Kubernetes Engine (GKE) behind a Global HTTP'S Cloud Load Balancer (CLB). For legacy reasons, user requests first go through a third-party Content Delivery Network (CDN). which then routes traffic to the CLB. You have already implemented an availability Service Level Indicator (SLI) at the CLB level. However, you want to increase coverage in case of a potential load balancer misconfiguration. CDN failure, or other global networking catastrophe. Where should you measure this new SLI?

Choose 2 answers

Options:

A.  

Your application servers' logs

B.  

Instrumentation coded directly in the client

C.  

Metrics exported from the application servers

D.  

GKE health checks for your application servers

E.  

A synthetic client that periodically sends simulated user requests

Discussion 0
Questions 37

Your team has an application built by using a Dockerfile. The build is executed from Cloud Build, and the resulting artifacts are stored in Artifact Registry. Your team is reporting that builds are slow. You need to increase build speed, while following Google-recommended practices. What should you do?

Options:

A.  

Use the --cache-from parameter, and point to Artifact Registry. Add the most frequently modified files to the later stages of the build process.

B.  

Use the --cache-from parameter, and point to Artifact Registry. Add the most frequently modified files to the earlier stages of the build process.

C.  

Cache the container layers of the build process to Cloud Storage. Add the most frequently modified files to the earlier stages of the build process.

D.  

Cache the container layers of the build process to Cloud Storage. Add the most frequently modified files to the later stages of the build process.

Discussion 0
Questions 38

You support a user-facing web application When analyzing the application's error budget over the previous six months you notice that the application never consumed more than 5% of its error budget You hold a SLO review with business stakeholders and confirm that the SLO is set appropriately You want your application's reliability to more closely reflect its SLO What steps can you take to further that goal while balancing velocity, reliability, and business needs?

Choose 2 answers

Options:

A.  

Add more serving capacity to all of your application's zones

B.  

Implement and measure all other available SLIs for the application

C.  

Announce planned downtime to consume more error budget and ensure that users are not depending on a tighter SLO

D.  

Have more frequent or potentially risky application releases

E.  

Tighten the SLO to match the application's observed reliability

Discussion 0
Questions 39

You need to enforce several constraint templates across your Google Kubernetes Engine (GKE) clusters. The constraints include policy parameters, such as restricting the Kubernetes API. You must ensure that the policy parameters are stored in a GitHub repository and automatically applied when changes occur. What should you do?  

Options:

A.  

Set up a GitHub action to trigger Cloud Build when there is a parameter change. In Cloud Build, run a gcloud CLI command to apply the change.

B.  

When there is a change in GitHub, use a webhook to send a request to Cloud Service Mesh, and apply the change.

C.  

Configure Config Sync with the GitHub repository. When there is a change in the repository, use Config Sync to apply the change.

D.  

Configure Config Connector with the GitHub repository. When there is a change in the repository, use Config Connector to apply the change.

Discussion 0
Questions 40

You are running a web application deployed to a Compute Engine managed instance group Ops Agent is installed on all instances You recently noticed suspicious activity from a specific IP address You need to configure Cloud Monitoring to view the number of requests from that specific IP address with minimal operational overhead. What should you do?

Options:

A.  

Configure the Ops Agent with a logging receiver Create a logs-based metric

B.  

Create a script to scrape the web server log Export the IP address request metrics to the Cloud Monitoring API

C.  

Update the application to export the IP address request metrics to the Cloud Monitoring API

D.  

Configure the Ops Agent with a metrics receiver

Discussion 0
Questions 41

You use Cloud Build to build your application. You want to reduce the build time while minimizing cost and development effort. What should you do?

Options:

A.  

Use Cloud Storage to cache intermediate artifacts.

B.  

Run multiple Jenkins agents to parallelize the build.

C.  

Use multiple smaller build steps to minimize execution time.

D.  

Use larger Cloud Build virtual machines (VMs) by using the machine-type option.

Discussion 0
Questions 42

You need to reduce the cost of virtual machines (VM| for your organization. After reviewing different options, you decide to leverage preemptible VM instances. Which application is suitable for preemptible VMs?

Options:

A.  

A scalable in-memory caching system

B.  

The organization's public-facing website

C.  

A distributed, eventually consistent NoSQL database cluster with sufficient quorum

D.  

A GPU-accelerated video rendering platform that retrieves and stores videos in a storage bucket

Discussion 0
Questions 43

You are deploying an application that needs to access sensitive information. You need to ensure that this information is encrypted and the risk of exposure is minimal if a breach occurs. What should you do?

Options:

A.  

Store the encryption keys in Cloud Key Management Service (KMS) and rotate the keys frequently

B.  

Inject the secret at the time of instance creation via an encrypted configuration management system.

C.  

Integrate the application with a Single sign-on (SSO) system and do not expose secrets to the application

D.  

Leverage a continuous build pipeline that produces multiple versions of the secret for each instance of the application.

Discussion 0
Questions 44

Your company is migrating its production systems to Google Cloud. You need to implement site reliability engineering (SRE) practices during the migration to minimize customer impact from potential future incidents. Which two SRE practices should you implement?

Choose 2 answers

Options:

A.  

Ensure that full autonomy and permissions are only granted to the on-call team.

B.  

Automate common tasks to analyze key impact information and intelligently suggest mitigating actions for the on-call team.

C.  

Ensure that all teams can modify the production environment to resolve issues.

D.  

Create an alerting mechanism for your SRE team based on your system's internal behavior.

E.  

Create up-to-date playbooks with instructions for debugging and mitigating issues.

Discussion 0
Questions 45

Your organization wants to collect system logs that will be used to generate dashboards in Cloud Operations for their Google Cloud project. You need to configure all current and future Compute Engine instances to collect the system logs and you must ensure that the Ops Agent remains up to date. What should you do?

Options:

A.  

Use the gcloud CLI to install the Ops Agent on each VM listed in the Cloud Asset Inventory

B.  

Select all VMs with an Agent status of Not detected on the Cloud Operations VMs dashboard Then select Install agents

C.  

Use the gcloud CLI to create an Agent Policy.

D.  

Install the Ops Agent on the Compute Engine image by using a startup script

Discussion 0
Questions 46

You support a user-facing web application. When analyzing the application’s error budget over the previous six months, you notice that the application has never consumed more than 5% of its error budget in any given time window. You hold a Service Level Objective (SLO) review with business stakeholders and confirm that the SLO is set appropriately. You want your application’s SLO to more closely reflect its observed reliability. What steps can you take to further that goal while balancing velocity, reliability, and business needs? (Choose two.)

Options:

A.  

Add more serving capacity to all of your application’s zones.

B.  

Have more frequent or potentially risky application releases.

C.  

Tighten the SLO match the application’s observed reliability.

D.  

Implement and measure additional Service Level Indicators (SLIs) fro the application.

E.  

Announce planned downtime to consume more error budget, and ensure that users are not depending on a tighter SLO.

Discussion 0
Questions 47

Your Cloud Run application writes unstructured logs as text strings to Cloud Logging. You want to convert the unstructured logs to JSON-based structured logs. What should you do?

Options:

A.  

A Install a Fluent Bit sidecar container, and use a JSON parser.

B.  

Install the log agent in the Cloud Run container image, and use the log agent to forward logs to Cloud Logging.

C.  

Configure the log agent to convert log text payload to JSON payload.

D.  

Modify the application to use Cloud Logging software development kit (SDK), and send log entries with a jsonPay10ad field.

Discussion 0
Questions 48

You are managing an MLOps platform on a Google Kubernetes Engine (GKE) cluster that serves two teams:

    The Research team runs hundreds of small, daily experiments that can run as capacity allows and are highly cost-sensitive.

    The Production team must retrain the company's main language model every Friday at 10:00 PM. This job is business-critical, cannot be interrupted, and must start on time.

You need to configure the modes for Dynamic Workload Scheduler to meet each team's requirements. What should you do?

Options:

A.  

Use Flex Start mode for both the Research and Production teams.

B.  

Use Flex Start mode for the Research team and Calendar mode for the Production team.

C.  

Use Calendar mode for both the Research and Production teams.

D.  

Use Calendar mode for the Research team and Flex Start mode for the Production team.

Discussion 0
Questions 49

Your organization wants to implement Site Reliability Engineering (SRE) culture and principles. Recently, a service that you support had a limited outage. A manager on another team asks you to provide a formal explanation of what happened so they can action remediations. What should you do?

Options:

A.  

Develop a postmortem that includes the root causes, resolution, lessons learned, and a prioritized list of action items. Share it with the manager only.

B.  

Develop a postmortem that includes the root causes, resolution, lessons learned, and a prioritized list of action items. Share it on the engineering organization's document portal.

C.  

Develop a postmortem that includes the root causes, resolution, lessons learned, the list of people responsible, and a list of action items for each person. Share it with the manager only.

D.  

Develop a postmortem that includes the root causes, resolution, lessons learned, the list of people responsible, and a list of action items for each person. Share it on the engineering organization's document portal.

Discussion 0
Questions 50

You use Artifact Registry to store container images built with Cloud Build. You need to ensure that all existing and new images are continuously scanned for vulnerabilities. You also want to track who pushed each image to the registry. What should you do?

Options:

A.  

Configure Artifact Registry to automatically trigger vulnerability scans for new image tags, and view scan results. Use Cloud Audit Logs to track image tag creation events.

B.  

Configure Artifact Registry to automatically scan new images and periodically re-scan all images. Use Cloud Audit Logs to track image uploads and identify the user who pushed each image.

C.  

Configure Artifact Registry to automatically re-scan images daily. Enable Cloud Audit Logs to track these scans, and use Logs Explorer to identify vulnerabilities.

D.  

Configure Artifact Registry to send vulnerability scan results to a Cloud Storage bucket. Use a separate script to parse results and notify a security team.

Discussion 0
Questions 51

You are the Site Reliability Engineer responsible for managing your company's data services and products. You regularly navigate operational challenges, such as unpredictable data volume and high cost, with your company's data ingestion processes. You recently learned that a new data ingestion product will be developed in Google Cloud. You need to collaborate with the product development team to provide operational input on the new product. What should you do?

Options:

A.  

Deploy the prototype product in a test environment, run a load test, and share the results with the product development team.

B.  

When the initial product version passes the quality assurance phase and compliance assessments, deploy the product to a staging environment. Share error logs and performancemetrics with the product development team.

C.  

When the new product is used by at least one internal customer in production, share error logs and monitoring metrics with the product development team.

D.  

Review the design of the product with the product development team to provide feedback early in the design phase.

Discussion 0
Questions 52

Your company runs an ecommerce website built with JVM-based applications and microservice architecture in Google Kubernetes Engine (GKE) The application load increases during the day and decreases during the night Your operations team has configured the application to run enough Pods to handle the evening peak load You want to automate scaling by only running enough Pods and nodes for the load What should you do?

Options:

A.  

Configure the Vertical Pod Autoscaler but keep the node pool size static

B.  

Configure the Vertical Pod Autoscaler and enable the cluster autoscaler

C.  

Configure the Horizontal Pod Autoscaler but keep the node pool size static

D.  

Configure the Horizontal Pod Autoscaler and enable the cluster autoscaler

Discussion 0
Questions 53

Your team is designing a new application for deployment both inside and outside Google Cloud Platform (GCP). You need to collect detailed metrics such as system resource utilization. You want to use centralized GCP services while minimizing the amount of work required to set up this collection system. What should you do?

Options:

A.  

Import the Stackdriver Profiler package, and configure it to relay function timing data to Stackdriver for further analysis.

B.  

Import the Stackdriver Debugger package, and configure the application to emit debug messages with timing information.

C.  

Instrument the code using a timing library, and publish the metrics via a health check endpoint that is scraped by Stackdriver.

D.  

Install an Application Performance Monitoring (APM) tool in both locations, and configure an export to a central data storage location for analysis.

Discussion 0
Questions 54

You are the Operations Lead for an ongoing incident with one of your services. The service usually runs at around 70% capacity. You notice that one node is returning 5xx errors for all requests. There has also been a noticeable increase in support cases from customers. You need to remove the offending node from the load balancer pool so that you can isolate and investigate the node. You want to follow Google-recommended practices to manage the incident and reduce the impact on users. What should you do?

Options:

A.  

1. Communicate your intent to the incident team.2. Perform a load analysis to determine if the remaining nodes can handle the increase in traffic offloaded from the removed node, and scale appropriately.3. When any new nodes report healthy, drain traffic from the unhealthy node, and remove the unhealthy node from service.

B.  

1. Communicate your intent to the incident team.2. Add a new node to the pool, and wait for the new node to report as healthy.3. When traffic is being served on the new node, drain traffic from the unhealthy node, and remove the old node from service.

C.  

1 . Drain traffic from the unhealthy node and remove the node from service.2. Monitor traffic to ensure that the error is resolved and that the other nodes in the pool are handling the traffic appropriately.3. Scale the pool as necessary to handle the new load.4. Communicate your actions to the incident team.

D.  

1 . Drain traffic from the unhealthy node and remove the old node from service.2. Add a new node to the pool, wait for the new node to report as healthy, and then serve traffic to the new node.3. Monitor traffic to ensure that the pool is healthy and is handling traffic appropriately.4. Communicate your actions to the incident team.

Discussion 0
Questions 55

You work for a healthcare company and regulations require you to create all resources in a United States-based region. You attempted to create a secret in Secret Manager but received the following error message:

The resource cannot be created in the global location due to an organization policy restriction.

You need to resolve the error while remaining compliant with regulations. What should you do?

Options:

A.  

Create the secret with an automatic replication policy.

B.  

Create the secret with a user-managed replication policy.

C.  

Remove the organization policy referenced in the error message.

D.  

Add the global region to the organization policy referenced in the error message.

Discussion 0
Questions 56

Your company is using HTTPS requests to trigger a public Cloud Run-hosted service accessible at the https://booking-engine-abcdef .a.run.app URL You need to give developers the ability to test the latest revisions of the service before the service is exposed to customers What should you do?

Options:

A.  

Runthegcioud run deploy booking-engine —no-traffic —-ag dev command Use the https://dev----booking-engine-abcdef. a. run. app URL for testing

B.  

Runthegcioud run services update-traffic booking-engine —to-revisions LATEST*! command Use the ht tps: //booking-engine-abcdef. a. run. ape URL for testing

C.  

Pass the curl -K "Authorization: Hearer S(gclcud auth print-identity-token)" auth token Use the https: / /booking-engine-abcdef. a. run. app URL to test privately

D.  

Grant the roles/run. invoker role to the developers testing the booking-engine service Use the https: //booking-engine-abcdef. private. run. app URL for testing

Discussion 0
Questions 57

Your company runs services by using Google Kubernetes Engine (GKE). The GKE clusters in the development environment run applications with verbose logging enabled. Developers view logs by using the kubect1 logs

command and do not use Cloud Logging. Applications do not have a uniform logging structure defined. You need to minimize the costs associated with application logging while still collecting GKE operational logs. What should you do?

Options:

A.  

Run the gcloud container clusters update --logging—SYSTEM command for the development cluster.

B.  

Run the gcloud container clusters update logging=WORKLOAD command for the development cluster.

C.  

Run the gcloud logging sinks update _Defau1t --disabled command in the project associated with the development environment.

D.  

Add the severity >= DEBUG resource. type "k83 container" exclusion filter to the Default logging sink in the project associated with the development environment.

Discussion 0
Questions 58

You are configuring connectivity across Google Kubernetes Engine (GKE) clusters in different VPCs You notice that the nodes in Cluster A are unable to access the nodes in Cluster B You suspect that the workload access issue is due to the network configuration You need to troubleshoot the issue but do not have execute access to workloads and nodes You want to identify the layer at which the network connectivity is broken What should you do?

Options:

A.  

Install a toolbox container on the node in Cluster A Confirm that the routes to Cluster B are configured appropriately

B.  

Use Network Connectivity Center to perform a Connectivity Test from Cluster A to Cluster

C.  

Use a debug container to run the traceroute command from Cluster A to Cluster B and from Cluster B to Cluster A Identify the common failure point

D.  

Enable VPC Flow Logs in both VPCs and monitor packet drops

Discussion 0
Questions 59

Your company's security team needs to have read-only access to Data Access audit logs in the _Required bucket You want to provide your security team with the necessary permissions following the principle of least privilege and Google-recommended practices. What should you do?

Options:

A.  

Assign the roles/logging, viewer role to each member of the security team

B.  

Assign the roles/logging. viewer role to a group with all the security team members

C.  

Assign the roles/logging.privateLogViewer role to each member of the security team

D.  

Assign the roles/logging.privateLogviewer role to a group with all the security team members

Discussion 0
Questions 60

Your company has a Google Cloud resource hierarchy with folders for production test and development Your cyber security team needs to review your company's Google Cloud security posture to accelerate security issue identification and resolution You need to centralize the logs generated by Google Cloud services from all projects only inside your production folder to allow for alerting and near-real time analysis. What should you do?

Options:

A.  

Enable the Workflows API and route all the logs to Cloud Logging

B.  

Create a central Cloud Monitoring workspace and attach all related projects

C.  

Create an aggregated log sink associated with the production folder that uses a Pub Sub topic as the destination

D.  

Create an aggregated log sink associated with the production folder that uses a Cloud Logging bucket as the destination

Discussion 0