Are you looking for the Google Cloud Professional Data Engineer Exam Questions? The questions and answers provided here test and enhance your knowledge of the exam objectives.
A Professional Data Engineer collects, transforms, and publishes data, thereby enabling data-driven decision making. Earning the Google Cloud Certified Professional Data Engineer certification may help you pursue a better career in the Google Cloud industry. To pass the actual exam, you need to spend time learning and re-learning through multiple practice tests. Let’s start learning!

Domain: Design Data Processing Systems

Q1 : A company is migrating its current infrastructure from on-premises to Google Cloud. It stores over 280TB of data on its on-premises HDFS servers. You are tasked with moving the data from HDFS to Google Cloud Storage in a secure and efficient manner. Which of the following approaches is best to fulfill this task?

A. Install the Google Cloud Storage gsutil tool on the servers and copy the data from HDFS to Google Cloud Storage.

Correct Answer: D

Explanation : Storage Transfer Service allows you to quickly import ONLINE data into Cloud Storage. You can also set up a repeating schedule for transferring data, as well as transfer data within Cloud Storage, from one bucket to another. Transfer Appliance is an OFFLINE, secure, high-capacity storage server that you set up in your datacenter. You fill it with data and ship it to an ingest location, where the data is uploaded to Google Cloud Storage. So answer D is the correct one, while B is incorrect.

References:
Google Cloud Storage Transfer Service: https://cloud.google.com/storage-transfer/docs/
Google Appliance Transfer Service: https://cloud.google.com/transfer-appliance/
Migrate HDFS to Google Storage: https://cloud.google.com/solutions/migration/hadoop/hadoop-gcp-migration-data

Domain: Design Data Processing Systems

Q2 : You have a Dataflow pipeline that processes a set of data files received from a client, transforming them and loading them into a data warehouse. The pipeline should run each morning so that metrics are ready when stakeholders need the latest stats based on the data sent the day before.
Which tool should you use?

A. Cloud Functions

Correct Answer: D

Explanation : The question asks for a service that can schedule and trigger a Dataflow pipeline.

A: Cloud Functions
Cloud Functions can be written in Node.js, Python, Go, Java, .NET, Ruby, and PHP, and are executed in language-specific runtimes. HTTP functions are invoked by standard HTTP requests; these requests wait for the response, and the functions support common HTTP request methods such as GET, PUT, POST, DELETE, and OPTIONS. Cloud Functions run in response to a request or event and have no scheduling capability of their own, hence this is not a correct solution.

Reference:
Cloud Scheduler: https://cloud.google.com/scheduler/

Domain: Build and Operationalize Data Processing Systems

Q3 : A pharmaceutical factory has over
100,000 different sensors generating JSON-format events every 10 seconds. You need to gather the event data for sensor and time-series analysis.