DP-201 Free Update With 100% Exam Passing Guarantee [2022]
[Mar-2022] Verified Microsoft Exam Dumps with DP-201 Exam Study Guide
Potential Candidates and Prerequisites
The Microsoft DP-201 exam is intended mainly for the Azure data engineers. These are the professionals who work closely with the business stakeholders in order to determine and satisfy the data requirements and design data solutions that utilize Azure data services. These specialists are also responsible for designing Azure data storage solutions that utilize relational & non-relational data stores, real-time and batch data processing solutions, as well as data security & compliance solutions.
To successfully complete the DP-201 test, the applicants should have competence in designing data solutions that utilize Azure services, including Azure SQL Database, Azure Cosmos DB, Azure Data Lake Storage, Azure Synapse Analytics, Azure Stream Analytics, Azure Data Factory, Azure Blob storage, and Azure Databricks.
What is Microsoft DP-201 Certification Exam:
Microsoft DP-201, Designing an Azure Data Solution, is a certification exam that establishes your credibility as a designer of data solutions on Microsoft Azure. It covers the design of Azure data storage solutions, data processing solutions, and data security and compliance. The exam helps you combine your technical skills with broad business knowledge, because design decisions are driven by stakeholder requirements. Read this guide to find out more about the Microsoft DP-201 certification exam!
Reasons and benefits of taking the DP-201 dumps offered by ExamsReviews:
- The DP-201 exam is timed, so you need to be fully prepared before you sit it. Start preparing well in advance rather than at the last minute; thorough preparation reduces stress on exam day.
- Dumps help you get used to the exam pattern: which topics appear, how questions are asked, and the language used in the exam environment. Working through the answers in context gives you a better chance of remembering them.
- The dumps have prepared over 2000 candidates all over the world who are now certified. Preparing with exam dumps can help you build knowledge and skills faster than reading the book alone, saving you over 25% of the time and energy you would otherwise spend.
- They contain both theoretical and practical DP-201 questions, which is especially useful for candidates who want practical knowledge before sitting the real exam. You can also leave a comment or feedback for us on whether you found the dump helpful.
- There is a full 100% money-back guarantee in case you do not pass the exam, which makes us a reliable partner for getting the Microsoft DP-201 certification. Your results are kept private.
NEW QUESTION 21
You need to design storage for the solution.
Which storage services should you recommend? To answer, select the appropriate configuration in the answer area.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Images: Azure Data Lake Storage
Scenario: Image data must be stored in a single data store at minimum cost.
Customer data: Azure Blob Storage
Scenario: Customer data must be analyzed using managed Spark clusters.
Spark clusters in HDInsight are compatible with Azure Storage and Azure Data Lake Storage.
Azure Storage includes these data services: Azure Blob, Azure Files, Azure Queues, and Azure Tables.
References:
https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-overview
NEW QUESTION 22
A company stores sensitive information about customers and employees in Azure SQL Database.
You need to ensure that the sensitive data remains encrypted in transit and at rest.
What should you recommend?
- A. Azure Disk Encryption
- B. Always Encrypted with secure enclaves
- C. SQL Server AlwaysOn
- D. Transparent Data Encryption
Answer: B
Explanation:
References:
https://cloudblogs.microsoft.com/sqlserver/2018/12/17/confidential-computing-using-always-encrypted-with-secure-enclaves-in-sql-server-2019-preview/
Topic 4, ADatum Corporation
General Overview
ADatum Corporation is a medical company that has 5,000 physicians located in more than 300 hospitals across the US. The company has a medical department, a sales department, a marketing department, a medical research department, and a human resources department.
You are redesigning the application environment of ADatum.
Physical Locations
ADatum has three main offices in New York, Dallas, and Los Angeles. The offices connect to each other by using a WAN link. Each office connects directly to the Internet. The Los Angeles office also has a datacenter that hosts all the company's applications.
Existing Environment
Health Review
ADatum has a critical OLTP web application named Health Review that physicians use to track billing, patient care, and overall physician best practices.
Health Interface
ADatum has a critical application named Health Interface that receives hospital messages related to patient care and status updates. The messages are sent in batches by each hospital's enterprise relationship management (ERM) system by using a VPN. The data sent from each hospital can have varying columns and formats.
Currently, a custom C# application is used to send the data to Health Interface. The application uses deprecated libraries and a new solution must be designed for this functionality.
Health Insights
ADatum has a web-based reporting system named Health Insights that shows hospital and patient insights to physicians and business users. The data is created from the data in Health Review and Health Interface, as well as manual entries.
Database Platform
Currently, the databases for all three applications are hosted on an out-of-date VMware cluster that has a single instance of Microsoft SQL Server 2012.
Problem Statements
ADatum identifies the following issues in its current environment:
Over time, the data received by Health Interface from the hospitals has slowed, and the number of messages has increased.
When a new hospital joins ADatum, Health Interface requires a schema modification due to the lack of data standardization.
The speed of batch data processing is inconsistent.
Business Requirements
Business Goals
ADatum identifies the following business goals:
Migrate the applications to Azure whenever possible.
Minimize the development effort required to perform data movement.
Provide continuous integration and deployment for development, test, and production environments.
Provide faster access to the applications and the data and provide more consistent application performance.
Minimize the number of services required to perform data processing, development, scheduling, monitoring, and the operationalizing of pipelines.
Health Review Requirements
ADatum identifies the following requirements for the Health Review application:
Ensure that sensitive health data is encrypted at rest and in transit.
Tag all the sensitive health data in Health Review. The data will be used for auditing.
Health Interface Requirements
ADatum identifies the following requirements for the Health Interface application:
Upgrade to a data storage solution that will provide flexible schemas and increased throughput for writing data. Data must be regionally located close to each hospital, and reads must display the most recent committed version of an item.
Reduce the amount of time it takes to add data from new hospitals to Health Interface.
Support a more scalable batch processing solution in Azure.
Reduce the amount of development effort to rewrite existing SQL queries.
Health Insights Requirements
ADatum identifies the following requirements for the Health Insights application:
The analysis of events must be performed over time by using an organizational date dimension table.
The data from Health Interface and Health Review must be available in Health Insights within 15 minutes of being committed.
The new Health Insights application must be built on a massively parallel processing (MPP) architecture that will support the high performance of joins on large fact tables.
NEW QUESTION 23
You have a large amount of sensor data stored in an Azure Data Lake Storage Gen2 account. The files are in the Parquet file format.
New sensor data will be published to Azure Event Hubs.
You need to recommend a solution to add the new sensor data to the existing sensor data in real-time. The solution must support the interactive querying of the entire dataset.
Which type of service should you include in the recommendation?
- A. Azure Stream Analytics
- B. Azure Databricks
- C. Azure Cosmos DB
- D. Azure SQL Database
Answer: A
Explanation:
Azure Stream Analytics is a fully managed PaaS offering that enables real-time analytics and complex event processing on fast moving data streams.
By outputting data in Parquet format into a blob store or a data lake, you can take advantage of Azure Stream Analytics to power large-scale streaming extract, transform, and load (ETL), to run batch processing, to train machine learning algorithms, or to run interactive queries on your historical data.
Reference:
https://azure.microsoft.com/en-us/blog/new-capabilities-in-stream-analytics-reduce-development-time-for-big-da
NEW QUESTION 24
You need to design the unauthorized data usage detection system.
What Azure service should you include in the design?
- A. Azure Analysis Services
- B. Azure Databricks
- C. Azure SQL Data Warehouse
- D. Azure Data Factory
Answer: C
Explanation:
SQL Database and SQL Data Warehouse
SQL threat detection identifies anomalous activities indicating unusual and potentially harmful attempts to access or exploit databases.
Advanced Threat Protection for Azure SQL Database and SQL Data Warehouse detects anomalous activities indicating unusual and potentially harmful attempts to access or exploit databases.
Scenario:
Requirements. Security
The solution must meet the following security requirements:
* Unauthorized usage of data must be detected in real time. Unauthorized usage is determined by looking for unusual usage patterns.
Reference:
https://docs.microsoft.com/en-us/azure/sql-database/sql-database-threat-detection-overview
Design for data security and compliance
Testlet 5
Case study
This is a case study. Case studies are not timed separately. You can use as much exam time as you would like to complete each case. However, there may be additional case studies and sections on this exam. You must manage your time to ensure that you are able to complete all questions included on this exam in the time provided.
To answer the questions included in a case study, you will need to reference information that is provided in the case study. Case studies might contain exhibits and other resources that provide more information about the scenario that is described in the case study. Each question is independent of the other questions in this case study.
At the end of this case study, a review screen will appear. This screen allows you to review your answers and to make changes before you move to the next section of the exam. After you begin a new section, you cannot return to this section.
To start the case study
To display the first question in this case study, click the Next button. Use the buttons in the left pane to explore the content of the case study before you answer the questions. Clicking these buttons displays information such as business requirements, existing environment, and problem statements. If the case study has an All Information tab, note that the information displayed is identical to the information displayed on the subsequent tabs. When you are ready to answer a question, click the Question button to return to the question.
Background
Trey Research is a technology innovator. The company partners with regional transportation department offices to build solutions that improve traffic flow and safety.
The company is developing the following solutions:
Regional transportation departments installed traffic sensor systems on major highways across North America.
Sensors record the following information each time a vehicle passes in front of a sensor:
* Time
* Location in latitude and longitude
* Speed in kilometers per second (kmps)
* License plate number
* Length of vehicle in meters
Sensors provide data by using the following structure:
Traffic sensors will occasionally capture an image of a vehicle for debugging purposes.
You must optimize performance of saving/storing vehicle images.
Traffic sensor data
* Sensors must have permission only to add items to the SensorData collection.
* Traffic data insertion rate must be maximized.
* Once every three months all traffic sensor data must be analyzed to look for data patterns that indicate sensor malfunctions.
* Sensor data must be stored in a Cosmos DB named treydata in a collection named SensorData
* The impact of vehicle images on sensor data throughput must be minimized.
Backtrack
This solution reports on all data related to a specific vehicle license plate. The report must use data from the SensorData collection. Users must be able to filter vehicle data in the following ways:
* vehicles on a specific road
* vehicles driving above the speed limit
Planning Assistance
Data used for Planning Assistance must be stored in a sharded Azure SQL Database.
Data from the SensorData collection will automatically be loaded into the Planning Assistance database once a week by using Azure Data Factory. You must be able to manually trigger the data load process.
Privacy and security policy
* Azure Active Directory must be used for all services where it is available.
* For privacy reasons, license plate number information must not be accessible in Planning Assistance.
* Unauthorized usage of the Planning Assistance data must be detected as quickly as possible. Unauthorized usage is determined by looking for an unusual pattern of usage.
* Data must only be stored for seven years.
Performance and availability
* The report for Backtrack must execute as quickly as possible.
* The SLA for Planning Assistance is 70 percent, and multiday outages are permitted.
* All data must be replicated to multiple geographic regions to prevent data loss.
* You must maximize the performance of the Real Time Response system.
Financial requirements
Azure resource costs must be minimized where possible.
Design for data security and compliance
Testlet 6
Case study
Overview
You develop data engineering solutions for Graphics Design Institute, a global media company with offices in New York City, Manchester, Singapore, and Melbourne.
The New York office hosts SQL Server databases that store massive amounts of customer data. The company also stores millions of images on a physical server located in the New York office. More than 2 TB of image data is added each day. The images are transferred from customer devices to the server in New York.
Many images have been placed on this server in an unorganized manner, making it difficult for editors to search images. Images should automatically have object and color tags generated. The tags must be stored in a document database, and be queried by SQL.
You are hired to design a solution that can store, transform, and visualize customer data.
Requirements
Business
The company identifies the following business requirements:
* You must transfer all images and customer data to cloud storage and remove on-premises servers.
* You must develop an analytical processing solution for transforming customer data.
* You must develop an image object and color tagging solution.
* Capital expenditures must be minimized.
* Cloud resource costs must be minimized.
Technical
The solution has the following technical requirements:
* Tagging data must be uploaded to the cloud from the New York office location.
* Tagging data must be replicated to regions that are geographically close to company office locations.
* Image data must be stored in a single data store at minimum cost.
* Customer data must be analyzed using managed Spark clusters.
* Power BI must be used to visualize transformed customer data.
* All data must be backed up in case disaster recovery is required.
Security and optimization
All cloud data must be encrypted at rest and in transit. The solution must support:
* parallel processing of customer data
* hyper-scale storage of images
* global region data replication of processed image data
Design for data security and compliance
Question Set 7
NEW QUESTION 25
What should you recommend as a batch processing solution for Health Interface?
- A. Azure Stream Analytics
- B. Azure CycleCloud
- C. Azure Databricks
- D. Azure Data Factory
Answer: D
Explanation:
Scenario: ADatum identifies the following requirements for the Health Interface application:
Support a more scalable batch processing solution in Azure.
Reduce the amount of time it takes to add data from new hospitals to Health Interface.
Data Factory integrates with the Azure Cosmos DB bulk executor library to provide the best performance when you write to Azure Cosmos DB.
References:
https://docs.microsoft.com/en-us/azure/data-factory/connector-azure-cosmos-db
NEW QUESTION 26
You plan to use Azure SQL Database to support a line of business app.
You need to identify sensitive data that is stored in the database and monitor access to the data.
Which three actions should you recommend? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
- A. Enable Auditing.
- B. Run Vulnerability Assessment.
- C. Use Advanced Threat Protection.
- D. Configure Data Discovery and Classification.
- E. Implement Transparent Data Encryption (TDE).
Answer: A,C,D
Explanation:
References:
https://docs.microsoft.com/en-us/azure/azure-sql/database/data-discovery-and-classification-overview
NEW QUESTION 27
You are designing a real-time processing solution for maintenance work requests that are received via email.
The solution will perform the following actions:
* Store all email messages in an archive.
* Access weather forecast data by using the Python SDK for Azure Open Datasets.
* Identify high priority requests that will be affected by poor weather conditions and store the requests in an Azure SQL database.
The solution must minimize costs.
How should you complete the solution? To answer, drag the appropriate services to the correct locations. Each service may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Box 1: Azure Storage
Azure Event Hubs enables you to automatically capture the streaming data in Event Hubs in an Azure Blob storage or Azure Data Lake Storage Gen 1 or Gen 2 account of your choice, with the added flexibility of specifying a time or size interval. Setting up Capture is fast, there are no administrative costs to run it, and it scales automatically with Event Hubs throughput units. Event Hubs Capture is the easiest way to load streaming data into Azure, and enables you to focus on data processing rather than on data capture.
Box 2: Azure Logic Apps
You can monitor and manage events sent to Azure Event Hubs from inside a logic app with the Azure Event Hubs connector. That way, you can create logic apps that automate tasks and workflows for checking, sending, and receiving events from your Event Hub.
Reference:
https://docs.microsoft.com/en-us/azure/event-hubs/event-hubs-capture-overview
https://docs.microsoft.com/en-us/azure/connectors/connectors-create-api-azure-event-hubs
NEW QUESTION 28
Which Azure Data Factory components should you recommend using together to import the daily inventory data from SQL to Data Lake Storage? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Box 1: Self-hosted integration runtime
A self-hosted IR is capable of running a copy activity between a cloud data store and a data store in a private network.
Scenario: Daily inventory data comes from a Microsoft SQL server located on a private network.
Box 2: Schedule trigger
Daily schedule
Box 3: Copy activity
Scenario:
Stage inventory data in Azure Data Lake Storage Gen2 before loading the data into the analytical data store. Litware wants to remove transient data from Data Lake Storage once the data is no longer in use. Files that have a modified date that is older than 14 days must be removed.
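The 14-day retention rule in the scenario can be illustrated with a minimal sketch. It is plain Python rather than a Data Factory activity or an Azure SDK call, and the file listing, paths, and the `files_to_remove` helper are hypothetical names introduced only for illustration.

```python
from datetime import datetime, timedelta, timezone

# From the scenario: files modified more than 14 days ago must be removed.
RETENTION = timedelta(days=14)


def files_to_remove(files, now):
    """Return the paths whose last-modified time is more than 14 days before `now`.

    `files` maps a path to its last-modified datetime. This is a hypothetical
    in-memory listing, not an object returned by any Azure SDK.
    """
    cutoff = now - RETENTION
    return sorted(path for path, modified in files.items() if modified < cutoff)


now = datetime(2022, 3, 31, tzinfo=timezone.utc)
listing = {
    "staging/inventory/2022-03-10.csv": datetime(2022, 3, 10, tzinfo=timezone.utc),
    "staging/inventory/2022-03-17.csv": datetime(2022, 3, 17, tzinfo=timezone.utc),
    "staging/inventory/2022-03-30.csv": datetime(2022, 3, 30, tzinfo=timezone.utc),
}
expired = files_to_remove(listing, now)
print(expired)
```

A file modified exactly 14 days before `now` is kept here, because the scenario says "older than 14 days"; a real cleanup job would need to confirm that boundary with the requirement owner.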
NEW QUESTION 29
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You plan to store delimited text files in an Azure Data Lake Storage account that will be organized into department folders.
You need to configure data access so that users see only the files in their respective department folder.
Solution: From the storage account, you enable a hierarchical namespace, and you use RBAC.
Does this meet the goal?
- A. No
- B. Yes
Answer: A
Explanation:
Disable the hierarchical namespace. And instead of RBAC use access control lists (ACLs).
Note: Azure Data Lake Storage implements an access control model that derives from HDFS, which in turn derives from the POSIX access control model.
Blob container ACLs do not support the hierarchical namespace, so it must be disabled.
Reference:
https://docs.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-known-issues
https://docs.microsoft.com/en-us/azure/data-lake-store/data-lake-store-access-control
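To make the goal of the question concrete, here is a small sketch of folder-level access checks in the spirit of POSIX-style ACLs. It only models the idea of a per-folder grant; the `FOLDER_READERS` table, group names, and helper functions are hypothetical and do not correspond to any Azure API.

```python
# Hypothetical folder ACLs: each department folder lists the groups allowed to read it.
FOLDER_READERS = {
    "/finance": {"finance-team"},
    "/hr": {"hr-team"},
}


def can_read(user_groups, path):
    """Return True if any of the user's groups is granted read on the file's top-level folder."""
    folder = "/" + path.strip("/").split("/")[0]
    allowed = FOLDER_READERS.get(folder, set())
    return bool(allowed & set(user_groups))


def visible_files(user_groups, paths):
    """Filter a file listing down to the files the user is allowed to see."""
    return [p for p in paths if can_read(user_groups, p)]


all_files = ["/finance/q1.csv", "/hr/salaries.csv", "/finance/q2.csv"]
finance_view = visible_files(["finance-team"], all_files)
hr_view = visible_files(["hr-team"], all_files)
print(finance_view, hr_view)
```

The point of the sketch is that the grant is attached to each folder, so a user in one department never sees another department's files; a single grant at the level of the whole store could not express that.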
NEW QUESTION 30
You need to ensure that emergency road response vehicles are dispatched automatically.
How should you design the processing system? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Box 1: API App
* Events generated from the IoT data sources are sent to the stream ingestion layer through Azure HDInsight Kafka as a stream of messages. HDInsight Kafka stores streams of data in topics for a configurable period of time.
* A Kafka consumer, Azure Databricks, picks up the messages in real time from the Kafka topic, processes the data based on the business logic, and can then send it to the serving layer for storage.
* Downstream storage services, like Azure Cosmos DB, Azure SQL Data Warehouse, or Azure SQL DB, will then be a data source for the presentation and action layer.
* Business analysts can use Microsoft Power BI to analyze warehoused data. Other applications can be built upon the serving layer as well. For example, we can expose APIs based on the service layer data for third party uses.
Box 2: Cosmos DB Change Feed
Change feed support in Azure Cosmos DB works by listening to an Azure Cosmos DB container for any changes. It then outputs the sorted list of documents that were changed in the order in which they were modified.
The change feed in Azure Cosmos DB enables you to build efficient and scalable solutions for each of these patterns, as shown in the following image:
References:
https://docs.microsoft.com/bs-cyrl-ba/azure/architecture/example-scenario/data/realtime-analytics-vehicle-iot?vie
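The change feed behaviour described above can be sketched with a toy in-memory container. This is not the Azure Cosmos DB SDK: `ToyContainer`, `upsert`, and `read_changes` are hypothetical names, and the sketch assumes the feed returns only the latest version of each changed document, ordered by modification.

```python
class ToyContainer:
    """In-memory stand-in for a container; not the Azure Cosmos DB SDK."""

    def __init__(self):
        self._log = []  # (sequence number, document id, document body)
        self._seq = 0

    def upsert(self, doc_id, document):
        """Record a create or update and stamp it with an increasing sequence number."""
        self._seq += 1
        self._log.append((self._seq, doc_id, dict(document)))

    def read_changes(self, after=0):
        """Return (documents, continuation).

        `documents` holds the latest version of every document modified after
        the `after` sequence number, in the order they were last modified.
        `continuation` can be passed back as `after` on the next call.
        """
        latest = {}
        for seq, doc_id, document in self._log:
            if seq > after:
                latest[doc_id] = (seq, document)
        ordered = sorted(latest.items(), key=lambda item: item[1][0])
        docs = [{"id": doc_id, **document} for doc_id, (_, document) in ordered]
        return docs, self._seq


container = ToyContainer()
container.upsert("incident-1", {"status": "reported"})
container.upsert("incident-2", {"status": "reported"})
container.upsert("incident-1", {"status": "dispatched"})
first_batch, token = container.read_changes()

container.upsert("incident-3", {"status": "reported"})
second_batch, token2 = container.read_changes(after=token)
print(first_batch, second_batch)
```

A consumer such as the dispatch logic in the question would keep the continuation value between calls so that each change is processed once.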
NEW QUESTION 31
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have streaming data that is received by Azure Event Hubs and stored in Azure Blob storage. The data contains social media posts that relate to a keyword of Contoso.
You need to count how many times the Contoso keyword and a keyword of Litware appear in the same post every 30 seconds. The data must be available to Microsoft Power BI in near real-time.
Solution: You use Azure Data Factory and an event trigger to detect when new blobs are created. You use mapping data flows in Azure Data Factory to aggregate and filter the data, and then send the data to an Azure SQL database. You consume the data in Power BI by using DirectQuery mode.
Does the solution meet the goal?
- A. No
- B. Yes
Answer: A
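The requirement in this scenario amounts to a 30-second tumbling-window count of posts that mention both keywords. The following is a minimal plain-Python sketch of that aggregation, not Stream Analytics, Data Factory, or Databricks code; the `(epoch_seconds, text)` post format and the function name are assumptions made for illustration.

```python
from collections import Counter

WINDOW_SECONDS = 30  # window length from the scenario


def count_co_mentions(posts, keywords=("contoso", "litware")):
    """Count posts containing every keyword, grouped into 30-second tumbling windows.

    `posts` is an iterable of (epoch_seconds, text) pairs.
    Returns a dict mapping each window's start time to its count.
    """
    counts = Counter()
    for timestamp, text in posts:
        lowered = text.lower()
        if all(keyword in lowered for keyword in keywords):
            # Tumbling windows do not overlap: each post falls in exactly one window.
            window_start = int(timestamp // WINDOW_SECONDS) * WINDOW_SECONDS
            counts[window_start] += 1
    return dict(counts)


sample_posts = [
    (0, "Contoso and Litware announce a partnership"),
    (12, "Contoso ships a new product"),
    (29, "litware vs contoso: which is better?"),
    (31, "Litware and Contoso again"),
    (75, "Nothing relevant here"),
]
window_counts = count_co_mentions(sample_posts)
print(window_counts)
```

Windows with no matching posts are simply absent from the result; a reporting layer would decide whether to show them as zero.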
NEW QUESTION 32
You have a CSV file in Azure Blob storage. The file does NOT have a header row.
You need to use Azure Data Factory to copy the file to an Azure SQL database. The solution must minimize how long it takes to copy the file.
How should you configure the copy process? To answer, drag the appropriate components to the correct locations. Each component may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Input: A delimited text dataset that has a comma as the column delimiter
columnDelimiter: The character(s) used to separate columns in a file.
The default value is a comma (,). When the column delimiter is defined as an empty string, which means no delimiter, the whole line is taken as a single column.
Pipeline: A data flow activity that has a general purpose compute type
When you're transforming data in mapping data flows, you can read and write files from Azure Blob storage.
Output: A copy activity that has an explicit schema mapping
Use Copy Activity in Azure Data Factory to copy data from and to Azure SQL Database, and use Data Flow to transform data in Azure SQL Database.
Reference:
https://docs.microsoft.com/en-us/azure/data-factory/format-delimited-text
https://docs.microsoft.com/en-us/azure/data-factory/connector-azure-sql-database
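Because the file has no header row, columns can only be identified by position, which is why an explicit schema mapping is needed. The sketch below shows the same idea with Python's standard `csv` module; it is not Data Factory configuration, and the column names and sample rows are hypothetical.

```python
import csv
import io

# Hypothetical column names: a headerless file carries none of its own.
COLUMN_NAMES = ["product_id", "name", "quantity"]


def read_headerless_csv(text, column_names, delimiter=","):
    """Parse delimited text that has no header row, mapping columns by position."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    return [dict(zip(column_names, row)) for row in reader]


raw = "1,Widget,10\n2,Gadget,5\n"
rows = read_headerless_csv(raw, COLUMN_NAMES)
print(rows)
```

All values come back as strings here; mapping them to typed database columns is a separate step that the sketch leaves out.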
NEW QUESTION 33
You are designing a new application that uses Azure Cosmos DB. The application will support a variety of data patterns including log records and social media mentions.
You need to recommend which Cosmos DB API to use for each data pattern. The solution must minimize resource utilization.
Which API should you recommend for each data pattern? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Log records: SQL
Social media mentions: Gremlin
You can store the actual graph of followers using Azure Cosmos DB Gremlin API to create vertexes for each user and edges that maintain the "A-follows-B" relationships. With the Gremlin API, you can get the followers of a certain user and create more complex queries to suggest people in common. If you add to the graph the Content Categories that people like or enjoy, you can start weaving experiences that include smart content discovery, suggesting content that those people you follow like, or finding people that you might have much in common with.
References:
https://docs.microsoft.com/en-us/azure/cosmos-db/social-media-apps
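The "A-follows-B" graph described in the explanation can be sketched with a plain adjacency map. This is not the Gremlin API or a Cosmos DB query; the user names, the `FOLLOWS` table, and both helper functions are hypothetical and only show the kind of traversal a graph model makes natural.

```python
from collections import Counter

# Hypothetical "A-follows-B" edges: each key follows every user in its set.
FOLLOWS = {
    "alice": {"bob", "carol"},
    "bob": {"carol", "dave"},
    "carol": {"dave"},
    "dave": set(),
}


def followers_of(user):
    """Return everyone who has a follows-edge pointing at `user`."""
    return sorted(a for a, followed in FOLLOWS.items() if user in followed)


def suggest_to_follow(user):
    """Rank accounts followed by the people `user` follows, excluding existing follows."""
    already = FOLLOWS[user] | {user}
    scores = Counter()
    for friend in FOLLOWS[user]:
        for candidate in FOLLOWS[friend]:
            if candidate not in already:
                scores[candidate] += 1
    # Highest score first; ties are broken alphabetically for a stable order.
    return [name for name, _ in sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))]


carol_followers = followers_of("carol")
alice_suggestions = suggest_to_follow("alice")
print(carol_followers, alice_suggestions)
```

Both helpers are two-hop walks over edges, which is the access pattern that favours a graph API for social media mentions.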
NEW QUESTION 34
You are planning the deployment of two separate Azure Cosmos DB databases named db1 and db2.
You need to recommend a deployment strategy that meets the following requirements:
* Costs for both databases must be minimized.
* Db1 must meet an SLA of 99.99% for both reads and writes.
* Db2 must meet an SLA of 99.99% for writes and 99.999% for reads.
Which deployment strategy should you recommend for each database? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Db1: A single read/write region
Db2: A single write region and multi read regions
References:
https://docs.microsoft.com/en-us/azure/cosmos-db/high-availability
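To put the two SLA figures in perspective, the arithmetic below converts an availability percentage into permitted downtime per year. It is a simplification: it treats the SLA as pure uptime over a 365-day year, which is an assumption made for illustration and not how the service agreement itself measures availability.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes, ignoring leap years


def allowed_downtime_minutes(sla_percent):
    """Convert an availability percentage into permitted downtime minutes per year."""
    return (100.0 - sla_percent) / 100.0 * MINUTES_PER_YEAR


four_nines = allowed_downtime_minutes(99.99)
five_nines = allowed_downtime_minutes(99.999)
print(round(four_nines, 2), round(five_nines, 3))
```

Under this simplification, 99.99% allows roughly 52.56 minutes of downtime per year and 99.999% roughly 5.26 minutes, a tenfold difference that explains why Db2's read requirement calls for additional read regions.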
NEW QUESTION 35
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have streaming data that is received by Azure Event Hubs and stored in Azure Blob storage. The data contains social media posts that relate to a keyword of Contoso.
You need to count how many times the Contoso keyword and a keyword of Litware appear in the same post every 30 seconds. The data must be available to Microsoft Power BI in near real-time.
Solution: You use Azure Databricks to create a Scala notebook. You use a Structured Streaming job to connect to the event hub that counts the posts that have the specified keywords, and then writes the data to a Delta table. You consume the data in Power BI by using DirectQuery mode.
Does the solution meet the goal?
- A. No
- B. Yes
Answer: A
NEW QUESTION 36
Which Azure service should you recommend for the analytical data store so that the business analysts and data scientists can execute ad hoc queries as quickly as possible?
- A. Azure Data Lake Storage Gen2
- B. Azure Synapse Analytics
- C. Azure Cosmos DB
- D. Azure SQL Database
Answer: A
Explanation:
There are several differences between a data lake and a data warehouse. Data structure, ideal users, processing methods, and the overall purpose of the data are the key differentiators.
Scenario: Litware employs business analysts who prefer to analyze data by using Microsoft Power BI, and data scientists who prefer analyzing data in Azure Databricks notebooks.
Note: Azure Synapse Analytics formerly known as Azure SQL Data Warehouse.
Design Azure data storage solutions
Question Set 7
NEW QUESTION 37
You plan to migrate data to Azure SQL Database.
The database must remain synchronized with updates to Microsoft Azure and SQL Server.
You need to set up the database as a subscriber.
What should you recommend?
- A. Data Migration Assistant
- B. SQL Server Management Studio 17.9.1 or later
- C. Azure Data Factory
- D. SQL Server Data Tools
- E. SQL Server Agent for SQL Server 2017 or later
Answer: B
Explanation:
To set up the database as a subscriber we need to configure database replication. You can use SQL Server Management Studio to configure replication. Use the latest versions of SQL Server Management Studio in order to be able to use all the features of Azure SQL Database.
Reference:
https://www.sqlshack.com/sql-server-database-migration-to-azure-sql-database-using-sql-server-transactional-replication/
NEW QUESTION 38
You need to design a backup solution for the processed customer data.
What should you include in the design?
- A. Geo-Replication
- B. AdlCopy
- C. Geo-Redundancy
- D. AzCopy
Answer: C
Explanation:
Scenario: All data must be backed up in case disaster recovery is required.
Geo-redundant storage (GRS) is designed to provide at least 99.99999999999999% (16 9's) durability of objects over a given year by replicating your data to a secondary region that is hundreds of miles away from the primary region. If your storage account has GRS enabled, then your data is durable even in the case of a complete regional outage or a disaster in which the primary region isn't recoverable.
Reference:
https://docs.microsoft.com/en-us/azure/storage/common/storage-redundancy-grs
NEW QUESTION 39
You need to design the unauthorized data usage detection system.
What Azure service should you include in the design?
- A. Azure Analysis Services
- B. Azure Databricks
- C. Azure Data Factory
- D. Azure Synapse Analytics
Answer: D
Explanation:
SQL Database and SQL Data Warehouse
SQL threat detection identifies anomalous activities indicating unusual and potentially harmful attempts to access or exploit databases.
Advanced Threat Protection for Azure SQL Database and SQL Data Warehouse detects anomalous activities indicating unusual and potentially harmful attempts to access or exploit databases.
Scenario:
Requirements. Security
The solution must meet the following security requirements:
* Unauthorized usage of data must be detected in real time. Unauthorized usage is determined by looking for unusual usage patterns.
Reference:
https://docs.microsoft.com/en-us/azure/sql-database/sql-database-threat-detection-overview
Design for data security and compliance
Testlet 5
Case study
This is a case study. Case studies are not timed separately. You can use as much exam time as you would like to complete each case. However, there may be additional case studies and sections on this exam. You must manage your time to ensure that you are able to complete all questions included on this exam in the time provided.
To answer the questions included in a case study, you will need to reference information that is provided in the case study. Case studies might contain exhibits and other resources that provide more information about the scenario that is described in the case study. Each question is independent of the other questions in this case study.
At the end of this case study, a review screen will appear. This screen allows you to review your answers and to make changes before you move to the next section of the exam. After you begin a new section, you cannot return to this section.
To start the case study
To display the first question in this case study, click the Next button. Use the buttons in the left pane to explore the content of the case study before you answer the questions. Clicking these buttons displays information such as business requirements, existing environment, and problem statements. If the case study has an All Information tab, note that the information displayed is identical to the information displayed on the subsequent tabs. When you are ready to answer a question, click the Question button to return to the question.
Background
Trey Research is a technology innovator. The company partners with regional transportation department offices to build solutions that improve traffic flow and safety.
The company is developing the following solutions:
Regional transportation departments installed traffic sensor systems on major highways across North America.
Sensors record the following information each time a vehicle passes in front of a sensor:
* Time
* Location in latitude and longitude
* Speed in kilometers per second (kmps)
* License plate number
* Length of vehicle in meters
Sensors provide data by using the following structure:
Traffic sensors will occasionally capture an image of a vehicle for debugging purposes.
You must optimize performance of saving/storing vehicle images.
Traffic sensor data
* Sensors must have permission only to add items to the SensorData collection.
* Traffic data insertion rate must be maximized.
* Once every three months all traffic sensor data must be analyzed to look for data patterns that indicate sensor malfunctions.
* Sensor data must be stored in a Cosmos DB named treydata in a collection named SensorData.
* The impact of vehicle images on sensor data throughput must be minimized.
Backtrack
This solution reports on all data related to a specific vehicle license plate. The report must use data from the SensorData collection. Users must be able to filter vehicle data in the following ways:
* vehicles on a specific road
* vehicles driving above the speed limit
Planning Assistance
Data used for Planning Assistance must be stored in a sharded Azure SQL Database.
Data from the SensorData collection will automatically be loaded into the Planning Assistance database once a week by using Azure Data Factory. You must be able to manually trigger the data load process.
Privacy and security policy
* Azure Active Directory must be used for all services where it is available.
* For privacy reasons, license plate number information must not be accessible in Planning Assistance.
* Unauthorized usage of the Planning Assistance data must be detected as quickly as possible. Unauthorized usage is determined by looking for an unusual pattern of usage.
* Data must only be stored for seven years.
Performance and availability
* The report for Backtrack must execute as quickly as possible.
* The SLA for Planning Assistance is 70 percent, and multiday outages are permitted.
* All data must be replicated to multiple geographic regions to prevent data loss.
* You must maximize the performance of the Real Time Response system.
Financial requirements
Azure resource costs must be minimized where possible.
Design for data security and compliance
Testlet 6
Case study
Overview
You develop data engineering solutions for Graphics Design Institute, a global media company with offices in New York City, Manchester, Singapore, and Melbourne.
The New York office hosts SQL Server databases that store massive amounts of customer data. The company also stores millions of images on a physical server located in the New York office. More than 2 TB of image data is added each day. The images are transferred from customer devices to the server in New York.
Many images have been placed on this server in an unorganized manner, making it difficult for editors to search images. Images should automatically have object and color tags generated. The tags must be stored in a document database and be queryable by using SQL.
You are hired to design a solution that can store, transform, and visualize customer data.
Requirements
Business
The company identifies the following business requirements:
* You must transfer all images and customer data to cloud storage and remove on-premises servers.
* You must develop an analytical processing solution for transforming customer data.
* You must develop an image object and color tagging solution.
* Capital expenditures must be minimized.
* Cloud resource costs must be minimized.
Technical
The solution has the following technical requirements:
* Tagging data must be uploaded to the cloud from the New York office location.
* Tagging data must be replicated to regions that are geographically close to company office locations.
* Image data must be stored in a single data store at minimum cost.
* Customer data must be analyzed using managed Spark clusters.
* Power BI must be used to visualize transformed customer data.
* All data must be backed up in case disaster recovery is required.
Security and optimization
All cloud data must be encrypted at rest and in transit. The solution must support:
* parallel processing of customer data
* hyper-scale storage of images
* global region data replication of processed image data
Design for data security and compliance
Question Set 7
NEW QUESTION 40
You manage a process that performs analysis of daily web traffic logs on an HDInsight cluster. Each of the 250 web servers generates approximately 10 megabytes (MB) of log data each day. All log data is stored in a single folder in Microsoft Azure Data Lake Storage Gen2.
You need to improve the performance of the process.
Which two changes should you make? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.
- A. Increase the value of the hive.tez.container.size parameter
- B. Increase the value of the mapreduce map memory parameter
- C. Combine the daily log files for all servers into one file
- D. Increase the number of worker nodes
- E. Move the log files into folders so that each day's logs are in their own folder
Answer: C,E
Explanation:
C: Typically, analytics engines such as HDInsight and Azure Data Lake Analytics have a per-file overhead. If you store your data as many small files, this can negatively affect performance. In general, organize your data into larger files for better performance (256 MB to 100 GB in size). Some engines and applications might have trouble efficiently processing files that are larger than 100 GB.
E: For Hive workloads, partition pruning of time-series data can help some queries read only a subset of the data, which improves performance.
Pipelines that ingest time-series data often place their files in a very structured naming scheme for files and folders. Below is a very common example for data that is structured by date:
\DataSet\YYYY\MM\DD\datafile_YYYY_MM_DD.tsv
Notice that the datetime information appears both as folders and in the filename.
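As an illustration of the layout above, the following sketch builds such a path for a given day. The function name `partitioned_path` and the `sep` parameter are hypothetical; only the folder and file pattern comes from the example.

```python
from datetime import date


def partitioned_path(dataset: str, day: date, sep: str = "\\") -> str:
    """Return <sep>dataset<sep>YYYY<sep>MM<sep>DD<sep>datafile_YYYY_MM_DD.tsv for the given day."""
    # The date appears twice: once as nested folders and once in the file name.
    folder = sep.join([dataset, f"{day:%Y}", f"{day:%m}", f"{day:%d}"])
    return f"{sep}{folder}{sep}datafile_{day:%Y_%m_%d}.tsv"


print(partitioned_path("DataSet", date(2022, 3, 1)))
# prints \DataSet\2022\03\01\datafile_2022_03_01.tsv
```

Because each day lands in its own folder, a query that filters on a date range can skip the folders outside that range.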
Reference:
https://docs.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-performance-tuning-guidance
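The file-size guidance in the explanation can be checked against the numbers in the question with a short calculation. This is only a worked arithmetic sketch. The constant and function names are made up for illustration, and 1 GB is taken as 1,024 MB as an assumption.

```python
SERVERS = 250                      # from the question
MB_PER_SERVER_PER_DAY = 10         # from the question
MIN_RECOMMENDED_MB = 256           # lower bound quoted in the explanation
MAX_RECOMMENDED_MB = 100 * 1024    # 100 GB upper bound, assuming 1 GB = 1024 MB


def in_recommended_range(size_mb: float) -> bool:
    """Return True when a file size falls inside the quoted 256 MB to 100 GB range."""
    return MIN_RECOMMENDED_MB <= size_mb <= MAX_RECOMMENDED_MB


combined_mb = SERVERS * MB_PER_SERVER_PER_DAY  # one combined file per day

print(in_recommended_range(MB_PER_SERVER_PER_DAY))     # a single server's daily file
print(combined_mb, in_recommended_range(combined_mb))  # all servers combined
```

A single 10 MB per-server file is well below the quoted range, while one combined daily file of 2,500 MB falls inside it.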
NEW QUESTION 41
You are designing an Azure SQL Data Warehouse for a financial services company. Azure Active Directory will be used to authenticate the users.
You need to ensure that the following security requirements are met:
* Department managers must be able to create new databases.
* The IT department must assign users to databases.
* Permissions granted must be minimized.
Which role memberships should you recommend? To answer, drag the appropriate roles to the correct groups.
Each role may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
Box 1: dbmanager
Members of the dbmanager role can create new databases.
Box 2: db_accessadmin
Members of the db_accessadmin fixed database role can add or remove access to the database for Windows logins, Windows groups, and SQL Server logins.
References:
https://docs.microsoft.com/en-us/azure/sql-database/sql-database-manage-logins
Topic 4, ADatum Corporation
Case study
Overview
General Overview
ADatum Corporation is a medical company that has 5,000 physicians located in more than 300 hospitals across the US. The company has a medical department, a sales department, a marketing department, a medical research department, and a human resources department.
You are redesigning the application environment of ADatum.
Physical Locations
ADatum has three main offices in New York, Dallas, and Los Angeles. The offices connect to each other by using a WAN link. Each office connects directly to the Internet. The Los Angeles office also has a datacenter that hosts all the company's applications.
Existing Environment
Health Review
ADatum has a critical OLTP web application named Health Review that physicians use to track billing, patient care, and overall physician best practices.
Health Interface
ADatum has a critical application named Health Interface that receives hospital messages related to patient care and status updates. The messages are sent in batches by each hospital's enterprise relationship management (ERM) system by using a VPN. The data sent from each hospital can have varying columns and formats.
Currently, a custom C# application is used to send the data to Health Interface. The application uses deprecated libraries and a new solution must be designed for this functionality.
Health Insights
ADatum has a web-based reporting system named Health Insights that shows hospital and patient insights to physicians and business users. The data is created from the data in Health Review and Health Interface, as well as manual entries.
Database Platform
Currently, the databases for all three applications are hosted on an out-of-date VMware cluster that has a single instance of Microsoft SQL Server 2012.
Problem Statements
ADatum identifies the following issues in its current environment:
* Over time, the data received by Health Interface from the hospitals has slowed, and the number of messages has increased.
* When a new hospital joins ADatum, Health Interface requires a schema modification due to the lack of data standardization.
* The speed of batch data processing is inconsistent.
Business Requirements
Business Goals
ADatum identifies the following business goals:
* Migrate the applications to Azure whenever possible.
* Minimize the development effort required to perform data movement.
* Provide continuous integration and deployment for development, test, and production environments.
* Provide faster access to the applications and the data and provide more consistent application performance.
* Minimize the number of services required to perform data processing, development, scheduling, monitoring, and the operationalizing of pipelines.
Health Review Requirements
ADatum identifies the following requirements for the Health Review application:
* Ensure that sensitive health data is encrypted at rest and in transit.
* Tag all the sensitive health data in Health Review. The data will be used for auditing.
Health Interface Requirements
ADatum identifies the following requirements for the Health Interface application:
* Upgrade to a data storage solution that will provide flexible schemas and increased throughput for writing data. Data must be regionally located close to each hospital, and reads must display the most recent committed version of an item.
* Reduce the amount of time it takes to add data from new hospitals to Health Interface.
* Support a more scalable batch processing solution in Azure.
* Reduce the amount of development effort to rewrite existing SQL queries.
Health Insights Requirements
ADatum identifies the following requirements for the Health Insights application:
* The analysis of events must be performed over time by using an organizational date dimension table.
* The data from Health Interface and Health Review must be available in Health Insights within 15 minutes of being committed.
* The new Health Insights application must be built on a massively parallel processing (MPP) architecture that will support the high performance of joins on large fact tables.
NEW QUESTION 42
You need to design the image processing and storage solutions.
What should you recommend? To answer, select the appropriate configuration in the answer area.
NOTE: Each correct selection is worth one point.
Answer:
Explanation:
References:
https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/batch-processing
https://docs.microsoft.com/en-us/azure/sql-database/sql-database-service-tier-hyperscale
NEW QUESTION 43
......
Authentic Best resources for DP-201 Online Practice Exam: https://www.examsreviews.com/DP-201-pass4sure-exam-review.html