Data Engineering Assignment
===================================
Dear Candidate,
Thanks for accepting our invitation on doing this assignment. Purpose is to provide us an assessment of your
current skills, which is just another variable to be consider in the overall interview process.
Context
The client “World Wide Importers” realized that using the procurement data directly from the production
database for the different services and requirements carries risk they are not willing to take, your task
is to provide an alternative solution.
Objective
You are a consultant whose task is to make the purchasing information more accessible for the clients reporting
needs. The client expects you to move and transfer the data to appropri-ate format and storage that can
then be consumed by their BI platform. From the data the client should be able to produce two reports.
How much they have paid for their suppliers by supplier by year and how many different items have they
purchased from their top 50 suppli-ers. You should do this implementation in Azure.
Expected results.
1. Provide us the code and instructions on how to deploy it to Azure as well as the notes from your
implementation if any
2. In addition we would like you to describe (in written text, architecture diagram etc.):
a) What would you change or add if you had to implement the same solution for five different clients?
b) How about 100 different clients?
Walkthrough tips
1. The source data is in WorldWideImporters database catalog
2. All the needed data is located in following tables:
Purchasing.SupplierTransactions, Pur-chasing.Suppliers, Purchasing.PurchaseOrders and Purchasing.PurchaseOrderLines
3. You can take shortcuts in the implementation (ie. not store secrets in KeyVault to ease the deployment)
but please note this down and provide with your answer
Please deliver the assignment as a link to a zip archive in some file-sharing service
(e.g. Google Drive, OneDrive, iCloud etc.) back to the person you received the invitation
for the assignment via email. Please don't use other compression formats than zip and don't
encrypt the archive with a password.
Please do not post the code in a public repository on GitHub, GitLab or such.
We will get back to you the soonest we have reviewed the assignment, max. time of two weeks.
Good Luck and hope you have fun doing it!
Sievo Crew