


According to our co-creation way, Baufest recommended a series of steps to allow for YCH BI team to make informed decisions for their journey into the new Enterprise Data Warehouse solution.

 A Technology Decision Matrix so that YCH can compare the main features of Azure offers for EDW and decide with objective parameters the most suitable choice for their needs.
  A Technology Decision Matrix so that YCH can compare the main features of Azure offers for EDW and decide with objective parameters the most suitable choice for their needs. An architecture diagram, resulting from the technology chosen,
  An architecture diagram, resulting from the technology chosen, A suggested, phased Roadmap with clear deliverables to implement and evolve the solution,
  A suggested, phased Roadmap with clear deliverables to implement and evolve the solution, And the description of typical EDW project Roles adapted and suggested for YCH
  And the description of typical EDW project Roles adapted and suggested for YCHYCH BI team had developed several reports in Power BI, supporting 150 users in the last 2 years. Realizing that the existing platform could not support requirements such as intraday updates and historical information for tendency analysis and prediction capabilities, YCH IT & BI teams envisioned an Enterprise Data Warehouse solution.
With historical production and sales data readily available, the data science team could build a model to predict the probability of over/under inventory levels, proactively freeing up inventory for additional sales.

 Data Warehouse
  Data Warehouse
environment is proposed using schemas inside a single Azure SQL Database. Schemas include:
 Dimensional Layer: to unify and encapsulate business logic in a single layer optimized for intuitive data access to allow for Self Service BI in the reporting tool.
  Dimensional Layer: to unify and encapsulate business logic in a single layer optimized for intuitive data access to allow for Self Service BI in the reporting tool.
 Atomic Layer: consolidated 3rd normal form relational model, including business rules.
  Atomic Layer: consolidated 3rd normal form relational model, including business rules. Staging area: cleansed data with technical validation rules and record counts.
  Staging area: cleansed data with technical validation rules and record counts.
 Raw data: Depending on origin source type. Json files can be stored in Cloud Storage to keep original format.
  Raw data: Depending on origin source type. Json files can be stored in Cloud Storage to keep original format.
 Data Science Sandbox can be extended using an additional Schema on Azure SQL Database, extracting information from Atomic Layer or Staging Area depending on the need, or adding additional data types (such as json), including external ad-hoc sources.
  Data Science Sandbox can be extended using an additional Schema on Azure SQL Database, extracting information from Atomic Layer or Staging Area depending on the need, or adding additional data types (such as json), including external ad-hoc sources. Data integration layer
  Data integration layer
 SSIS is the most suitable choice in terms of current volume needs.
  SSIS is the most suitable choice in terms of current volume needs.
 Azure Integration Runtime to orchestrate executions
  Azure Integration Runtime to orchestrate executions Visualization Layer
  Visualization Layer
 Power BI as visualization tool
  Power BI as visualization tool
 Connect with Dimensional Layer via DirectQuery. This allows to build visualizations over very large datasets, where it would otherwise be
unfeasible to first import all the data with pre-aggregation.
DirectQuery reports always use current data.
  Connect with Dimensional Layer via DirectQuery. This allows to build visualizations over very large datasets, where it would otherwise be
unfeasible to first import all the data with pre-aggregation.
DirectQuery reports always use current data.YCH now has access to 6 distinct business units’ data in a scalable, measurable, and controllable environment.
With the creation of the EDW at YCH we were able to create a single source for information eliminating the need to query multiple sources. This also enabled the business to focus on analysis rather than operational tasks to gather and cleanse data, ensuring consistency, quality controls and timeliness.

YCH Data
Admin Manager