Connect on Whatsapp : +1 206 673 2541, Get Homework Help 24x7, 100% Confidential. Connect Now

Data and Metadata Design Assignment | Leading Homework Help

Data technologies are constantly advancing, but most have been adopted piecemeal by institutions. As a result, enterprise information is vastly underutilized, whether associated with client interactions, company performance, or external happenings in the corporate atmosphere. Furthermore, corporate information ecosystems have become complicated and riddled with information silos. This makes the information more challenging to access, limiting institutions’ value from it. To realize the value hidden in their information, businesses must begin to treat information as a supply chain, allowing it to flow without difficulty and usefully throughout each firm’s ecosystem of associates, including suppliers and clienteles. This strategy is appropriate at this time. For starters, new external sources of information are becoming accessible, opening up new avenues for data insights. Furthermore, the tools and technology needed to create an improved data platform are accessible and in use. These lay the groundwork for businesses to build an incorporated, end-to-end data supply chain.

Selected Environment or Scenario

SCM has emerged as a critical enabler of gaining a competitive advantage in recent years. An efficient supply chain may be deemed as critical for an organization’s achievement or failure, and therefore it becomes an enhanced value driver for institutions. Enhanced client demand and diversity, intense competition, increased difficulty and dynamic nature of international set-ups, the strain on product and process innovations, and technological advances, especially in information and communication technology (ICT), have all added sophistication to the design and management of supply chain operations. To thrive in today’s volatile business environment, it is critical to employ information-driven techniques in which cooperation among members is a key success factor. Coordination and collaboration are beneficial to allow supply chain members to achieve their global goals. Information sharing, on the other hand, is critical in supply chain assimilation. It increases customer satisfaction and monetary performance by offering timely and accurate information and increasing supply chain prominence. It establishes and observes crucial performance pointers to identify alterations and shortfalls and alleviate the fundamental influence, which is primarily attributed to the alteration of data flow as it moves from downstream to upstream.

Questions and Problems That Will Be Addressed With the Data.

What has occurred, what is occurring, and why. Visualization tools and an online analytical processing (OLAP) system are used in this process, supported by reporting technology and real-time data to identify new opportunities and problems. They are used to gather, describe, and evaluate raw data from previous events. It evaluates and characterizes past events to make them interpretable and understandable. It can be used to show average cash, inventory supply, and annual sales shifts. It is also beneficial for budgetary, revenues, operational processes, and production findings. Second, it aims to uncover the causes of events and occurrences and precisely predict the future or fill in gaps in data or information that do not already occur. It would anticipate purchase behavior, customer behavior, and purchase patterns to identify and forecast future market trends. These data mechanisms are also used to forecast customers’ needs, inventory data, and business operations. Lastly, it aims to identify the key challenges in SCM, such as key member consideration, interconnected procedures, and the degree of incorporation. The adherents have been divided into two groups: primary, those liable for providing value to customers, and secondary, those who provide support through facilitating elementary functions by providing resources and knowledge.


Suppliers, manufacturers, distributors/retailers, and clienteles are all important players in the scenario. This group of stakeholders offer input information in structured and unstructured arrangements. While files obtained and obtained from conventional databases like relational database systems are organized, files gathered from external sensors, RFID tags, and other sources is unstructured. To encourage suppliers to share their data, a company could offer product discounts and provide a platform where the client can get a centralized view of all his supply chain processes. Furthermore, the insights about the customer provided by his data allow the company to recommend specific plans or programs.

Data Sources

This attempt will necessitate the use of some physical data sources. Client data, sales data, market and rival details, product and service level requirements, brand details, sales projections, stock, resource utilization, process monitoring and scheduling details, skill inventory, supplying information, logistics, pricing, and fund flow/working capital knowledge are examples of data sources in the context of SCM. Supplier data are fundamentally linked to the activities involved with the sourcing process. Production data is created as a result of the manufacturer’s conversion operations. Following production, the content is shipped to storage facilities, from which it is dispersed to end-users (Ittmann, 2015). Delivery data details the delivery evidence. Supply and marketing information includes client details associated with sales and product demand. To match a customer to the most appropriate policy model, certain criteria must be deduced from data available to enable client categorization.

Needs Analysis

Based on the previous assignment, the needs analysis was executed on an insurance company to determine the best data architecture to model all of the various data types for digital health insurance. This is premised on creating a program that protects IT resources and details through a proactive configuration of the insurance company. The system should generate a detailed customer profile, allowing the personal healthcare system to pass fitness level statistics and health recommendations to the user interface. A risk assessment should be performed based upon the customer profile, which defines the personal insurance rate. Lastly, data records are organized by source, in chronological order, according to a predetermined protocol structure, or by allocating all records to specific problems. A chronological storage structure is best suited for fitness data with a continuous inflow and may come from multiple sensor sources. Data can be kept in a distributed, centralized, or hybrid structure. As a result, software experts can model a complete analysis of the study, interrelations, and process, from data collection to customer request.

Proposed Data Architecture Design

The entities at the bottom-most layer represent the numerous input data sources within the supply chain. Suppliers, producers, merchants, and clients are instances of these entities. These entities offer input information in structured and unstructured organisations. While information produced and regained from conventional databases like the relational database systems are framed, data gathered from external sensors, RFID tags, and other sources is unorganized. The collection of these massive amounts of data leads to the production of metadata in the system. The information is then keyed into the metadata architecture as input. While structured information is regained by ETL methods and stored in a storage unit, unstructured information is controlled by the Hadoop cluster’s HDFS and MapReduce structures and retained in a database management system.

After the ETL procedure, an operational information store is also implemented for the structured information responses before they are occupied into the storage centres. The ODS is a databank known to integrate information from numerous sources and carry out extra processes on the information. Information could be pre-processed, polished, remedied for redundancy, and reviewed for reliability and conformance with commercial requirements while in the ODS. Structured input incorporated in the existing process could be stored in the ODS before being sent to the storage unit for long-term storage and archiving. A real-time intelligence system then gains access to the data within the system. RTI is a data analytics solution that enables users to obtain actual information by clearly gaining access to operational structures or keying in commercial transactions into a real-time information storage and business intelligence framework.

Information virtualization, enterprise data integration, enterprise application incorporation, and service-oriented design are among the techniques that allow real-time RTI. RTI facilitates quick decision-making by analyzing data flows in real-time with complex event processing tools and either activating automated deeds or notifying users to trends and patterns. The RTI production could be directly keyed into analytics software, allowing users to visualize data analysis outcomes immediately. Nevertheless, for non-real-time analytics, RTI result could be keyed into a dimensional information storage segment (Leveling Edelbrock & Otto, 2014). A DDS is a database that retains information in data storage segment or RTI module outcome in a layout other than OLTP in conventional database systems. The reason for transferring information from the source information warehouse to the DDS and then querying the DDS rather than instantly querying the information storage unit is that information within a DDS is organized in a dimensional layout that is more appropriate for analytics engines.

The data quality rules perform various data quality checks when the ETL structure distributes information into the DDS. For clarification, the source data distribution center feeds poor-quality details back into the data quality database. A control system manages and orchestrates the ETL system based on the data warehouse’s metadata pattern, rules, and logic. The metadata is a file that contains a summary of the information in the storage unit, including the data arrangement, information significance, data usage, data quality guidelines, and other data-related evidence (Ittmann, 2015). The RTI and DDS modules’ outcomes are keyed into the data mining segment, which is in charge of discovering patterns and correlations that are of interest to the analytics component. The analytics engine’s output is then appropriately presented to the users through rich graphic methods in the elements of reports, diagrams, and graphs. In some instances, the result of the DDS component could be promptly entered into the analytics component for trends and alerts that do not necessitate sophisticated data systems to determine.


In today’s competitive marketplace, the evolution of information technology, consumer expectations, global economic integration, and other modern competitive preferences have compelled enterprises to adopt. As a result, competitive edge between businesses is being replaced by rivalry between companies and the existing supply chains. In today’s competitive environment, supply chain professionals struggle to manage massive amounts of data to achieve an interconnected, efficient, useful, and agile supply chain. Real-time systems will have strict requirements for delays and consistency, whereas general applications might specify a certain tolerance level for these parameters. As a result of the explosive growth in volume and variety of data types throughout the supply chain, there is a need to invest in technology that can intelligently and rapidly evaluate massive quantities of data. The architecture should be open to new stakeholders, types of data, contents, interfaces, and programs to be future-oriented, versatile, and marketable.

Cite this Page

Data and Metadata Design Assignment | Leading Homework Help . (2022, June 24). Essay Writing . Retrieved March 25, 2023, from https://www.essay-writing.com/samples/data-and-metadata-design/
“ Data and Metadata Design Assignment | Leading Homework Help .” Essay Writing , 24 Jun. 2022, www.essay-writing.com/samples/data-and-metadata-design/
Data and Metadata Design Assignment | Leading Homework Help . [online]. Available at: <https://www.essay-writing.com/samples/data-and-metadata-design/> [Accessed 25 Mar. 2023].
Data and Metadata Design Assignment | Leading Homework Help [Internet]. Essay Writing . 2022 Jun 24 [cited 2023 Mar 25]. Available from: https://www.essay-writing.com/samples/data-and-metadata-design/
Get FREE Essay Price Quote
Pages (550 words)
Approximate price: -