TL;DR: In this article, the authors present an approach to derive OLAP dimensions from the normalized relational table and the results of the OLAP measures derivation by an automated method according to the present invention.
Abstract: A Relational Database Management System (RDBMS) having any arbitrary structure is translated into a multi-dimensional data model suitable for performing OLAP operations upon. If a relational table defining the relational model includes any tables with cardinality of 1,1 or 0,1, the tables are merged into a single table. If the relational table is not normalized, then normalization is performed and a relationship between the original table and the normalized table is created. If the relational table is normalized, but not by dependence between columns, such as in the dimension table in a snowflake schema, the normalization process is performed using the foreign key in order to generate the normalized table. Once the normalized table is generated, OLAP measures are derived from the normalized relational table by an automated method. In addition, OLAP dimensions are derived from the normalized relational table and the results of the OLAP measures derivation by an automated method according to the present invention. According to an aspect, it is possible to associate a member of a dimension to another member of the same or another dimension. According to another aspect, it is possible to create a new dimension of analysis, the members of which are all the different values that a scalar expression can take on. According to yet another aspect, it is possible to access the various instances of a Reporting Object as members in an OLAP dimension. According to the yet another aspect, it is possible to apply opaque filters or a combination of them to the data that underlies analysis.
TL;DR: Intended for IS professionals as well as strategic planners, this fascinating book can be well relied upon as the essential reference to the standards, tools, technologies, and possibilities of data warehousing today.
Abstract: From the Publisher:
Optimize your organization's data delivery system! Improving data delivery is a top priority in business computing today. This comprehensive,cutting-edge guide can helpby showing you how to effectively integrate data mining and other powerful data warehousing technologies. You'll learn how to: Use data warehousing to establish a competitive advantage; Solve business problems faster by exploiting online analytical processing (OLAP); Evaluate various data warehousing solutions (including SMP and MPP,parallel database management systems,metadata,OLAP,etc. ); Leverage your data warehousing utility via the Internet,client/server computing,and various data mining tools. In addition to providing a detailed overview and strategic analysis of the available data warehousing technologies,the book serves as a practical guide to data warehouse database design,star and snowflake schema approaches,multidimensional and mutirelational models,advanced indexing techniques,and data mining. You'll also learn how to compare different data mine technologies and products,and understand how they fit into your overall business and data processes. Intended for IS professionals as well as strategic planners,this fascinating book can be well relied upon as the essential reference to the standards,tools,technologiesand possibilitiesof data warehousing today
TL;DR: This work formalises the concept of a snowflake schema in terms of an acyclic database schema whose join tree satisfies certain structural properties and defines a normal form for snowflake schemas which captures its intuitive meaning with respect to a set of functional and inclusion dependencies.
TL;DR: In this article, a process for duplicate detection is implemented based on interpreting records from multiple dimensional tables in a data warehouse, which are associated with hierarchies specified through key-foreign key relationships in a snowflake schema.
Abstract: The invention concerns a detection of duplicate tuples in a database. Previous domain independent detection of duplicated tuples relied on standard similarity functions (e.g., edit distance, cosine metric) between multi-attribute tuples. However, such prior art approaches result in large numbers of false positives if they are used to identify domain-specific abbreviations and conventions. In accordance with the invention a process for duplicate detection is implemented based on interpreting records from multiple dimensional tables in a data warehouse, which are associated with hierarchies specified through key—foreign key relationships in a snowflake schema. The invention exploits the extra knowledge available from the table hierarchy to develop a high quality, scalable duplicate detection process.
TL;DR: A literature study about data analysis by using ERD and conceptual model of data warehouse and also a case study about mini market information system to support that explanation are suggested.
Abstract: Data is an important part in enterprise information system. So data can be used effectively, we have to analyse the data. There are many ways to analysing and modelling data, some of them are by using Entity Relationship Diagram (ERD) and conceptual model of data warehouse such as star schema, snowflake schema, dan fact constellations schema. This paper suggest a literature study about data analysis by using ERD and conceptual model of data warehouse and also a case study about mini market information system to support that explanation. Design of ERD, star schema, snowflake schema, dan fact constellations schema in that information system meant to manage point of sale, purchasing, and stock control. Design of ERD can be used to modelling transactional data. While data warehouse more used to support manager to make a decision in an enterprise.
Keywords: Entity Relationship Design, Star schema, Snowflakes schema, Fact constellation schema.