TL;DR: This paper studies the generation of database instances that respect the semantics of SQL statements embedded in a database application program and describes a supporting tool which generates a set of constraints.
Abstract: Testing is a critical activity for database application programs as faults if undetected could lead to unrecoverable data loss. Database application programs typically contain statements written in an imperative programming language with embedded data manipulation commands, such as SQL. However relatively little study has been made in the testing of database application programs. In particular, few testing techniques explicitly consider the inclusion of database instances in the selection of test cases and the generation of test data input. In this paper, we study the generation of database instances that respect the semantics of SQL statements embedded in a database application program. The paper also describes a supporting tool which generates a set of constraints. These constraints collectively represent a property against which the program is tested. Database instances for program testing can be derived by solving the set of constraints using existing constraint solvers.
TL;DR: In this paper, a method of distributing research data from a common database to a user of the common database is provided, where data concerning research results and data upon which the research results are based are stored in a local database and are linked to each other.
Abstract: A method of distributing research data from a common database to a user of the common database is provided. Data concerning research results and data upon which the research results are based are stored in a local database and are linked to each other. Data concerning research results and data upon which the research results are based are selectively extracted from the local database to the common database. Research data are then selected by a user of the common database from the extracted data concerning research results and from the data upon which the extracted data are based and the selected research data are distributed to the user.
TL;DR: In this paper, a keyword search is performed on the lookup tables rather than the database tables to determine database column locations of the keyword, and the retrieved rows are ranked in order of relevance before being output.
Abstract: Searching by keywords and providing generalized matching capabilities on a relational database is enabled by performing preprocessing operations to construct inverted list lookup tables based on data record components at an interim level of granularity, such as column location Prefix information is in the inverted list stored for each keyword, keyword sub-string, or stemmed version of the keyword A keyword search is performed on the lookup tables rather than the database tables to determine database column locations of the keyword The lookup tables is scanned to identify each prefix associated with the search term Schema information about the database is used to link the column locations to form database subgraphs that span the keywords Join tables are to generated based on the subgraphs consisting of columns containing the keywords A query on the database is generated to join the tables and retrieve database rows that contain the keyword and the prefixes associated with the keyword The retrieved rows are ranked in order of relevance before being output By preprocessing a relational database to form lookup tables, and initially searching the lookup tables to obtain a targeted subset of the database upon which SQL queries can be performed to collect data records, keyword searching on relational database is made efficient
TL;DR: In this article, a method and system for maintaining a database of configurations of mechanical equipment is described, in which a functional configuration database is established to store functional information about an end item and internal components of the end item.
Abstract: A method and system is disclosed for maintaining a database of configurations of mechanical equipment. A functional configuration database is established to store functional information about an end item and internal components of the end item. A logical configuration database is established that corresponds to the functional configuration database. A physical configuration database is established to store physical information about the end item. An operational configuration database is established to store operational information about the end item. The database of configurations of mechanical equipment is maintained in accordance with the functional configuration database, the logical configuration database, the physical configuration database and the operational configuration database.
TL;DR: In this paper, a computer-implemented export engine automatically extracts selected data directly from an object model via an object query language, automatically builds relational database tables for the extracted data and automatically inserts the extracted extracted data into the tables.
Abstract: A computer-implemented export engine automatically extracts selected data directly from an object model via an object query language, automatically builds relational database tables for the extracted data and automatically inserts the extracted data into the tables. The tables with the inserted data are then loaded into a relational database.
TL;DR: In this paper, a dynamic database management system includes a data dictionary, a data importer and a query front-end, where the importer automatically imports data from an input file into a database, while adding new tables for new attributes as necessary, and updating parameters and folders tables in the data dictionary accordingly.
Abstract: A dynamic database management system includes a data dictionary, a data importer and a query front-end. The data importer automatically imports data from an input file into a database, while adding new tables for new attributes as necessary, and updating parameters and folders tables in the data dictionary accordingly, so that end-users may access the imported data by database queries through the query front-end.
TL;DR: In this article, a method and system for deleting data in a database is presented, which is configurable, extensible and adaptable to the changing structure of a database to provide a desired deletion.
Abstract: The present invention is directed to a method and system for deleting data in a database The method and system is configurable, extensible and adaptable to the changing structure of a database to provide a desired deletion. The method and system includes storing at least one criteria for specifying data to be deleted, accessing the at least one criteria, and identifying tables in a family of tables referencing data associated with the one criteria, wherein each table is associated with at least one delete rule. The method and system further includes performing a delete function on the data associated with the one criteria in the tables according to the at least one delete rule associated with each table.
TL;DR: In this article, a system and method for automated database management is presented, wherein statistics relating to operation of a database may be collected, wherein the database comprises one or more database objects.
Abstract: A system and method for automated database management are provided. Statistics relating to operation of a database may be collected, wherein the database comprises one or more database objects. Characteristics of the database objects may be determined, either automatically or by user intervention, using the collected statistics, one or more policies, and/or one or more definitions. The policies and definitions may be defaults or may be customized by a user. Actions to be performed on the database objects may be determined, either automatically or by user intervention, based on the characteristics of the database objects. A schedule for performing the actions on the database objects may be automatically determined. The actions may be performed on the database objects based on the schedule. The actions on the database objects may be confirmed. The results of the actions on the database objects may be analyzed or monitored. Policies and/or definitions may be reconfigured based on the results. The system and method may automatically decide to perform one or more REORG actions on the database objects based on the statistics. A recommendation of a modification to one or more of the database objects may be generated based on the statistics. The scheduled actions may be modified as a result of detecting a change in the characteristics or usage of at least one of the database objects.
TL;DR: Aliased installation databases as discussed by the authors are easily manageable and provide greater flexibility to software developers in the creation and shipping of a software product, and provide a significant improvement in database performance by allowing the use of uniform data fields to add temporary rows and columns directly to a database table.
Abstract: Methods for Optimizing the Installation of a Software Product onto a Target Computer System Optimizations for the process of installing a software product onto a target computer system. A relational installation database for storing data elements in the form of strings, objects, etc. is aliased with integer identifiers corresponding to each data element. The integer identifiers are obtained from an index that sequentially stores a copy of each unique occurrences of a data element. Populating an installation database with only integers reduces persistent size and provides uniformity to the data fields underlying the database tables, and provides a significant improvement in database performance The uniform data fields may be expanded and contracted to add temporary rows and columns directly to a database table. In this way, temporary data elements may be stored directly in a database table without the need for creating a view of the database table. Temporary data elements are lost when the database table is no longer referenced, and temporary data is ignored when persisting the database. Aliased installation databases are easily manageable and provide greater flexibility to software developers in the creation and shipping of a software product. Database tables may be created in a modular fashion and may be efficiently merged together when the software product is complete. Also, differences between various versions of the software product may be recorded in database transforms. Database transforms may be shipped to the end-user and applied to a prior installation database, so as to provided enhancements, patches, upgrades, custom installations, etc. without the need to ship an entire modified installation database.
TL;DR: In this paper, a centralized synchronization system synchronizes data stored in multiple devices distributed over a relatively unreliable network, to provide automatic recovery from synchronization failures, maintaining a truth database indicating the true state of data and action tables indicating actions to be performed on each of the devices during a next update of the data.
Abstract: A centralized synchronization system synchronizes data stored in multiple devices distributed over a relatively unreliable network, to provide automatic recovery from synchronization failures. The synchronization system maintains a truth database indicating the true state of the data and action tables indicating actions to be performed on each of the devices during a next update of the data. Data is retrieved from the devices over the network, and actual states of the data are determined from the action tables and the retrieved data. The truth database and action tables are updated accordingly. An effective action table is then generated to account for any of the devices which were offline during the most-recent synchronization. The truth database and effective action tables are saved to a central database in an atomic transaction. The effective action tables are then used to update the data on the devices over the network.
TL;DR: In this article, an approach for optimizing a database management system is described, where a summary table that is used to generate a plurality of tables is stored in a solid state disk.
Abstract: An approach for optimizing a database management system is disclosed. A summary table that is used to generate a plurality of tables is stored in a solid state disk. The plurality of tables are stored in the memory of a database server. In addition, a transaction log associated with the plurality of tables are stored on the solid state disk. According to one embodiment of the present invention, this approach is applied to a connection management system in a satellite communication system, wherein the connection management utilizes a relational database to track the communication connections.
TL;DR: A data representation based on graph theory which captures the highly interconnected structure of genome data is developed which serves as the foundation of a graph database management system.
Abstract: Genome databases have specific requirements which limit the usefulness of some database management systems. By using more appropriate database technology, a database system can be developed for genome data. We have developed a data representation based on graph theory which captures the highly interconnected structure of genome data. Graphs are a language which can be tailored for describing genomic information, and we develop a data model based on graphs which serves as the foundation of a graph database management system. IEEE Engineering in Medicine and Biology special issue on Managing Data for the Human Genome Project.
TL;DR: A methodology for the design of an efficient storage structure of OODB that minimizes the database operating costs and uses a genetic algorithm to solve the intractable problem of inheritance of instance variables.
Abstract: Object-oriented databases (OODBs) are known to be rich in functionality but poor in performance. One of the important factors that affect performance is the physical database design. We developed a methodology for the design of an efficient storage structure of OODB that minimizes the database operating costs. The input for our method is the logical OODB schema and set of user transactions of retrieval and update types. The output of our method is the determination of which instance variables should be inherited from direct and indirect superclasses and stored in which subclasses. We used a genetic algorithm (GA) to solve this intractable problem. The methodology was applied on a university database. Compared to previous storage models, the storage model produced with our methodology showed database performance improvement ranging from 26% to 31%, on the average. Our results demonstrate a cost-effective storage structure design that boosts the operating performance of OODBs.
TL;DR: In this paper, the authors proposed a method to reduce the generation load of a new road database by effectively utilizing information included in an existing database, such as one way traffic, a traffic strip, and a sidewalk in the city plan database 20 and road polygons in the road database 30.
Abstract: PROBLEM TO BE SOLVED: To reduce generation load of a new database by effectively utilizing information included in an existing database. SOLUTION: A city plan database 20 as an existing database and a road database 30 as a new database are first input. When a symbol such as one way traffic, a traffic strip, and a sidewalk in the city plan database 20 is included in road polygons in the road database 30, an attribute corresponding to the symbol is set in the road database 30. Thus, the generation load of the newly generated road database 30 can be reduced by effectively utilizing attribute information set up in the existing database. COPYRIGHT: (C)2008,JPO&INPIT
TL;DR: In this paper, a system and method for automated database management are provided, wherein statistics relating to operation of a database may be collected, wherein the database comprises one or more database objects Characteristics of the database objects may be determined, either automatically or by user intervention, using the collected statistics, using one or multiple policies, and/or definitions The policies and definitions may be defaults or may be customized by a user.
Abstract: A system and method for automated database management are provided Statistics relating to operation of a database may be collected, wherein the database comprises one or more database objects Characteristics of the database objects may be determined, either automatically or by user intervention, using the collected statistics, one or more policies, and/or one or more definitions The policies and definitions may be defaults or may be customized by a user Actions to be performed on the database objects may be determined, either automatically or by user intervention, based on the characteristics of the database objects A schedule for performing the actions on the database objects may be automatically determined The actions may be performed on the database objects based on the schedule The actions on the database objects may be confirmed The results of the actions on the database objects may be analyzed or monitored Policies and/or definitions may be reconfigured based on the results The system and method may automatically decide to perform one or more REORG actions on the database objects based on the statistics A recommendation of a modification to one or more of the database objects may be generated based on the statistics The scheduled actions may be modified as a result of detecting a change in the characteristics or usage of at least one of the database objects
TL;DR: The database extensibility features initially developed to support text, spatial and similar forms of complex data are now being used to build e-commerce applications and are evolving into an important mechanism to enable the development of e- commerce systems.
Abstract: Over the last decade, database system products have been extended to provide support for defining, storing, updating, indexing and retrieving complex data with full transaction semantics. Oracle, IBM, Informix and others have used extensibility technology to build database system extensions for text, image, spatial, audio/video, chemical, genetic and other types of complex data. Currently, we find database systems being deployed in support of e-commerce. In many cases, these e-commerce database applications use only simple SQL data types to represent items such as office supplies, computers, books and CDs. There is also a large and important set of e-commerce applications that employ complex data formats such as EDI, SWIFT and HL7. The database extensibility features initially developed to support text, spatial and similar forms of complex data are now being used to build e-commerce applications. Thus, database extensibility technology is evolving into an important mechanism to enable the development of e-commerce systems.
TL;DR: The role of data management in multimedia digital libraries, and its implications for the design of database management systems are investigated, and the notions of content abstraction and content independence are introduced.
Abstract: A database management system is a general-purpose software system that facilitates the processes of defining, constructing, and manipulating databases for various applications. This article investigates the role of data management in multimedia digital libraries, and its implications for the design of database management systems. The notions of content abstraction and content independence are introduced, which clearly expose the unique challenges (for database architecture) of applications involving multimedia search. A blueprint of a new class of database technology is proposed, which supports the basic functionality for the management of both content and structure of multimedia objects.
TL;DR: A database was used for data management and interprogram communication in an image processing and three-dimensional reconstruction program suite for biological bundles, using FORTRAN interface and function libraries to communicate with image processing software.
TL;DR: A data model for managing vector data, image data, DEM data and attribute data, and its implementation in GeoStar is introduced.
Abstract: Spatial database management system is the most important module in GIS software This paper discusses the concepts and development in spatial database management system Then it introduces a data model for managing vector data, image data, DEM data and attribute data, and its implementation in GeoStar
TL;DR: The New England Water-Use Data System (NEWUDS) is a database for the storage and retrieval of water-use data that handles single-user and aggregate-user water- use data, can be used for large or small water-network projects, and is available as a stand-alone Microsoft® Access database structure.
Abstract: The New England Water-Use Data System (NEWUDS) is a database for the storage and retrieval of water-use data. NEWUDS can handle data covering many facets of water use, including (1) tracking various types of water-use activities (withdrawals, returns, transfers, distributions, consumptive-use, wastewater collection, and treatment); (2) the description, classification and location of places and organizations involved in water-use activities; (3) details about measured or estimated volumes of water associated with water-use activities; and (4) information about data sources and water resources associated with water use. In NEWUDS, each water transaction occurs unidirectionally between two site objects, and the sites and conveyances form a water network. The core entities in the NEWUDS model are site, conveyance, transaction/rate, location, and owner. Other important entities include water resources (used for withdrawals and returns), data sources, and aliases. Multiple water-exchange estimates can be stored for individual transactions based on different methods or data sources. Storage of user-defined details is accommodated for several of the main entities. Numerous tables containing classification terms facilitate detailed descriptions of data items and can be used for routine or custom data summarization. NEWUDS handles single-user and aggregate-user water-use data, can be used for large or small water-network projects, and is available as a stand-alone Microsoft® Access database structure. Users can customize and extend the database, link it to other databases, or implement the design in other relational database applications.
TL;DR: In this article, a database management method is proposed to solve the problem that a database operation stop time or a batch operation time cannot be secured on account of 24-hour global use of a database system, e.g. typically under web environment.
Abstract: PROBLEM TO BE SOLVED: To solve the problem that a database operation stop time or a batch operation time cannot be secured on account of 24-hour global use of a database system, e.g. typically under Web environment. SOLUTION: This database management method is as follows. Inputted data are stored in a first database. When set conditions are satisfied, data not registered in a second database is obtained from the first database on the basis of management information managing data registered in the second database. The obtained data is registered into the second database, and the management information of the obtained data is updated.
TL;DR: The deep-foundation load-test database is a result of research quality data collection over the past 15 years and consists of soils data along with the deep-foundations load- test data gathered from prototype tests conducted all over the world.
Abstract: The deep-foundation load-test database is a result of research quality data collection over the past 15 years. The database consists of soils data along with the deep-foundations load-test data gathered from prototype tests conducted all over the world. The soils data include general site information, stratigraphy, laboratory, and in situ test details. The foundations data consist of general foundations information including foundation construction, and load-settlement information to failure. Over 1,000 foundations are currently in the database, and more are being added. The database is checked for its validity and correctness both before and after the data are added to the database to make sure that the data integrity is preserved. Also, the data are periodically backed up and the data input is strictly administered by providing controlled access to the designated individuals. The database is structured to follow the rules of relational database management systems (RDBMS). The database resides in a Unix-based Sun Solaris server, and the database engine is Sybase RDBMS. The database front-end query application is under development for the Internet using Java as the programming language and will run under any Internet-capable browser (e.g., Netscape, Microsoft Internet Explorer) environment. The application will use Java applets to communicate with the database server. The user community includes state highway engineers, geotechnical researchers, students, and practicing engineers. Interested users can access the database using the interface to view, download, and chart the data at run-time.
TL;DR: In this article, a system and method for delivering batches of data to a client, while maintaining the client's state in the accessed database is described, without requiring the use of a web server's resources or the creation of a partitioned database.
Abstract: A system and method are provided for delivering batches of data to a client, while maintaining the client's state in the accessed database. The system and method eliminate the need for a web server to maintain the location of any given client within a database. Accordingly, the web server operates with greater efficiency, because it is freed from having to maintain the state of any given client within a database and need not create or maintain a partitioned database. Maximum and minimum values of any data previously requested and received by the client are maintained. These maximum and minimum values are later used within a novel set of Structured Query Language (SQL) statements to indicate the user's state within the database. By maintaining minimum and maximum reference values, the user's state within the accessed database can be maintained without storing any information on the web server. The next and previous batches are successfully obtained by comparing these stored reference values to the database so that the user can quickly move between adjacent batches of records. The invention provides a simple and elegant means for maintaining a user's state within a database, without requiring the use of a web server's resources or the creation of a partitioned database.
TL;DR: Performing a statistical review on the database prior to database lock can detect anomalies that would otherwise only show up during analysis.
Abstract: The statistical results from a clinical trial rely on the accuracy and completeness of the database used to generate the results. The quality of the database depends on the quality of the data collection methods and the data management procedures. As part of the data management-biostatistics team, a statistician can ensure that a protocol is written in such a manner as to enhance data management; a statistician's input into the subsequent design of the case report form can facilitate the collection of appropriate and complete data. A statistician can contribute to the accuracy and validity of a database by providing input into the design of both the database structure and data plausibility checks, and the quality control procedures performed to evaluate the integrity of a database. Finally, performing a statistical review on the database prior to database lock can detect anomalies that would otherwise only show up during analysis.
TL;DR: This chapter discusses the major components of neurocore database and their relation to the design criteria and the database federation interface, and database extension interface.
Abstract: Publisher Summary This chapter discusses the major components of neurocore database and their relation to the design criteria. A neurocore database consists of four main components—namely, core database, database extension interface, database federation interface, and new data types for the database created as data blades. The core framework cannot store a complete neuroscientific experiment because it lacks the extensions that allow to the full description of the experimental preparation and the types of research data that are being collected. The core schema of neurocore consists of five basic components: core experimental framework, neuroanatomical concepts, neurochemical concepts, database federation interface, and database extension interface. The core experimental framework in neurocore contains the database tables that store the information needed for storing an experiment. The neuroanatomical section of the neurocore database contains the basic neuroanatomical information needed to provide the framework for storing specific neuroanatomical data, as well as for providing the proper anatomical framework to store any neuroscience experiment within the neurocore. The federation interface enables each neurocore database to share information with other databases.
TL;DR: This report documents the design and implementation of the NDAMS relational database, which was designed to provide a catalog of available information and the results of an assessment of the available data.
Abstract: A National highway and urban runoff water-quality metadatabase was developed by the U.S. Geological Survey in cooperation with the Federal Highway Administration as part of the National Highway Runoff Water-Quality Data and Methodology Synthesis (NDAMS). The database was designed to catalog available literature and to document results of the synthesis in a format that would facilitate current and future research on highway and urban runoff. This report documents the design and implementation of the NDAMS relational database, which was designed to provide a catalog of available information and the results of an assessment of the available data. All the citations and the metadata collected during the review process are presented in a stratified metadatabase that contains citations for relevant publications, abstracts (or previa), and report-review metadata for a sample of selected reports that document results of runoff quality investigations. The database is referred to as a metadatabase because it contains information about available data sets rather than a record of the original data. The database contains the metadata needed to evaluate and characterize how valid, current, complete, comparable, and technically defensible published and available information may be when evaluated for application to the different data-quality objectives as defined by decision makers. This database is a relational database, in that all information is ultimately linked to a given citation in the catalog of available reports. The main database file contains 86 tables consisting of 29 data tables, 11 association tables, and 46 domain tables. The data tables all link to a particular citation, and each data table is focused on one aspect of the information collected in the literature search and the evaluation of available information. This database is implemented in the Microsoft (MS) Access database software because it is widely used within and outside of government and so, is familiar to many existing and potential customers. The stratified metadatabase design for the NDAMS program is implemented in the MS Access file DBDESIGN.mdb and documented using the NDAMS_DD.mdb file recorded on the CD-ROM. The data dictionary file includes complete documentation of the table names, table descriptions, and information about each of the 419 fields in the database.
TL;DR: Aiming at the data characteristics of the reservoir operation system, the system's database is designed and developed by utilizing comprehensively object_oriented method and fast prototype approach and it is shown that the software system is reliable, stable and safety.
Abstract: Aiming at the data characteristics of the reservoir operation system, the system's database is designed and developed by utilizing comprehensively object_oriented method and fast prototype approach. The functions of reservoir operation database system are analyzed by using the multi_visual angle techniques. The system's conceptual model, logical model and function model is obtained based on the object_oriented method. Some key technical issues, such as computer server program, client interface and Web database program are exploited and discussed. The developed database software has been applied to more than 20 reservoirs or hydropower plants in China and it is shown that the software system is reliable, stable and safety.
TL;DR: The functions of the database system include management, inquire and systemize the design of the composite to develop a database system for the composite designing.
Abstract: This paper develops the database system of the composite designing .A large amount of data have been collected, arranged and stored in the database. The functions of the database system include management, inquire and systemize the design of the composite.
TL;DR: This document describes key generation, database layout, and search of the database of the Geometric Search Engine.
Abstract: The Geometric Search Engine is a software system for storing and searching a database of geometric models. The database maybe searched for modeled objects similar in shape to a target model supplied by the user. The database models are generally from CAD models while the target model may be either a CAD model or a model generated from range data collected from a physical object. This document describes key generation, database layout, and search of the database.