Data Links
Data Quality Tools for Data Warehousing - A Small Sample Survey
This paper, generated by the Center for Technology in Government, discusses
how data quality is one of the biggest issues people face when integrating
data. Data quality tools are used in data warehousing to ready the data
and ensure its cleanliness. This research focuses on how the data quality
tools address problems in data.
Data Quality and Systems Theory
This paper, written by Ken Orr from the Ken Orr Institute, discusses how
data quality, meta data, and systems theory influence information systems
and the results they generate.
Total Data Quality Management Research Program at M.I.T.
The overall objective of this program is to establish a solid theoretical
foundation for Data Quality Management and to devise practical methods for
business and industry to improve data quality. It includes tools and other
capabilities necessary for data quality management in the technical, economic,
and organizational phases of business operations. Also listed on this
site are links to conferences and papers about data quality.
DATA QUALITY "is an annual peer-reviewed journal founded in 1994 by
a group of statisticians, information scientists, and quality practitioners.
DATA QUALITY includes original research papers and book reviews about
data and information quality. They range in scope from general interest
to theoretical."
Tools for Traveling Data
This article, written by Joseph Williams, discusses the idea that "one
of the most important but overlooked steps in building a data warehouse
is loading data in the warehouse database. This article discusses three
broad categories of tools for helping load data into a data warehouse - data
quality, extraction and transformation, and cleansing."
Introduction to Metadata: Setting the Stage
“Metadata, literally "data about data," is an increasingly ubiquitous term that is understood in different ways by the diverse professional communities that design, create, describe, preserve, and use information systems and resources.” This article provides a description of the different uses and categories of metadata and their various roles and functions.
Technology Policy 97-6 Geographic Information Systems (GIS) Data Sharing
This document, set forth by the NYS Office for Technology, discusses how
electronic geographic data can be shared among federal, state, and local
agencies. It reviews provisions that have been made to ensure that GIS
data is shared in an appropriate and consistent manner. Custodianship,
standards, maintenance, pricing, requests, and submissions are outlined.
Technology Policy 96-19 Data Sharing Among Agencies
This document, set forth by the NYS Office for Technology, describes steps
that NYS agencies should follow when considering sharing data electronically.
In addition, this policy helps ensure a standard process of data sharing
among NYS agencies. Each step describes a key component in data sharing
and includes questions that can help an agency prepare and complete that
step. Also included is a sample case study that describes one agency's
initiative through each step.
Technology Policy 97-3 Statewide Data Dictionary
This policy, set forth by the NYS Office for Technology, announces
the establishment of a "Statewide Data Dictionary" that features a core
section of data elements that cross all State agencies (e.g., name and address).
It also has a section dedicated to each functional area of government
activity (e.g., criminal justice and health) that covers the data elements
unique to that area.
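As a rough illustration of the two-tier layout described above, the sketch below models a shared core section plus one section per functional area. It is not taken from the policy itself: the class name `DataDictionary`, the method `elements_for`, and the element names `case_number` and `provider_id` are all hypothetical, chosen only to show the idea.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class DataDictionary:
    """Hypothetical two-tier dictionary: a shared core plus per-area sections."""

    core: set[str] = field(default_factory=set)
    areas: dict[str, set[str]] = field(default_factory=dict)

    def elements_for(self, area: str) -> set[str]:
        # An area sees the core elements plus the elements unique to it.
        # An unknown area falls back to the core elements only.
        return self.core | self.areas.get(area, set())


# Illustrative entries only; these element names are assumptions.
dictionary = DataDictionary(
    core={"name", "address"},
    areas={
        "criminal justice": {"case_number"},
        "health": {"provider_id"},
    },
)

health_elements = dictionary.elements_for("health")
```

Keeping the core elements in one place means an agency in any functional area resolves "name" or "address" to the same definition, while area-specific elements stay scoped to the area that owns them.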
Meta Data Standards and Registries: An Overview
This paper (in PDF format), generated by the US Environmental Protection
Agency and the US Bureau of Labor Statistics, discusses the work that
is being done to reach consensus on standardizing meta data and registries
for organizing that meta data. It provides information about meta data
and also goes in-depth on the impact a meta data registry can have on
a statistical agency.
Dr. Tom's Meta-Data Primer
This paper, written by Dr. Thomas Wason, provides a basic understanding
of meta data, how it is structured, what it means, and how it is represented.
It also describes when there is sufficient meta data, and how meta data
is scalable and interoperable with other systems.
Creating a Statewide Spatial Data Repository and Geographic Information System Data Cooperative
This project shows that data sharing was integral to the successful
creation of the NYS GIS Clearinghouse. The project focused on tools and
policies for data sharing. A prototype meta data repository accessible
over the Internet was produced. It included an inventory of spatial data
resources around the state, and a set of policy and management recommendations
for a permanent data cooperative.
An Introduction to Metadata: Pathways to Digital Information
This site is a collection of metadata articles, a suite of standards,
and useful links for Metadata for the World Wide Web. This metadata resource
strives to "help those with a stake in the debate (everyone from librarians
to museum professionals to anyone who intends to make information available
via the Internet) to avoid mistakes and wasted effort, and to make informed
decisions about the information they seek to record and disseminate."
The Getty Research Institute, which sponsored the development of this
site, "promotes innovative scholarship in the arts and humanities, to
bridge traditional academic boundaries, and to provide a unique environment
for research, critical inquiry, and debate."
Research and Practical Experiences in the Use of Multiple Data Sources for Enterprise Level Planning and Decision Making: A Literature Review
This paper, written by the Center for Technology in Government, discusses
the challenge many agencies face in integrating data sources
to provide a broader look at programs and services. To do this, information
sharing is essential. It allows agencies to improve planning and increase
productivity. The use of multiple data sources for enterprise level planning
and decision making is also increasing. This paper identifies current
research. It also outlines practical experiences in the use of multiple
data sources to support performance measurement, strategic planning, and
interorganizational business processes.
Dealing with Data Seminar Summary Report
This report (in PDF format), from CTG's seminar "Dealing with Data," addresses
many data issues through summaries of presentations on data quality management,
data tools and techniques, long term maintenance and preservation, and other
data issues.
Putting Information Together: Building Integrated Data Repositories Seminar Summary Report
Using CTG's Homeless Information Management System prototype project as
an example, this seminar summary report discusses the management, policy,
and technology issues organizations face when integrating data from multiple
sources to create a new information resource for cross-program,
cross-organizational decision making and planning.
DM Review
DM Review is a monthly issues and solutions publication that focuses on
data warehousing and business intelligence.
DB2 Magazine
DB2 Magazine is an electronic publication dedicated to providing strategies
and solutions for database programmers, administrators, and users.
Data Warehousing: Introduction to data warehousing
A web-based resource produced by the Operational Research Society to help members of the OR community gain a good understanding of data warehousing. Efficient data warehousing is seen as crucial in enabling companies and organizations to utilize existing information by providing a central data repository of stable, accurate, consistent, and clearly understood data.
Turning Data into Understanding: A Field Guide to Knowledge Support Technology
This guide (in PDF format), developed by the NYS Forum for Information Resource
Management, is designed to help government decision makers determine whether
technology tools can and should be used to better understand the problems
facing their organizations.
Keyword Search Suggestions
- "Data Integration"
- "Data Mining"
- "Data Standards"
- "Data Warehousing"
- "Data Cleansing"
- "Data Extraction"
- "Data Migrating"