Optimize business analytics with data lineage

Offering Manager, IBM InfoSphere Information Governance Catalog, IBM

Modern enterprises go to great lengths to ensure both the quality and the trustworthiness of their information, evaluating not only the sources of their data but also how that data has been transformed or aggregated. Indeed, businesses often assign quality scores to the information they take in, knowing that enterprise users base their decisions on the insights they glean from data. Moreover, such decisions—whether descriptive, predictive or prescriptive—provide new ways of understanding data, and in doing so, they continually reshape the relationship between users, data and information systems.

Understand your information

Every data governance solution worth the name allows its users to audit and visualize the movement of information, a capability made possible by data lineage. Moreover, data governance infuses information with added business value, whether in the form of enterprise definitions, of business rules or of stewardship. Thus data lineage—and, by extension, data governance—provides a solid foundation for business analytics, not least because an up-to-date understanding of information helps regulate information while preserving its vitality and force.

A comprehensive data governance initiative preserves data integrity even while defining that same data. But achieving data governance requires the ability to track how information is consumed and then convey the relevant findings through a data lineage report. Such a report can be specific to a given project, but it can also span multiple projects and domains. A data lineage report helps consumers identify the sources from which their data has been aggregated or transformed and, in so doing, helps build confidence in the data used in business intelligence reports and analytical services.
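To make the idea of tracing data back to its sources concrete, here is a minimal sketch in Python. It is illustrative only: the asset names, the `LINEAGE` mapping and the `upstream_sources` function are all hypothetical, and none of this reflects how IBM Information Governance Catalog stores or reports lineage. It simply assumes that each asset records the assets it was derived from, then walks those links back to the original sources.

```python
from collections import deque

# Hypothetical lineage records: each asset maps to the assets it is derived from.
# The names are invented for illustration and do not come from any real catalog.
LINEAGE = {
    "bi_sales_report": ["sales_mart"],
    "sales_mart": ["orders_staging", "customers_staging"],
    "orders_staging": ["erp_orders"],
    "customers_staging": ["crm_customers"],
}


def upstream_sources(asset, lineage):
    """Return the root sources (assets with no recorded inputs) that feed `asset`."""
    seen = set()
    roots = set()
    queue = deque([asset])
    while queue:
        current = queue.popleft()
        if current in seen:
            continue  # guard against revisiting shared or cyclic dependencies
        seen.add(current)
        parents = lineage.get(current, [])
        if not parents and current != asset:
            roots.add(current)  # nothing upstream is recorded, so treat it as a source
        queue.extend(parents)
    return roots


print(sorted(upstream_sources("bi_sales_report", LINEAGE)))
# prints ['crm_customers', 'erp_orders']
```

A real lineage report would carry far more detail, such as the transformation applied at each step, but the same backward walk is what lets a report consumer see where a figure ultimately came from.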

Take advantage of your data

Data lineage supports change management and efficient integration across a range of systems. Accordingly, it can help guide proposed changes, in part by driving collaboration initiatives. In particular, data lineage can represent either of the following:

  • Design information reflecting the design and development of extract, transform and load (ETL) jobs, processes or scripts
  • Operational information reflecting the runtime results achieved by executing such jobs, processes or scripts

Moreover, data lineage can also make possible the following governance capabilities:

  • Definition of meaning through assignment to glossary terms
  • Specification of requirements by assignment to governance rules
  • Identification of data owner and information subject area, among other facets of information

Seamless data lineage delivery and thorough implementation of automated processes can help provide accurate information about the lineage of essential enterprise information assets. Customers can then derive immediate benefits from their information while remaining compliant with an ever-expanding array of government regulations and internal corporate data governance initiatives. Watch this webcast to explore how the IBM Information Governance Catalog supports data lineage capabilities, then find out more about the data governance capabilities offered by IBM solutions by visiting the IBM data governance web page.