Modeling Unstructured Data

Be prepared to analyze and design complex data requirements!

 

As data requirements mature, as megabytes become cheaper, as CPU speed becomes faster, we as analysts and modelers will be faced with more complex requirements. Many of these requirements will depend on the efficient storage and retrieval of unstructured data. Merrill Lynch recently estimated that over 85% of all business information exists as unstructured data. The sheer quantity and complexity of unstructured data opens up many new opportunities for the analyst and modeler.

Imagine requirements such as:

  • Show me consumer feedback on my product from all website discussion groups for the last six months.
  • Show me all photographs taken of the fountains in Rome from the summers of 2002 through 2007.
  • Show me all contracts which contain a particular liability clause.

As with many areas of information technology however, there is ambiguity and confusion on the definitions and therefore the business and information technology implications of unstructured data.

This seminar will clearly explain unstructured data as well as related concepts such as ontologies and taxonomies in a context that will help you identify new business intelligence opportunities and design efficiencies for your organizations. Through a combination of lecture and exercises, we will reinforce unstructured data concepts and challenges.

   

View description in pdf format

Listen to Sybase PowerDesigner Webinar on Unstructured Data

Learn:

  • Distinguish between structured, semi-structured, and unstructured data
  • Separate Content from Requirements
  • Build relational and dimensional models which leverage text tagging, ontologies, and taxonomies  

    Outline:

    1. The necessity of ‘Wayfinding’ in today’s rapidly changing business intelligence world
    2. The need to expand our understanding of traditional modeling concepts such as meta data, domains, and class words
    3. The distinguishing factor between structured, semi-structured and unstructured data
    4. The recurring dream every CEO and CIO share
    5. The necessity to separate content from requirements
    6. The challenges that unstructured data will surface
    7. The expanded set of activities the analyst and modeler will perform in the next 2-5 years
    8. Let’s practice! We will reinforce our understanding of structured, semi-structured, and unstructured data by analyzing and modeling a number of challenging projects
    9. The difference between a taxonomy and an ontology