?xml version="1.0" encoding="UTF-8"?> Hotsos Symposium Speakers and Abstracts

Hotsos Symposium Speaker – Marco Gralike

Marco Gralike is a returning Hotsos Symposium presenter.

Biography

Marco GralikeMarco Gralike, working for AMIS Services BV as a Principal Oracle Database Consultant in the Netherlands, has experience as a DBA since 1994 (Oracle 6). Marco is also eager and skillful in other fields, like Operating System Administration and Application Servers, mainly in finding working, performing solutions.

Marco has been specializing in Oracle XMLDB, since 2003, focusing on his old love, database administration and performance. He is an Oracle XMLDB enthusiast. He is also a dedicated contributor of the Oracle User Group community, helping people with their steep XMLDB learning curve. To this purpose, Marco also devoted his personal blog site to XMLDB and other Oracle issues.

Marco is a frequent speaker at conferences like Oracle OpenWorld, UKOUG, RMOUG, ODTUG, BGOUG and local user group seminars in Holland. He has been awarded the Oracle ACE title for his work in the Oracle XMLDB area (2007) and has been since 2009 a member of the OakTable network.

In 2010, he was awarded the Oracle ACE Director status and was asked to become a member of the Oracle XMLDB Customer Advisory Board for the next major release. He has been the AMIS Oracle beta lead for database versions 11.x and 12.1.

Presentation Title

Creating Structure in Unstructured Data

Abstract

The energy and performance drain needed to get something useful out of unstructured data is enormous. Organizations want to do more with all their structured, semi-structured or unstructured data, and when this phenomenon ends up in your database, it will become a serious performance headache. The presentation makes use of the English XML MediaWiki data set (freely available on the internet) and the Oracle 12c database.

This data set, on which this presentation demonstrates its findings, contains enormous amounts of those structured, semi-structured and unstructured components, and consists of more than 10+ million free text XML documents. The presentation will show and demonstrate how to get this data, performance-wise, under control via applying logical and physical design principles. Results are shown and discussed, what the effects are if you add partitioning, securefile storage and specificially crafted domain index methods to this performance-draining monster.

Presentation Materials

Presentation materials are available online to attendees only.

Schedule

The speaker schedule is as follows: