Hotsos Symposium Speaker – Gwen Shapira

Gwen Shapira is a returning Hotsos Symposium presenter.


Gwen Shapira is a Senior Consultant at Pythian, an Oracle ACE Director and a member of the OakTable Network. Gwen studied computer science, statistics, and operations research at the University of Tel Aviv, and then went on to spend the next fifteen years in different technical positions in the IT industry.

She specializes in scalability and high-availability solutions such as RAC, Exadata and Hadoop. Gwen is a frequent presenter at Oracle conferences and writes for the popular Pythian blog.

Presentation Titles

Session 1: Building an Integrated Data Warehouse with Oracle Database and Hadoop

Session 2: Visualizing Database Performance Using R


Session 1: Building an Integrated Data Warehouse with Oracle Database and Hadoop - From tracking customers in online stores to tweets and blog posts, unstructured data is growing rapidly, and businesses are looking for ways to analyze it. In this presentation, Gwen will explain why storing and processing unstructured data is a challenge best answered by specialized systems such as Hadoop. She will dive into how Hadoop works and why it is such a scalable solution for managing unstructured data. Gwen will show how to integrate Hadoop with existing data warehouse systems on Oracle so that the data can be used in existing BI tools and reports.

Session 2: Visualizing Database Performance Using R - As DBAs, we work with data all day long. Not only are we responsible for effectively storing and reporting business-critical data, but Oracle's wait interface also generates huge amounts of information for us to sift through when troubleshooting. This presentation will introduce you to the world's most powerful data analysis tool: your eyes. Our brains were built to process massive amounts of data very quickly, provided the data is presented in the right form. In this presentation, Gwen will introduce R, the popular open-source statistical computing software. She will explain how R can be used to analyze performance data and describe the most effective ways to use R to visualize large amounts of data: ways to explore the data effectively, come up with relevant insights, and communicate the information to others.

Presentation Materials

Presentation materials are available online to attendees only.


The speaker schedule is as follows: