Greenplum Summit

Federated Analytics

 
August 12, 2020
9AM - 11AM PDT

Federated Analytics: Wrangling Business Insight from Disparate Systems


Your organization is producing more data than ever, and you’re tasked with making it all accessible. How do you quickly cleanse and munge data in a useful way? In this track, Greenplum users describe how they parse data in parallel, with ANSI SQL or with languages such as Python, R, and Java. Users will also review how you can stream data in real time through Greenplum with common ELT tools and explore more advanced techniques that improve performance and accuracy.

Agenda



9:00 - 9:10 am Intro
9:10 - 9:30 am
PXF: The query federation engine for the modern enterprise
9:30 - 9:50 am ETL In The Wild - Why Federated Matters
9:50 - 10:10 am Using Nuix and PXF to load unstructured data into Greenplum
10:10 - 10:30 am Ingest data into Greenplum using Apache NiFi
10:30 - 10:50 am Federated Data Wrangling in Practice - An Interview with Conversant
10:50 - 11:00 am Closing

Speakers

jacques.jpeg
Jacque Istok, Vice President Data at VMware
Jacque Istok serves as Head of Data for VMware, responsible for setting data strategy and the activities for data engineering and data science. Prior, he was Field CTO helping customers architect and understand the entire VMware portfolio. A hands on technologist, he has been implementing and advising customers in architecture and back end infrastructure the majority of his career. Before VMware, Mr. Istok co-founded Professional Innovations, Inc. in 1999, a leading consulting services provider in the business intelligence and data warehousing space. Mr. Istok is on the board of several emerging startup companies and serves as their strategic technical advisor.
Divya Bhargov
Divya Bhargov, Engineering Director at VMware
Divya is an Engineering Lead for Tanzu Greenplum Database. She has worked with several areas of Greenplum including query processing and optimization, query federation and Advanced Analytics. She has full stack experience and has worked with companies large and small companies across the world.
Derek Comingore.jpeg
Derek Comingore, Global Presales Lead, Data Platforms & Services at VMware
Derek is a cross-functional field leader with deep expertise spanning presales, customer success and channel functions in early-stage technology companies. In his time at enterprise data companies he has led regional and global presales organizations. Derek has led EBITDA neutral professional services teams as well.
Prior to Pivotal Software, he built and sold a niche distributed systems integrator. Derek has a deep technical background in distributed data platforms with the key ability to communicate associated business value. Aligning channels and alliances for joint success he creates value-driven global partnerships.
Michiel Shortt.jpg
Michiel Shortt, Data Scientist at A42 Labs
Michiel is a Data Scientist with a strong background in optimization and applications of advanced mathematical algorithms. Interested in the limits of computing since a teenager, he has had a lifelong interest in the mathematics behind computer science, in particular AI and machine learning.
His professional experience includes software development and the application of machine learning and optimization algorithms to diverse business problems, especially in the fields of industrial engineering and logistics.
Alexander Denissov.jpeg
Alexander Denissov, Staff Engineer at VMware
Alexander Denissov is leading development of products that integrate Greenplum Database with external systems. He has 20+ years of industry experience with interests in middleware, backend services, big data analytics and data processing. He presented at conferences such as ApacheCon, PostgresConf, Hadoop World and is a committer of several Apache projects.
Francisco_Guerrero (1).jpg
Francisco Guerrero, Senior Member Technical Staff at VMware
Francisco Guerrero is passionate about data, and a machine learning enthusiast and contributes to open source projects including PXF and Greenplum. He holds a Masters in Computer Science, and as part of Greenplum, he contributes to Greenplum's integration to external data ecosystems such as Spark, Hadoop, relational databases, cloud storages among others.
Ashuka_Xue.png
Ashuka Xue, Staff Engineer at VMware
Ashuka Xue joined 2 years ago after interning with the Greenplum Database (GPDB) team during her graduate program. After completing her Masters in Software Engineering, she came back to work on query processing and optimization with GPDB's query optimizer (GPORCA). She recently started working on projects that integrate GPDB with external data ecosystems.
Shaun Litt.png
Shaun Litt, VP Data Warehouse Architecture at Conversant
Shaun Litt is the VP of data platform architecture responsible for big data solutions and systems at Conversant where he built the companies first user accessible data warehouse using Greenplum Database. Currently Conversant leverages Greenplum Database and Hadoop together to manage its multi-petabyte data platform supporting with Shaun and his team working tirelessly to power the business with more data and more automated analytics and reporting. Prior to joining Conversant Shaun was the enterprise architect at Sears Holdings for Data Integration. In total Shaun has spent the past 15 years building big data solutions for some of the nation’s largest companies specializing in ETL, MPP database and Hadoop technologies.  

Code of conduct

VMware is dedicated to providing and promoting a safe and inclusive environment for everyone, regardless of gender identity and expression, sexual orientation, disability, neurodiversity, physical appearance, body size, ethnicity, nationality, race, age, religion, or other protected categories. Because we don’t always know what might make someone else feel uncomfortable or unwelcome, these guidelines are meant to reinforce our values and clarify our expectations of all of those who participate in our community and events. They also define unacceptable behavior and provide instructions on what you can do if you see unacceptable behavior.

We each have different backgrounds and experiences. At VMware, we have three core values that we expect all to abide by: do the right thing, do what works, and be kind.

Expected behavior

  1. Be considerate, respectful, and collaborative.
  2. Refrain from using any demeaning, discriminatory, or harassing behavior and speech.

Unacceptable behavior

Unacceptable behavior may include, but is not limited to, the following:

  1. Harassment and discrimination, which includes: any verbal, physical, or visual conduct based on sex, sexual orientation, gender expression or identity, transgender status, race, age, national origin, disability, or other protected characteristics; inappropriate use of nudity and/or sexual images in public spaces (including presentation slides); deliberate intimidation, stalking, or following; harassing photography or recording; sustained disruption of sessions, talks, or other events; and inappropriate physical contact, or any unwelcome sexual attention.
  2. Using any boisterous, lewd, or offensive behavior or language; using sexually explicit or offensive language or conduct; using profanity, obscene gestures, or racial, religious, homophobic, transphobic, or ethnic slurs; commenting about a person’s body or sexual activity; and discussing sexual activity.
  3. Failing to obey any of the rules or regulations of the event or instructions of the event staff.
  4. Participating in any other illegal activity not already covered above.

Consequences of unacceptable behavior

We won’t tolerate unacceptable behavior. If a participant engages in unacceptable behavior, the event organizers may take any action they deem appropriate, up to and including expulsion from the event without warning or refund and, if appropriate, involving local law enforcement. Anyone asked to stop unacceptable behavior is expected to comply immediately.

How to report unacceptable behavior

If you have a complaint or concern about harassment, discrimination, bullying, or any other conduct that violates the VMware Events Code of Conduct, please contact [email protected].

We thank our attendees in advance for your help in keeping the event welcoming, respectful, and friendly to all participants.

Register

Thank you!

We look forward to you joining us.

Location

675 Ponce de Leon Avenue NE
Suite 7500
Atlanta, GA 30308
Get directions