Querying Data with the Select statement
If you are going to do anything with data you are going to need to learn how to query data. And the language to query the majority of those systems is through SQL (Structured Query Language). The pull data from any systems you are going to be relying on the Select statement. The features in the statement may vary slightly from platform to platform, but the core capabilities are the same.
In this session, we will dive deep into the SELECT statement as the window to getting your data. In this session, we will be using one platform, but the lessons learned here will be applicable to querying any system that supports a SQL language interface. This session will cover things like querying, joins functions, aggregation, subqueries, and much more. This session will give you the foundation to start analyzing data in any database system.
Pandas for Data Processing
Python is a language of choice for data engineers and scientists because of it's ease of use for data processing and analytics. One of the most popular libraries to work with data is Pandas. This session will walk through loading data from JSON file, exploring the data, applying filters, and some basic analysis.
Andrew is a Data Professional that is super passionate about the ability of data to drive change in our world, work, and personal lives. He has over 15 years of experience in multiple roles, companies, and technologies. A passion for learning is core to how he approaches life. Over 10+ years Andrew has spoken at User Groups, Code Camps, SQL Saturdays, and Conferences along with supporting the local community as a SQL Saturday Organizer, User Group leader, and advocate for community involvement for all technology professionals. He was also recognized by Microsoft as a four-time SQL Server MVP for his technical community contributions.Bio: Dustin Vannoy
Profile: With more than a decade working hands-on with data platforms and analytics, Dustin is currently focused on helping organizations move to fast, streaming data flows from the slow, batch ETL of the past. He uses Python and Scala to develop data pipelines that leverage Spark, Kafka, and Hadoop. He is also passionate about doing what he can to increase diversity in tech and to help teach new skills to the communities of Southern California and Baja California.