PyConf Hyderabad 2017

PyConf Hyderabad

Pyspark - Python and Spark

Submitted by Durga Gadiraju (@itversity) on Wednesday, 9 August 2017

videocam_off

Technical level

Intermediate

Status

Submitted

Vote on this proposal

Login to vote

Total votes:  +11

Abstract

Data Engineering at scale using Python and Big Data eco system of tools.

Outline

Here is the high level outline for the workshop: 1) Revision of basic python programming 2) Overview of Big Data eco system 3) Data Engineering at scale with Spark core APIs using Python as programming language 4) Overvew of Spark SQL and Data Frames 5) Development life cycle and execution life cycle. Training will be provided using state of the art 10 node Big Data cluster with hands on approach.

Requirements

  • A laptop (64 bit operating system and 4 GB RAM are highly desired)
  • Browser - Chrome or Firefox
  • Basic understanding of Python programming - loops, exception, file handling and collections

Speaker bio

Durga Gadiraju is technology evangelist and consultant with close to 14 years of experience in building data driven applications at scale. For past 4 years, Durga is primarily focused on Big Data in the areas of consulting, delivery and training. His online platform itversity, is well known in IT community in the areas of Big Data and Cloud. itversity will be a free continuous learning platform for IT professionals.

Links

Comments

  • -1
    Venkatesh Kuntla (@venkateshkuntla) a year ago

    Great initiative.

  • -1
    Tulasi Ram K (@ram11122251) a year ago (edited a year ago)

    Great initiative for learners..

  • -2
    RAMu BOJEDLA a year ago

    Great initiative

  • -3
    Abhishek Singh (@singhabhishek) a year ago

    Registered for this conf only because of this session

Login with Twitter or Google to leave a comment