
Saikumar Chappa
Saikumar Chappa
About
Detail
Data Engineer
New York, United States
• 4+ Years of professional IT experience in all phases of Software Development Life Cycle including hands on experience in Big Data Analytics. • Hands on experience using Hadoop tools like HDFS, Hive, Apache Spark, Apache Sqoop, Flume, Oozie, Apache Kafka, Apache storm, Yarn, Impala, Zookeeper, Hue. • Experience on Migrating SQL database to Azure data Lake, Azure data lake Analytics, Azure SQL Database, Data Bricks and Azure SQL Data warehouse and controlling and granting database access and Migrating On premise databases to Azure Data Lake store using Azure Data factory. • Experience in Developing Spark applications using Spark - SQL in Databricks for data extraction, transformation and aggregation from multiple file formats for analyzing & transforming the data to uncover insights into the customer usage patterns. • Good understanding of Spark Architecture including Spark Core, Spark SQL, Data Frames, Spark Streaming, Driver Node, Worker Node, Stages, Executors and Tasks. • Good understanding of Big Data Hadoop and Yarn architecture along with various Hadoop Demons such as Job Tracker, Task Tracker, Name Node, Data Node, Resource/Cluster Manager, and Kafka (distributed stream processing) . • Experience in Database Design and development with Business Intelligence using SQL Server 2014/2016, Integration Services (SSIS), DTS Packages, SQL Server Analysis Services (SSAS), DAX, OLAP Cubes, Star Schema and Snowflake Schema. • Strong skills in visualization tools Power BI, Confidential Excel - formulas, Pivot Tables, Charts and DAX Commands. • Experience in analyzing data using HiveQL, and MapReduce Programs. • Experienced in ingesting data into HDFS from various Relational databases like MYSQL, Oracle, DB2, Teradata, Postgres using sqoop. • Experienced in importing real time streaming logs and aggregating the data to HDFS using Kafka and Flume. • Well versed with various Hadoop distributions which include Cloudera (CDH), Hortonworks (HDP), Azure HD Insight. • Extending HIVE and PIG core functionality by using custom User Defined Function's (UDF), User Defined Table Generating Functions (UDTF) and User Defined Aggregating Functions (UDAF) for Hive and Pig. • Experience working on NoSQL Databases like HBase, Cassandra and MongoDB. • Experience in Python, Scala, shell scripting, and Spark. • Experience with Testing Map Reduce programs using MRUnit, Junit and EasyMock. • Experience on ETL methodology for supporting Data Extraction, transformations and loading processing using Hadoop. • Worked on data visualization tools like Tableau and also integrated the data using ETL tool Talend. • Hands on development experience with JAVA, Shell Scripting, RDBMS, including writing complex SQL queries, PL/SQL, views, stored procedure, triggers, etc. • Passionate about working on the most cutting-edge Big Data technologies. • Willing to update my knowledge and learn new skills according to business requirement.
Contact Saikumar regarding:
work
Full-time jobs
Starting at
USD95K/year