•9+ years of experience in IT industry specializing as Hadoop/Java Developer with 4+ years of experience in Big Data ecosystem related technologies like Hadoop HDFS, Map Reduce, Apache Pig, Spark, Hive, Sqoop, HBase, Flume, and Oozie.•Strong hands on experience in Hadoop Framework and its ecosystem including HDFS Architecture, MapReduce Programming, Hive, Pig, Sqoop, HBase, Zookeeper, Couchbase, Storm, Solr, Oozie, Spark, Scala, Flume, Strom, and Kafka. •Excellent knowledge on Hadoop Architecture and ecosystems such as HDFS, Job Tracker, Task Tracker, Name Node, Data Node and Map Reduce programming paradigm. •Experience in analyzing data using HIVEQL and Pig Latin and custom Map Reduce programs in Java and Scala. •Experience in strong and analyzing data using HiveQL, Pig Latin, HBase and custom Map Reduce programs in Java. •Experience in importing and exporting data into HDFS and Hive using Sqoop. •Good knowledge on Amazon AWS concepts like EMR & EC2 web services which provides fast and efficient processing of Big Data. •Hands-on experience in installing, configuring Cloudera's Apache Hadoop ecosystem components like •Flume-ng, HBase, Zookeeper, Oozie, Hive, Spark, Storm, Sqoop, Kafka, Hue, Pig, Hue with CDH3&4 Clusters •Architected, Designed, and maintained high performing ELT/ETL Processes. •Skilled in managing and reviewing Hadoop log files. •Experienced in loading data to Hive partitions and creating buckets in Hive •Experienced in configuring Flume to stream data into HDFS. •Experienced in real-time Big Data solutions using HBase, handling billions of records. •Processing this data using Spark Streaming API with Scala. •Familiarity with distributed coordination system Zookeeper. •Involved in designing and deploying a multitude application utilizing the entire AWS stack (Including EC2, RDS, VPC, and IAM) focusing on high availability, fault tolerance and auto-scaling. •Good knowledge on building Apache spark applications using Scala. •Experience in developing and designing POCs using Scala and deployed on the Yarn cluster, compared the performance of Spark with Hive and SQL/Teradata. • Have a particularly good understanding and worked with relational databases like MySQL, Oracle, and NoSQL databases like HBase, Mongo DB, Couchbase and Cassandra. • Good work experience on JAVA, JDBC, Servlets, JSP. • Proficient in Java, J2EE, JDBC, Collections, Servlets, JSP, Struts, Spring, Hibernate, JAXB, JSON,XML, XSLT, XSD, JMS, WSDL, WADL, REST, SOAP Web services, CXF, Groovy, Grails, Jersey, Gradle and Eclipse Link.