John G.

Big Data Engineer: Databricks | Hadoop | Spark | Kafka | Splunk | Elasticsearch | Solr | Accumulo | AWS (EC2, S3, Glue, Lambda, Athena, SNS, CloudFormation, etc.) | NiFi | Pentaho | Python | Scala | Java | Shell @ Federal Contractor
Washington, DC, US
John G.'s Location
Washington DC-Baltimore Area, United States
About John G.

Experienced in data ingestion, including developing and maintaining a data frame application that ingests hundreds of millions of records daily. Perform and oversee data ingestion from a landing server to an HDFS cluster of 70+ servers, data mapping, data loads to Accumulo, SolrCloud indexing, and Hadoop MapReduce jobs run with Pig, UTF, and cron shell scripts. Set up Accumulo tables and auth lists, set up Cloudera SolrCloud instances and sharding, and perform Solr indexing and Solr performance tuning. Monitor Accumulo performance and data nodes, and recycle job trackers and data nodes across the Cloudera cluster. Set up Splunk forwarder servers, search heads, and third-party Splunk apps to continuously ingest Nginx logs for analytics and dashboard reports for data scientists and stakeholders.

Extensive experience in the full software development cycle, object orientation, distributed systems, multi-tier architectures (SOA, EIS, middleware), JEE, public/private cloud, databases, and integration.

John G.'s Current Company Details
Federal Contractor

Website: farbon.co.uk
Employees: 239
John G. Work Experience Details
  • Federal Contractor
    Federal Contractor
    Washington, DC, US
  • Federal Contractor
    Big Data Engineer
    Federal Contractor Jan 1999 - Present
    Washington, DC, US
    20+ years in IT, including 10+ years in big data analytics. Experienced in building end-to-end ETL pipelines covering data ingestion, transformation, indexing, and cataloguing, ingesting hundreds of millions of records daily from a wide variety of data sources on a Cloudera Hadoop cluster of 70+ servers within the Hadoop ecosystem (ZooKeeper, data lake). Store and manage large data sets in the columnar (NoSQL) store Accumulo across the cluster, and store data in an Elasticsearch cluster of 40+ servers.

    Design schemas and sharding for a distributed, fault-tolerant SolrCloud text search cluster. Use a combination of Kafka messaging, Elasticsearch, Accumulo, and complex regular expressions to ingest, clean, and transform data for machine learning in data science projects. Perform continuous Nginx and S3 streaming log analysis using a Splunk search head and forwarders.

    Perform data transformation on billion-record health care data sets using Databricks SparkSQL/Scala and graphing capabilities. Experience using AWS S3, Glue, and Athena to build a data catalogue and search sustainability data. Set up a Google Cloud Platform Dataproc cluster to perform data transformation and analytics on data in a Google Cloud bucket.

    Experience with clustered NiFi data flows and Pentaho to build ETL pipelines with Kafka, Elasticsearch, AWS S3, and Accumulo. Expertise in data parsing with regular expressions; data extraction and mapping of XML, JSON, and binary data; base64 encoding/decoding; and GPG encryption. Extensive experience in the full software development cycle, object orientation, distributed systems, multi-tier SOA, JEE, cloud platforms, databases, data integration and engineering, data migration, Java development, Scala, and shell scripting.

    Skills
    AWS Cloud: EC2, S3, Athena, Glue, SNS, Analytics, EMR, RDS; Azure Blob storage; Google Cloud Platform Dataproc; Python
    Big Data: Databricks on AWS/Azure, Cloudera Hadoop/YARN, SparkSQL, PySpark, ZooKeeper, Accumulo, Hive, SolrCloud search engine, Elasticsearch, Splunk, Pentaho, NiFi, Kafka

John G. Education Details

  • Johns Hopkins Whiting School of Engineering
    Computer Science

Frequently Asked Questions about John G.

What company does John G. work for?

John G. works for Federal Contractor.

What is John G.'s role at the current company?

John G.'s current role is Big Data Engineer: Databricks | Hadoop | Spark | Kafka | Splunk | Elasticsearch | Solr | Accumulo | AWS (EC2, S3, Glue, Lambda, Athena, SNS, CloudFormation, etc.) | NiFi | Pentaho | Python | Scala | Java | Shell.

What is John G.'s email address?

John G.'s email address is jo****@****hoo.com

What schools did John G. attend?

John G. attended Johns Hopkins Whiting School of Engineering.

Who are John G.'s colleagues?

John G.'s colleagues are Chantel Rexroth, Steven Lopez, Kelli Brockington, Sheila Parker, Vivian Covington, Vishal Harinkhede, Wade Brown.
