John G. Email and Phone Number
John G. work email
- Valid
John G. personal email
- Valid
Experience working in perform data ingestion, develop and maintain data frame application to ingest hundreds of million records daily. Perform and oversight data ingestion from landing server to HDFS of +70 servers, data mapping, data load to Accumulo, index Solr Cloud, running Hadoop Map/Reduce jobs with Pig, UTF, and cron shell scriptSet up Accumulo table and auth list, set up Cloudera SolrCloud instances and shardings, perform Solr index, Solr performance turning. Monitor Accumlo performance, data node, recycle Job trackers and data nodes across Cloudera cluster. Setup Splunk forwarder server, Search Head, Splunk third party App to continuous ingest Nginx log for analytics and dashboard report to data scientist and stake holder.Profound experience in full software development cycle, Object Orientation, distributed systems, multiple-tier (SOA, EIS, middleware),JEE, public/private cloud, database, and integration
Federal Contractor
View- Website:
- farbon.co.uk
- Employees:
- 239
-
Federal ContractorWashington, Dc, Us -
Big Data EngineerFederal Contractor Jan 1999 - PresentWashington, Dc, Us20+ years of IT, 10+ years of big data analytics. Experience in perform end-to-end ETL pipeline from data ingestion, transformation, indexing, catalogue, develop and to ingest hundreds of millions records daily from vast variety data sources using Cloudera Hadoop cluster of +70 server in big data Hadoop ecosystem, Zookeeper, data lake, store and manage large data sets in columnar (NoSQL) Accumulo across cluster, store data in cluster of Elasticsearch of +40 servers.Design schema and sharding the distributed and tolerant powerful text search SolrCloud cluster. Utilize combination of Kafka messaging service, Elasticsearch, Accumulo, complex regex to ingest, clean, transform data for machine learning in data science project. Perform continuous Nginx and S3 streaming log analysis using Splunk search head and forwarder.Perform data transformation of billion records of health care data using Databricks SparkSQL/Scala and graphing capabilities. Experience of using AWS S3, Glue, Athena to build data catalogue and search sustainability data. Setup Google cloud platform data proc cluster to perform data transformation and analytic in Goolge bucket.Experience of cluster of NiFi data flow and Pentaho to build ETL pipeline with Kafka, Elasticsearch, Aws S3, Accumulo. Expertise of data parsing using regular expression regex, data extraction and mapping XML, JSON, binary, base64 encoding/decoding, GPG encryption. Profound experience in full software development cycle, Object Orientation, distributed systems, multiple-tier SOA,JEE, cloud platform, database, and data integration and engineering, data migration, Java development, Scala, Shell Scripting.SkillsAWS Cloud: EC2, S3, Athena, Glue, SNS, Analytics, EMR, RDS Azure datablob/storage, Google Cloud Platform Data Proc, PythonBig Data, Databricks on AWS/Azure, Cloudera Hadoop/YARN, SparkSQL, PySpark, Zookeeper, Accumulo, Hive, Search Engine SolrCloud, ElasticSearch, Splunk, Pentaho, NiFi, Kafka.
John G. Education Details
-
Johns Hopkins Whiting School Of EngineeringComputer Science
Frequently Asked Questions about John G.
What company does John G. work for?
John G. works for Federal Contractor
What is John G.'s role at the current company?
John G.'s current role is Big Data Engineer: Databricks Hadoop Spark Kafka Splunk ElasticSearch | Solr |Accumulo | AWS (EC2, S3, Glue, Lambda, Athena, SNS, Cloudformation, etc) | Nifi | Pentaho | Python | Scala | Java | Shell.
What is John G.'s email address?
John G.'s email address is jo****@****hoo.com
What schools did John G. attend?
John G. attended Johns Hopkins Whiting School Of Engineering.
Who are John G.'s colleagues?
John G.'s colleagues are Chantel Rexroth, Steven Lopez, Kelli Brockington, Sheila Parker, Vivian Covington, Vishal Harinkhede, Wade Brown.
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial