Lijun Yang Email and Phone Number
Lijun Yang work email
- Valid
- Valid
- Valid
Lijun Yang personal email
• 18 years of general data analytics; 8 years of C/C++ programming on automatic machine trading; 6 years of Python programming on data engineering and data science. • Solid background in probability theory and statistics, linear algebra and calculus. Have a unified and coherent picture, i.e., linear or nonlinear basis change, to connect classic machine learning algorithms and deep neural network. Reducing the dimension of basis vectors as much as possible while maintaining most information is a key point in tackling most machine learning problems. A “relatively” large number of samples due to the significant reduction of feature dimension will almost always help reduce model variance.• Have a deep understanding on machine learning fundamentals and strong intuition on: (1) What models to choose? Classic, neural networks or their combination? Linear or nonlinear models? (2) Why a model does not work well? Lack of target-feature dependence, bias due to simple model or less features, variance due to lack of samples or too many features or too complex models?• (1) Familiar with various dialects of relational databases and the essential tasks such as join and aggregation for data preparation. (2) Hands-on experiences on working with major distributed databases such as Hadoop ecosystem, Spark, etc.; manipulating data with low-level key-value based ways such as MapReduce, RDD and high-level SQL-query based techniques such as HIVE, PIG, etc. • Familiar with various visualization techniques from 2D to 3D; from static to interactive; from machine learning metrics to statistical plots in general, etc., all of which provide significant insights to either management or business. • I am a humble, easy-going person and get along with team very well. I am a person full of curiosity, a hard-working person and a loyal worker.
Samsung Electronics America
View- Employees:
- 8740
-
Machine Learning EngineerSamsung Electronics America Jan 2022 - PresentPlano, Texas, United States -
Performance Assurance (Data Scientist)Samsung Electronics America Aug 2020 - Jan 2022Plano, Texas, United States -
Data ScientistAt&T Aug 2019 - Apr 2020Raleigh-Durham, North Carolina Area1. ETL and data engineering• Create python scripts to extract data from sources such as MS SQL Server, MySQL, Oracle, Spark, Hive and Teradata.• Develop python scripts to join eight tables from various sources. Design composite primary keys for all tables by applying self-defined or standard aggregation functions. Build lookup table to correct data or fill missing data. • All tables are first joined into two subgroups by relevant primary keys and then into a single table. Reconcile data inconsistency for the same-name columns in different tables under same primary key. 2. Supervised learning• Predict the winning probability of a business opportunity. Group each high cardinality feature up to 20 categorical levels according to its distribution. Logistic regression, SVM, random forest, bagging and boosting algorithms are used. • The dimension of one-hot vector can easily pass a thousand even after feature selection, which increases the model variance. To improve the model, we apply target statistics and feature hashing to decrease the dimension. • This significantly increases the model performance. For the same precision required by business, the recall is increased by 50%. At very high precision the recall may be increased by more than 100%. 3. Unsupervised learning • Investigate what features contribute to a business class. Although an unsupervised problem, labels are used to select features to avoid the disruptive effect of using too many features in clustering. • K-mode and K-prototypes are employed depending on whether continuous variables are discretized. Well defined clusters are obtained as shown in a nonlinearly mapped t-SNE plot. 4. Visualization • Create visualization package for the team. Develop modules to calculate all metrics of classification models for visualization. • Develop functions for plotting interactive 3D stacked bar chart to show business insights. -
Machine Trading C/C++ DeveloperSelf-Employed Jan 2010 - Aug 2019• Used classical time series models such as autocorrelation function, moving average, etc. to predict market trending. • Used linear and nonlinear regression machine learning algorithms, tree-based models and neural networks to calculate the entry and exit prices of financial market. • Developed an automatic stock trading system, which can trade stocks without human intervention and handle up to 50 stock orders per second. • Developed a stock symbol selection system to manage trading risk. High volatility stock symbols are selected and then filtered first by their correlation to the market index and then by the pair correlation among each other. High volatility and small correlations significantly reduce the risk of trading portfolio. • Calculated the general volatility of the portfolio and used put option for the market index to hedge short term risk arising from unexpected events such as geopolitical issues, and from the large bid/ask spread of high volatility stocks. • Designed trading socket clients and their callbacks in different threads based on Interactive Broker’s C/C++ API, which involves multi-thread programming, socket programming and intensive numerical calculations. • Develop historical data downloading client, real-time market data feeding client, which are running in separate threads to serve the main trading client. • Combine three C/C++ projects from open-source technical analysis code, Interactive Broker’s API source code, the self-designed trading system, and together with SQL server, into a fully functional trading system. • C/C++ sample project can be found at https://github.com/ljyang100/dataScience/tree/master/programming/sampleProject.
-
Postdoctoral ResearcherUniversity Of California, Irvine Nov 2005 - Nov 2009Irvine, California• Did research and simulations on the behavior of electrons in semiconductor quantum wells and other nanostructures. • Conducted theoretical and computational investigation of two-dimensional correlation spectroscopy applied to chemical, biological systems such as photosynthetic complex, proteins. • Research papers can be found at: https://github.com/ljyang100/dataScience/blob/master/mathematics/physics/selected%20publications.ipynb
Lijun Yang Education Details
-
Theoretical And Computational Physics
Frequently Asked Questions about Lijun Yang
What company does Lijun Yang work for?
Lijun Yang works for Samsung Electronics America
What is Lijun Yang's role at the current company?
Lijun Yang's current role is Machine Learning Engineer at Samsung Electronics America.
What is Lijun Yang's email address?
Lijun Yang's email address is li****@****att.com
What schools did Lijun Yang attend?
Lijun Yang attended Queen's University.
Not the Lijun Yang you were looking for?
-
Livia (Lijun) Yang
Greater Seattle Area -
Lijun (Reggie) Yang
Reston, Va -
2gmail.com, campbellsoup.ca
-
1hotmail.com
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial