Yuefeng Zhang, PhD, Computer Science




Python Data Preprocessing Using Pandas DataFrame, Spark DataFrame, and Koalas DataFrame

Preparing data for machine learning in Python. See publication

Object-Oriented Machine Learning Pipeline with mlflow for Pandas and Koalas DataFrames

End-to-end process of developing Spark-enabled machine learning pipeline in Python using Pandas, Koalas, scikit-learn, and mlflow. See publication

Automatic Machine Learning in Fraud Detection Using H2O AutoML

Machine Learning Automation in Finance. See publication

Deep Learning in Wisconsin Breast Cancer Diagnosis

A deep learning approach for healthcare. See publication

Deep Learning for Natural Language Processing Using word2vec-keras

A deep learning approach for NLP by combining Word2Vec with Keras LSTM. See publication

Deep Multi-Input Models Transfer Learning for Image and Word Tag Recognition

A multi-model deep learning approach for image and text understanding. See publication

Deep Clustering for Financial Market Segmentation

An unsupervised deep learning approach for credit card customer clustering. See publication

Deep Learning for Image Classification on Mobile Devices

Mobile Image Classification App Development using Expo, React-Native, TensorFlow.js, and MobileNet. See publication

Deep Learning for Detecting Objects in an Image on Mobile Devices

Mobile Objects Detection App Development using Expo, React-Native, TensorFlow.js, and COCO-SSD. See publication

Deep Learning for Natural Language Processing on Mobile Devices

Reading Comprehension using Expo, React-Native, TensorFlow.js, and MobileBERT. See publication

Running Spark NLP in Docker Container for Named Entity Recognition and Other NLP Features

Using Spark NLP with Jupyter notebook for natural language processing in Docker environment. See publication

Common Time Series Data Analysis Methods and Forecasting Models in Python

Analyzing time series data for forecasting using ARIMA and LSTM models. See publication

Probabilistic Programming and Bayesian Inference for Time Series Analysis and Forecasting in Python

A Bayesian Method for Time Series Data Analysis and Forecasting in Python. See publication

Machine Learning for Building Recommender System in Python

Building Recommendation System Using Model-Based Collaborative Filtering in Python. See publication


About:

Yuefeng, who holds a PhD in computer science, is a manager and senior data scientist at McDonald's. He has led the research and development of multiple computer vision projects using advanced deep learning models for object detection, instance segmentation, object tracking, and object re-identification with one camera or across multiple cameras, including:

  • Food order accuracy intelligence with one camera: automatically detecting food items in a customer's food tray via deep learning object (e.g., food item) detection, and checking the customer's order for any missing or wrong food items.
  • Deep learning food order tracking across multiple cameras by detecting food items and tracking all of the food items in a food tray as a cluster of points.
  • Work utilization measurement intelligence via deep learning person detection, deep learning pose estimation, and an innovative method of detecting the active work time of each kitchen device (e.g., grill machine).
  • Personalized work utilization measurement automation via deep learning person detection, deep learning person tracking, deep learning pose estimation, attention-based deep person re-identification, and creative detection of active work time for each individual crew member.
  • Drive-through food order automation across multiple cameras using deep learning vehicle detection, deep learning vehicle tracking, and attention-based deep learning vehicle re-identification.


  • Served as the lead author of a well-received white paper on computer vision for personalized work utilization

  • Contributed to trending analytics for foresight marketing intelligence


Yuefeng was a senior data scientist at Wavicle Data Solutions, where he:

  • Developed a new deep learning computer vision model for fast food item detection using AWS SageMaker, OpenCV, and YOLOv5
  • Developed a new NLP (Natural Language Processing) model for sentiment analysis of customer feedback
  • Helped clients enhance machine learning models with a 10x CPU performance improvement
  • Developed clinical data statistical analysis and report generation software using AWS SageMaker with R kernel
  • Provided end-to-end machine learning services to clients
  • Authored machine learning project proposal
  • Utilized industry-leading AI platforms and frameworks such as GCP AI Platform, Databricks on AWS, AWS SageMaker, Hugging Face, TensorFlow, and PyTorch
  • Contributed white papers on machine learning and deep learning applications
  • Played a leading role in the establishment of a new data science team
  • Mentored junior data scientists and data engineers

Yuefeng was a senior data scientist at SMS Assist, where he:

  • Led the facilities service pricing data analytics of residential and commercial reactive work orders
  • Developed a brand new data preprocessing and feature engineering pipeline for the prediction of facilities service fair market price
  • Developed a brand new supervised machine learning system for the prediction of facilities service fair market price, including the work order affiliate invoice total, labor unit and total cost, materials total cost, travel total cost, freight total cost, and other costs
  • Established an industry-standard machine learning process based on CRISP-DM (Cross-Industry Standard Process for Data Mining)
  • Guided and worked collaboratively with data engineering and DevOps teams to establish a shareable cloud machine learning environment in AWS using JupyterHub, Jupyter notebook, GitLab, S3, etc.
  • Developed a machine learning and deep learning roadmap for management covering the automation of the residential and commercial reactive work order workflow
  • Identified and documented potential machine learning and deep learning opportunities

Yuefeng was a senior data engineer and principal associate at Capital One, where he:

  • Led the architecture and development of the logistic regression and Random Forest machine learning engines for the prediction of customer profile matching, achieving more accurate predictions than the legacy system developed by IBM
  • Led the architecture, design, and development of a multi-model machine learning system that can run multiple models concurrently for the prediction of probabilistic matching of different types of customer profiles with different features from different countries
  • Led the research of innovative probabilistic customer matching methods via face recognition using deep learning algorithms such as convolutional neural networks (ConvNets)
  • Advised and collaborated with the Capital One Card organization in the development of an anomaly detection and forecasting system using deep learning algorithms such as deep autoencoders and LSTM recurrent neural networks
  • Developed an open-source-based customer profile duplicate management system and presented it at the Capital One 2019 Hackathon competition
  • Played a leading role in the architecture and development of large-scale real-time data preprocessing, data blocking, feature engineering, and model monitoring metrics calculation
  • Led the architecture and development of a large-scale real-time customer core analytical data streaming pipeline using Java, Python, Spark, Kafka, Spring Framework, and a column-oriented distributed database
  • Assisted in teaching Capital One's official machine learning training course in Chicago in 2019

Yuefeng was a senior system and application engineer at UniqueSoft, LLC, where he:

  • Served as project and team leader. Duties included project planning, management, and implementation of the whole software development lifecycle for each project: system requirements solicitation, analysis, architecture/design, implementation, testing, and deployment.
  • Successfully developed several large-scale reverse-engineering/software-data-mining projects that utilize unsupervised machine learning algorithms to automatically extract application knowledge from legacy source code, including domain-specific application system features, architecture elements (architecture diagrams, call/control flow charts, dependency tables, sequence diagrams, etc.), complexity metrics (size, cyclomatic complexity, module dependency complexity, etc.), and major issues/improvement opportunities (e.g., replicated code detection and elimination, dead code detection and elimination, etc.).
  • Successfully developed various types of software products in different domains (e.g., telecommunications, defense, transportation, and enterprise) and languages (TDL, Java, C/C++, PL/SQL, COBOL, JavaScript, etc.) using modern software engineering methodologies (e.g., software modernization via data mining, UML modeling and automatic code generation, object-oriented development, service-oriented development, aspect-oriented programming, cloud computing, and Agile processes) and Web application technologies such as J2EE, Spring Framework, Web services (SOAP, WSDL, etc.), JavaScript, and Python.

Yuefeng was a Distinguished Member of Technical Staff at Motorola Solutions, where he:

  • Worked collaboratively with senior management to develop a strategy and approach for establishing a Model-Driven Development (MDD) methodology and an Agile development process for the development of large-scale parallel and distributed real-time telecommunication systems.
  • Established partnerships with product, engineering, and software automation teams, and with MDD tool vendors such as IBM.
  • Developed and managed the Motorola iDEN division-wide MDD roadmap, including software development processes, coding and design standards, protocols, platforms, programming languages, tools, etc.
  • Developed materials and conducted training for both technical and business colleagues.

Yuefeng has authored a book chapter and many peer-reviewed articles on machine learning, deep learning, data science, digital image processing, computer vision, and related topics. More detailed information about Yuefeng is available on LinkedIn.