Machine Learning 101 with Python: Predictions and Insights

Program Description

While this outline serves as a foundational framework with use cases from multiple industries and functions, the final program is fully customized to your industry and internal workflows.

Participants work on real-world problems, not generic examples. We engage in a pre-workshop alignment to inject your specific organizational datasets, pain points, and proprietary use cases directly into the curriculum.

Learning Objectives

Program Details

Content

Day 1: Regression, Classification & Data Prep

  • Deconstructing the difference between traditional programming (Rules-based) and Machine Learning (Inference-based). Navigating the Scikit-Learn ecosystem.
  • Scenario (General): Auditing a legacy rule-based discount system and replacing it with an ML model that predicts “Propensity to Buy.”
  • Hands-on: “The Baseline Script” – Setting up a clean ML environment and performing the initial Train/Test split on a corporate dataset.
  • Expected Impact: Technical clarity on how to structure a data science team’s workflow for maximum reproducibility.
  • Understanding Ordinary Least Squares (OLS), Coefficients, and Intercepts. How to quantify the relationship between independent variables and a target outcome.
  • Demo (Manufacturing/Logistics): Predicting “Estimated Time of Arrival” (ETA) for shipments at Port Klang based on weather, vessel type, and historical port congestion.
  • Hands-on: Building a “Revenue Predictor” – Using Python to forecast next month’s sales based on marketing spend and seasonal indices.
  • Expected Impact: Ability to lead financial and operational forecasting projects with statistical confidence.
  • The Sigmoid function and probability mapping. Moving from “How much?” to “Yes or No?”
  • Scenario (Banking/FinTech): Building a baseline “Loan Default” predictor. Does this applicant fall into the ‘Risk’ or ‘Safe’ category based on their credit history?
  • Hands-on: “The Churn Alert” – Creating a classifier to predict whether a telecom subscriber will port out (churn) based on their usage patterns and complaint history.
  • Expected Impact: Foundational capability to build automated approval and flagging systems.
  • Handling Categorical data (One-Hot Encoding) and Feature Scaling. Technical methods for data anonymization to satisfy Malaysian regulatory requirements.
  • Scenario (HR/Operations): Encoding “Job Role” and “Department” into numerical format for an attrition model while hashing NRIC and names to protect employee privacy.
  • Hands-on: “The Sanitization Pipe” – Building a preprocessing pipeline that handles missing values and scales features while stripping PII (Personally Identifiable Information).
  • Expected Impact: 100% compliance with PDPA 2.0; structural protection of corporate and individual data.

Day 2: Tree-Based Models, Clustering & Evaluation

  • Understanding Gini Impurity and Entropy. Moving from a single tree (overfitting) to an ensemble of trees (Random Forest) for robust predictions.
  • Demo (E-commerce/Retail): Using a Random Forest to identify the “Top 5 Drivers” of high-value cart conversions during a 12.12 sale event.
  • Hands-on: Building a “Lead Scoring” model – Ranking potential B2B clients based on firmographic data and past engagement levels.
  • Expected Impact: Capability to deploy models that are both highly accurate and explainable to non-technical stakeholders.
  • Discovering structure in “Unlabeled” data. Understanding the “Elbow Method” to determine the optimal number of clusters.
  • Scenario (Sales/Marketing): Segmenting a Malaysian retail customer base into “Bargain Hunters,” “Brand Loyals,” and “Occasional Shoppers” based on purchasing behavior.
  • Hands-on: “The Market Map” – Using Python to cluster geographical regions in West/East Malaysia based on economic indicators and consumption power.
  • Expected Impact: Ability to identify new market opportunities and operational efficiencies that are not visible through standard reporting.
  • Why “Accuracy” is a trap for imbalanced data. Mastering the Confusion Matrix, Precision, Recall, and the F1-Score.
  • Scenario (Operations/Risk): Evaluating a fraud detection model where 99% of transactions are legitimate. How to ensure the model actually catches the 1% of fraud (High Recall).
  • Hands-on: “The Performance Audit” – Running a full diagnostic on the Day 1 models to identify “Overfitting” and “Underfitting” using Cross-Validation.
  • Expected Impact: Reduced technical risk; ensuring that deployed models perform reliably in “Real-World” Malaysian market conditions.
  • Consolidating the course into a technical execution plan. Moving from “Lab” to “Pilot” and then “Production.”
  • The Framework: Prioritizing ML projects based on Data Availability vs. Immediate Business Value.
  • Hands-on: Co-creating a “Technical ML Playbook” – defining standards for model documentation, version control (Git), and periodic model retraining.
  • Expected Impact: A clear, sustainable roadmap for transitioning the organization into an AI-driven entity.
Data Analytics Training for IT Professionals

List of Deliverables

Prerequisites

Who Should Attend

Training Methodology

100% HRDC-Claimable

This program is fully registered and compliant with HRDC (Human Resource Development Corporation) requirements under the SBL-Khas scheme, allowing Malaysian employers to offset the training costs against their levy.

Certification of Completion

Participants who successfully complete the program will be awarded a “Professional Certificate in Machine Learning 101 & Predictive Analytics.

Post-Workshop Consulting (Optional)

For organizations looking to bridge the gap between training and execution, we offer optional, paid consulting services. These engagements provide expertise and technical support for specific pilot development or full-scale operational integration of the data- and AI-driven use cases established during the program.

Contact us for In-House Training

    * All fields are required