LogoLogo
Log InSign UpHomepage
  • 👋Welcome
  • Account and Team Setup
    • Sign up
    • Subscription Plans
    • Profile information
    • Account information
    • Roles
    • Users
    • Tags
  • FAQ
  • UNDERSTANDING MACHINE LEARNING
    • What is Graphite Note
      • Graphite Note Insights Lifecycle
    • Introduction to Machine Learning
      • What is Machine Learning
      • Data Analitycs Maturity
    • Machine Learning concepts
      • Key Drivers
      • Confusion Matrix
      • Supervised vs Unsupervised ML
  • Demo datasets
    • Demo Datasets
      • Ads
      • Churn
      • CO2 Emission
      • Diamonds
      • eCommerce Orders
      • Housing Prices
      • Lead Scoring
      • Mall Customers
      • Marketing Mix
      • Car Sales
      • Store Item Demand
      • Upsell
    • What Dataset do I need for my use case?
      • Predict Cross Selling: Dataset
      • Predict Customer Churn: Dataset
      • Predictive Lead Scoring: Dataset
      • Predict Revenue : Dataset
      • Product Demand Forecast: Dataset
      • Predictive Ads Performance: Dataset
      • Media Mix Modeling (MMM): Dataset
      • Customer Lifetime Value Prediction : Dataset
      • RFM Customer Segmentation : Dataset
    • Dataset examples - from online sources
      • Free datasets for Machine Learning
  • Datasets
    • Introduction
    • Prepare your Data
      • Data Labeling
      • Expanding datasets
      • Merging datasets
      • CSV File creating and formatting
    • Data sources in Graphite Note
      • Import data from CSV file
        • Re-upload or append CSV
        • CSV upload troubleshooting tips
      • MySQL Connector
      • MariaDB Connector
      • PostgreSQL Connector
      • Redshift Connector
      • Big Query Connector
      • MS SQL Connector
      • Oracle Connector
  • Models
    • Introduction
    • Preprocessing Data
    • Machine Learning Models
      • Timeseries Forecast
      • Binary Classification
      • Multiclass Classification
      • Regression
      • General Segmentation
      • RFM Customer Segmentation
      • Customer Lifetime Value
      • Customer Cohort Analysis
      • ABC Pareto Analysis
      • New vs Returning Customers
    • Predict with ML Models
    • Overview and Model Health Check
    • Advanced parameters in ML Models
    • Actionable insights in ML Models
    • Improve your ML Models
  • Notebooks
    • What is Notebook?
    • My first Notebook
    • Data Visualization
  • REST API
    • API Introduction
    • Dataset API
      • Create
      • Fill
      • Complete
    • Prediction API
      • Quickstart
      • Request
        • Headers
        • Payload
        • Data
      • Response
        • Response Structure
      • API Limits
    • Model Results API
      • Quickstart
      • Request
        • Headers
        • Body
      • Response
      • Usage Notes
      • Code Examples
Powered by GitBook
On this page
  • Introduction
  • 1. Lead Scoring Model (Converted: Yes/No)
  • 2. Churn Prediction Model (Churned: Yes/No)
  • 3. MQL to SQL Conversion Model (Converted: Yes/No)
  • Conclusion

Was this helpful?

Export as PDF
  1. Datasets
  2. Prepare your Data

Data Labeling

Introduction

Data labeling is the process of tagging data with meaningful and informative labels to train machine learning models. In predictive analytics, labeled data is crucial as it provides the model with examples of correct behavior. This document will guide you through the process of preparing and labeling data for three predictive models:

Lead Scoring,

Churn Prediction,

and MQL to SQL Conversion.

1. Lead Scoring Model (Converted: Yes/No)

Objective: Predict if a lead will convert into a customer.

Dataset Example:

Lead_ID
Industry
Company_Size
Interaction_Count
Converted

001

Tech

50-100

5

Yes

002

Finance

100-500

2

No

Steps:

  1. Data Collection: Gather data on leads, including their industry, company size, and interactions with your platform.

  2. Labeling: For each lead, label them as 'Yes' if they converted into a customer and 'No' if they didn't.

  3. Reasoning: Labeling helps the model understand patterns of conversion based on the features provided.

2. Churn Prediction Model (Churned: Yes/No)

Objective: Predict if a customer will churn or leave your service.

Dataset Example:

Customer_ID
Monthly_Usage
Support_Tickets
Feedback_Score
Churned

A1

50 hrs

2

4.5

No

B2

10 hrs

5

2.8

Yes

Steps:

  1. Data Collection: Gather data on customer usage patterns, support interactions, and feedback scores.

  2. Labeling: For each customer, label them as 'Yes' if they churned and 'No' if they continued using your service.

  3. Reasoning: Labeling helps the model identify signs of customer dissatisfaction or reduced engagement, which might lead to churn.

3. MQL to SQL Conversion Model (Converted: Yes/No)

Objective: Predict if a Marketing Qualified Lead (MQL) will become a Sales Qualified Lead (SQL).

Dataset Example:

MQL_ID
Webinar_Attendance
Downloaded_Content
Email_Click_Rate
Converted

M1

2

Yes

15%

Yes

M2

0

No

5%

No

Steps:

  1. Data Collection: Gather data on MQLs, including their engagement with webinars, content downloads, and email interactions.

  2. Labeling: For each MQL, label them as 'Yes' if they became an SQL and 'No' if they didn't.

  3. Reasoning: Labeling helps the model recognize patterns of engagement that indicate a lead's readiness to move to the sales stage.

Conclusion

Data labeling is a foundational step in predictive analytics. By providing clear, accurate labels, you enable your predictive models to learn from past data and make accurate future predictions. Ensure your labels are consistent and based on well-defined criteria to achieve the best results with Graphite Note's no-code predictive analytics platform.

PreviousPrepare your DataNextExpanding datasets

Last updated 1 year ago

Was this helpful?