PinkMate — Agentic Predictive GTM Copilot

Turn raw leads into a prioritised, outreach-ready pipeline using enrichment signals, predictive ML scoring, and agentic workflow orchestration.

Predictive MLAgentic WorkflowsICP SegmentationGTM Automation

Under active development. Public MVP documentation and technical repository are available below.

Built by Karan Sehgal (ex-Amex, WegoPro, Nokia) — founder-led AI GTM system combining ML scoring + agentic orchestration.

Built With Production-Grade Tools

ML & Data

Python • Scikit-learn • Pandas • NumPy

Agent Framework

LangChain • LangGraph • LangSmith

Orchestration

Custom workflow engine • State management

Data Sources

CSV • LinkedIn Sales Navigator • Apollo.io

How It Works

1
Lead Intake
CSV / LinkedIn / Apollo imports
2
Enrichment
Firmographics + buyer intent signals
3
Predictive Scoring
Conversion likelihood via ML models
4
Segmentation
ICP fit + readiness tiers
5
Outreach Triggers
Email/LinkedIn sequences + feedback loop

System Components

Signals & Enrichment

Firmographics + buyer intent signals as model inputs.

Scoring Layer

Interpretable ML baselines for prioritisation. The MVP combines classical ML models (Naive Bayes, Random Forest, Gradient Boosting) with agent-orchestrated workflows using LangChain and LangGraph to predict conversion likelihood, generate ICP tiers, and trigger GTM actions.

Segmentation Engine

ICP fit + readiness tiers drive routing.

Agentic Orchestration

Workflow transitions and triggers to next actions. Autonomous agents monitor segment changes, execute multi-step sequences (enrichment → scoring → routing → outreach), handle conditional branching, and learn from outcomes to optimize future decisioning paths.

Multi-Agent Pipeline Design

Six specialized agents working in orchestrated harmony

1. IngestionAgent
Accepts CSV uploads, LinkedIn Sales Navigator exports, Apollo data feeds, and API inputs. Normalizes lead records, validates required fields, and flags incomplete data for manual review.
2. EnrichmentAgent
Calls enrichment sources (Clay, Clearbit, Apollo APIs) to append firmographics (company size, industry, revenue, funding stage) and buyer signals (tech stack, hiring activity, recent news). Handles rate limiting and API errors gracefully.
3. ScoringAgent
Applies classical ML models (Naive Bayes for baseline categorization, Random Forest for feature importance, Gradient Boosting for final scoring). Returns conversion probability (0-1), confidence intervals, and feature attribution for transparency.
4. SegmentationAgent
Converts scores into actionable tiers (High/Medium/Low readiness) and applies ICP rules (target sectors, geography, ARR thresholds, role seniority). Routes leads to appropriate workflows based on fit + readiness matrix.
5. OutreachAgent
Decides next-best-action based on segment tier: High tier → personalized email sequence, Medium tier → nurture cadence, Low tier → hold for 60 days. Designed to integrate with CRMs (HubSpot, Salesforce) and outreach tools (Lemlist, Apollo).
6. FeedbackLoop & Analytics
Consumes engagement signals (email opens, meeting bookings, opportunity creation) and closed-won outcomes. Feeds results back to ScoringAgent for model recalibration, adjusts segmentation thresholds, and tracks uplift vs. baseline GTM performance.

Orchestration note: Agents communicate via LangGraph state machines with error handling, retries, and conditional routing. Vector-based memory stores account history and interaction context for personalization.

Predictive Scoring Under the Hood

Model Ensemble

Naive Bayes

Baseline probabilistic classifier for quick lead categorization

Random Forest

Feature importance extraction and non-linear relationship detection

Gradient Boosting (XGBoost/LightGBM)

Primary scoring model with high predictive accuracy

Feature Engineering

The system extracts and engineers features across multiple dimensions:

  • Firmographic signals: Company size, industry, revenue band, growth stage
  • Behavioral signals: Website visits, content engagement, email opens
  • Intent signals: Job postings, technology stack changes, funding events
  • Temporal patterns: Engagement velocity, time-since-last-action
Model Output

Each lead receives:

  • Conversion probability score (0-1) with confidence intervals
  • ICP fit tier (High/Medium/Low)
  • Feature attribution showing which signals drove the score
  • Low-certainty flags for manual review
  • Structured response format for downstream agent consumption
Why Classical ML?
PinkMate deliberately uses interpretable classical models rather than black-box deep learning. This architectural choice enables faster iteration cycles during MVP development, provides transparent feature importance for GTM teams to understand scoring decisions, requires minimal compute infrastructure, and makes debugging and model auditing straightforward. As the product matures and training data accumulates, more complex models can be introduced where they add clear value.

A High-Performance Growth Engine Built on ML + Agentic Intelligence

From Static Lists to Autonomous GTM Execution

Traditional CRMs are storage systems. They hold lead data and depend entirely on manual prioritization, static segmentation rules, and human-driven follow-up. PinkMate is fundamentally different — it's a decisioning and execution layer that combines predictive ML with agentic orchestration.

Predictive Scoring Layer
Classical ML models (Naive Bayes, Random Forest, Gradient Boosting) analyze enrichment signals — firmographics, intent data, engagement history — to predict conversion likelihood. The scoring is interpretable, feature-based, and designed to improve as engagement signals accumulate.
Agentic Orchestration
Built on LangChain and LangGraph, autonomous agents monitor lead state changes, execute multi-step workflows (enrichment → scoring → segmentation → outreach trigger), handle conditional branching based on ICP fit and readiness, and feed outcomes back into the scoring system for continuous improvement.
Workflow Autonomy
Instead of manual list-sorting and static if/then rules, PinkMate's agents make real-time decisions: when a lead crosses a readiness threshold, the system autonomously triggers the next-best action (email sequence, LinkedIn outreach, CRM handoff) and tracks response signals to refine future prioritization.

The Result:

A self-improving GTM system where enrichment feeds prediction, prediction drives segmentation, and segmentation triggers execution — all without manual intervention.

MVP Constraints & Known Limitations

PinkMate is an early-stage MVP under active development. Current limitations include:

  • Processing Mode:Batch processing only — no real-time API yet. Scoring runs are triggered manually or on schedule.
  • Use Case Scope:Optimized for B2B SaaS outbound initially. Other verticals (e-commerce, services) not yet tested or tuned.
  • Enrichment Setup:Manual connector configuration required. Automated enrichment source discovery is planned for Phase 2.
  • Collaboration:Single-user mode. Team-based workflows, role permissions, and shared dashboards are in development.
  • Model Maturity:Baseline models trained on synthetic and limited real-world data. Performance improves as engagement signals accumulate.

These constraints are being addressed systematically in the development roadmap. Feedback from early users directly shapes prioritization.

Use Cases

Outbound for B2B SaaS teams

ABM and ICP tiering

Predictive lead qualification

Pipeline prioritisation and sequencing

What You Get

  • Prioritised lead list with readiness tiers
  • Enriched records with key firmographic fields
  • Scoring rationale (MVP-level transparency)
  • Suggested next-best action trigger per segment
  • Public build artefacts (repo + documentation)

System Architecture Overview

Lead Sources

CSV, LinkedIn, Apollo

Enrichment Layer

Firmographic + intent data

Feature Engineering

Signal extraction

ML Scoring Engine

NB, RF, GBM models

Outreach Execution

Email/LinkedIn triggers

Agentic Router

LangGraph state machine

Segmentation Logic

Readiness + ICP routing

ICP Classifier

High/Med/Low tiers

Outcome Tracking

Meeting booked, opp created

Feedback Loop

Label collection

Model Retraining

Periodic updates

Verification & Build Evidence

GitHub Repository

Technical implementation, ML workflows, and agentic orchestration logic.

View GitHub Repository
Public Documentation

Product spec, workflow definitions, and feature roadmap.

View Documentation
Repository Highlights
  • agents/ - Six specialized agent implementations
  • workflows/ - LangGraph orchestration and routing logic
  • models/ - ML model artifacts and configuration files
  • docs/ - Architecture documentation and workflow diagrams

This is an early MVP and technical proof-of-concept under active development. The landing page, documentation, and repository reflect the current build state.

Frequently Asked Questions

Built by a Founder Who Ships

I'm Karan Sehgal, Founder of PinkMate. I'm designing the GTM workflow, defining the predictive scoring approach (Python-based ML baselines + feature design), and shaping the agentic execution model that moves GTM from manual list-sorting to AI-native prioritisation and action.

Background: Previously at American Express (marketing analytics), WegoPro (growth engineering), and intern at Nokia (network engineering). Building in public — all code, documentation, and development progress is open and verifiable.

Active Development Phases

Phase 1: Foundation
Current
  • Core ML scoring pipeline
  • LangChain + LangGraph orchestration
  • CSV data ingestion
  • Basic segmentation logic
  • Public documentation + GitHub
  • 🔄Feature engineering optimization
  • 🔄Model evaluation framework
Phase 2: Intelligence Layer
Next 8 weeks
  • Expand enrichment sources
  • Behavioral signal tracking
  • Feedback loop implementation
  • Enhanced agentic decision trees
  • A/B testing framework
Phase 3: Scale & Integration
Following
  • LLM-enhanced scoring layers
  • RAG-based context grounding
  • Autonomous outreach sequencing
  • CRM integrations (HubSpot, Salesforce)
  • Real-time scoring API
  • Self-service configuration UI
Live🔄 In Progress Planned

Recent Updates

v0.3
Current
  • Added Gradient Boosting models to scoring ensemble
  • Implemented LangGraph state machine for workflow orchestration
  • Expanded feature engineering pipeline (temporal patterns + intent signals)
  • Published public documentation + technical repository
v0.2
  • Built Random Forest model for ICP classification
  • Created enrichment connectors for CSV + LinkedIn data
  • Defined segmentation logic (readiness tiers)
v0.1
  • Initial Naive Bayes baseline model
  • Basic lead intake + scoring prototype
  • Established core architecture patterns
Next Up

Feedback loop implementation, expanded enrichment sources, model evaluation framework

Ready to Build Smarter GTM?

Join early access or view the GitHub repository for technical details.