Skip to main content

Machine learning for federal missions.

Production ML systems from a Kaggle Top 200 data scientist. Computer vision, NLP, forecasting, anomaly detection — shipped into live federal environments, not slide decks.

Top 200
Kaggle Global
0.1%
Of 200,000+ DS
a federal health agency
Production ML, in production today
15yr
Founder engineering tenure

What we deliver

Every engagement is scoped toward a working model serving real users, not a PowerPoint deck — from data intake through deployed inference with monitoring, ATO alignment, and a handoff package the receiving team can operate.

Top 200
Kaggle global ranking
a federal health agency
Confirmed production ML delivery
FedRAMP High
ATO-ready ML infrastructure
MACHINE LEARNING — what we track
p99
latency budget
eval
domain harness
ATO
NIST 800-53
cost
per mission call
drift
continuous eval
  • Computer vision — object detection, segmentation, OCR, document understanding, satellite/aerial imagery analysis, medical imaging.
  • Natural language processing — classification, named entity recognition, summarization, embeddings-based search, fine-tuned domain models.
  • Time-series forecasting — demand, supply, workforce, equipment, budget. Classical (Prophet, ARIMA) through deep (N-BEATS, TFT).
  • Anomaly detection — fraud, network intrusion, supply chain deviation, health surveillance, predictive maintenance.
  • Tabular prediction — gradient-boosted trees (XGBoost, LightGBM, CatBoost) plus ensembling. The workhorse pattern behind most production federal prediction systems.
  • MLOps — model registry, drift monitoring, shadow deployment, A/B testing, rollback paths.

FEDERAL ML DEPLOYMENT COVERAGE

Scikit-learn and XGBoost on GovCloud
92%
PyTorch (on-prem and GovCloud)
88%
SageMaker Government
85%
Azure ML Government
82%
Databricks MLflow on GovCloud
88%

Why Kaggle matters for federal ML

A Kaggle Top 200 ranking means you've consistently beaten 199,800+ other data scientists on held-out test sets across diverse problem types. It is a direct publicly-benchmarked, adversarial test of modeling skill that exists.

For federal work, it matters because many federal ML projects fall short not from lack of data but from modeling choices — wrong architecture, leaked features, overfit validation, no uncertainty quantification. Competition ML training directly addresses these failure modes.

Stack

Modeling

PyTorch, scikit-learn, XGBoost, LightGBM, CatBoost, HuggingFace Transformers, timm, detectron2.

Experiment tracking

MLflow, Weights & Biases.

Serving

TorchServe, ONNX Runtime, Triton Inference Server, FastAPI wrappers.

Feature stores & data

Feast, Parquet + DuckDB, Postgres, Spark for batch feature engineering.

Cloud

AWS GovCloud (SageMaker, EC2 GPU), Azure Government, on-prem CUDA clusters.

Past performance highlights

a federal health agency

Production machine learning system

Designed and deployed a live machine learning system at the a federal health agency. Serves real users. Passed full federal security review. In production today. See full past performance →

Federal ML, answered.
What machine learning work can a small business realistically deliver for federal agencies?

Everything a prime can deliver for the actual ML workload — we've shipped production ML at a federal health agency that passed federal security review. What a small business does not deliver is the 200-person prime overhead wrapped around a 2-person ML team. For the ML itself, a lean specialist delivers faster and with more direct accountability.

What is your Kaggle ranking?

Top 200 globally out of 200,000+ active data scientists, putting Bo in the top 0.1%. Experience spans NLP, computer vision, tabular prediction, and ensemble methods — all competition-proven on held-out adversarial test sets. Profile: kaggle.com/bopengiowa.

Have you shipped production ML in a federal environment?

Yes. Production ML at a federal health agency (a federal health agency). Live. Real users. Passed federal security review. Not a prototype, not a POC.

Can you handle classified or air-gapped ML environments?

Design and modeling yes. Production deployment in classified environments requires the contracting path to include facility clearance — typically through a cleared prime partner. We'd partner rather than pretend to hold clearances we don't.

Do you do MLOps and monitoring, or just model training?

Both. A federal ML system without drift monitoring, model registry, and rollback paths is a liability. We build these in from day one, not bolt them on later.

1 business day response

Turn data into mission advantage.

Production ML for federal agencies. Ready to deliver.

Contact the PISee which agencies we serve →
UEI Y2JVCZXT9HP5CAGE 1AYQ0NAICS 541512SAM.GOV ACTIVE