Synthetic data
Synthetic long-horizon histories for recent Australian Betashares ETFs (DHHF, BGBL, GHHF, GGBL)
Logos-SIE (Synthetic Information Ecosystem) is a large-scale synthetic benchmark designed to model the complete lifecycle of information formation within a simulated world.
Realistic log generators for testing data pipelines at volume - web, IoT, syslog, Windows, Cisco ASA, CEF/LEEF, JSON app, cloud audit, Kubernetes, PostgreSQL. Requires only uv.
Turn natural language into a training-ready CV dataset automatically. YOLOv11 + SAM2 + Vision LLM + Neural DQS
Public ErdosBench smoke test: 14 public-source Erdős-style problems and baseline reports
FlowLet: Conditional 3D Brain MRI Synthesis using Wavelet Flow Matching - Accepted at Medical Image Analysis, Elsevier
End-to-end financial fraud detection for M-Pesa, bank, and KRA transactions — XGBoost + SMOTE on synthetic Kenyan data, with a live Streamlit demo.
Paste a TypeScript interface or Zod schema. Get realistic fixture code instantly.
WinLOLBIN-GT: A Behavioural Ground Truth Dataset for Machine Learning-Based Detection of Windows Living-Off-the-Land Binary Abuse
PyTorch implementation of a Selective State Space Model (SSM) using the Mamba architecture. Designed as a diagnostic framework, it features a custom signal-to-noise synthetic dataset generator to evaluate latent state memory trajectories.
Parameterizable synthetic dataset generator for AI-driven cyberattacks
Synthetic RNA-seq cohorts for data sharing: a discovery-aware benchmark at transcriptome scale — Nanda & Saha, 2026
Harbor-format AI evaluation tasks for synthetic adtech revenue operations workflows
Dataset for the paper: Political Neutrality as Balanced Approval: A Large-Scale Human Evaluation of AI Responses
Run a lightweight AI focus group for ideas, messaging, and product concepts — built around synthetic personas and Likert scoring.
Convert your agentic tool logs into useable training data or calibrations for GGUF quantization.
Draw a store, generate LLM personas, and watch them shop — an isometric 3D sandbox for synthetic-consumer experiments.