A deep learning framework for 4-stage Alzheimer's Disease classification using T1 MRI scans. It features a ResNet-18 architecture with Squeeze-and-Excitation (SE) blocks. To handle severe class imbalance, Focal Loss and Weighted Sampling are used. Achieves 78.89% accuracy and 100% recall for Moderate Demented cases.

#Computer vision #AI image generation

kaistmm/SeeandSniff

New 2026

[ECCV 2026] Official Pytorch implementation for See & Sniff: Learning Visuo-Olfactory Representations

#Computer vision #AI image generation

amory123k-commits/Face_Anonymizer

New 2026

"A Python script that automatically detects faces using MediaPipe Tasks API and applies a smart padded Gaussian Blur with OpenCV to protect privacy."

#Computer vision

princeton-prism/veritas

New 2026

#Computer vision #AI agent #Robotics

harpreetsahota204/plane_finder

New 2026

A FiftyOne Plugin that flags dark, elongated, smooth blobs as raw carbon-fiber aircraft candidates in aerial survey imagery

#Computer vision

KevinyWu/hug

415

New 2026

Human Universal Grasping

#Computer vision

andyhuo520/ppocrv6-studio

8812

New 2026

# 简短要点总结 1. 基于PP-OCRv6（Tiny/Small/Medium三版本）搭建本地OCR工作台 2. 苹果硅芯片支持CoreML硬件加速 3. 提供模型一键切换功能 4. 配套OmniDocBench评测能力

#Computer vision #Document processing

infosave2007/svetoch

101

New 2026

Optical neural computation on a commodity smartphone: OLED screen + mirror + front camera as an analog matrix engine. 101 experiments, bilingual docs, and 6 Zenodo papers.

#Computer vision

pawvej/vidgrid

New 2026

Give an AI eyes for video — turn a clip into a numbered frame grid + transcript a vision LLM can read.

#Computer vision #AI video generation

OneMuppet/img2px

New 2026

Experiment upload image, generate 3d point cloud.

#Computer vision #3D generation

machinefi/trio-retina

10431

New 2026

Model-agnostic state layer for world models — turn any detector (YOLO · VLM · DINO) into one standard, queryable stream of events + latent state. numpy-only, runs on CPU at the edge.

#Computer vision #AI infrastructure

rebwarai/CNN-MNIST-From-Scratch-CPP

New 2026

✨️ A Convolutional Neural Network implemented completely from scratch in C++ featuring custom tensors, Conv2D and Dense layers, forward and backward propagation, max pooling, dropout, cross-entropy loss, Adam optimization, and a custom JPEG decoder. No TensorFlow, PyTorch, OpenCV, or neural network libraries.

#Computer vision

chriswritescode-dev/opencode-eyesight

New 2026

OpenCode plugin that lets text-only models work with user-provided and tool-returned images by sending each image to a vision-capable model first, then replacing the image with a text description.

#Computer vision

Guney-olu/dental-cbct-segmentation-research

New 2026

Open research demo for dental CBCT anatomy segmentation using a fine-tuned NVIDIA NV-Segment-CT workflow, with 2D slice review, 3D mesh visualization

#Computer vision #AI workflow

AI agent29.7k starsAI coding assistant12.4k starsAI skills10.7k starsSelf-hosted6k starsCurated list5.2k starsMCP5.1k starsAI video generation4.9k starsAI infrastructure4.8k starsAI workflow4.8k starsWorkflow automation4.3k stars

Computer vision

Other topics