cv_vision_tracker

🤖 AI Vision Tracker - YOLOv8 DeepSORT Streamlit

A powerful, user-friendly web application for real-time object detection, tracking, and counting using state-of-the-art YOLOv8 models with DeepSORT tracking algorithm. Built with Streamlit for an intuitive interface.

🚀 Live Demo & Resources

Live Application

GitHub Pages: https://shreyansh123185655.github.io/cv_vision_tracker/
Report Google Drive: https://drive.google.com/file/d/1rz5oC1BeXpZphffsHS4AI3XFRGLiEaAK/view?usp=sharing

Quick Access

🌐 Live Demo: Interactive web application
📁 Resources: Additional files and documentation
🔗 Direct Links: Instant access to project

🎯 Project Overview

AI Vision Tracker combines cutting-edge computer vision technologies to provide:

Real-time Object Detection using YOLOv8 models
Multi-Object Tracking with DeepSORT algorithm
Object Counting with directional analysis (in/out)
Interactive Web Interface powered by Streamlit
Multiple Input Sources: Images, Videos, and Webcam

🖥️ Computer Vision in Project

This project demonstrates advanced computer vision techniques through real-world applications:

📸 Vision Pipeline Architecture

The system processes visual data through multiple stages:

Image Acquisition: Captures frames from images, videos, or webcam feeds
Preprocessing: Resizes and normalizes input data for optimal model performance
Object Detection: YOLOv8 identifies and localizes objects with confidence scores
Feature Extraction: Deep learning models extract visual features for tracking
Motion Analysis: Kalman filtering predicts object movement between frames
Data Association: Hungarian algorithm matches detections to existing tracks
Counting Logic: Virtual line crossing detection for directional counting

📊 Computer Vision Impact

Real-time Processing: Achieves 30+ FPS on GPU for live video analysis
Multi-object Handling: Simultaneously tracks dozens of objects across frames
Robust Detection: Handles various lighting conditions and object orientations
Occlusion Management: Maintains tracking during partial object visibility
Scalable Architecture: Supports different model sizes for various performance needs

📈 Visual Intelligence Features

Spatial Awareness: Understands object positions and movements in 2D space
Temporal Consistency: Maintains object identity across time sequences
Semantic Understanding: Classifies objects into 80+ categories
Behavioral Analysis: Tracks entry/exit patterns and movement directions

📸 Application Interface

Streamlit Interface

Live Detection Demo

Detection Demo 1

Model Configuration and Results

Detection Demo 2

The web interface provides real-time visualization with:

Live object detection with bounding boxes
Confidence scores for each detection
Directional counting statistics
Interactive model selection controls

Key interface features:

Sidebar Controls: Model selection (yolov8n, yolov8s, etc.)
Confidence Adjustment: Real-time threshold tuning (30-100%)
Input Sources: Image, Video, and Webcam options
Live Detection: Real-time object tracking with bounding boxes
Counting Statistics: In/out vehicle movement tracking

🏗️ Project Architecture

ai-vision-tracker/
├── app.py                    # 🎯 Main Streamlit application
├── config.py                 # ⚙️ Configuration settings
├── utils.py                  # 🔧 Utility functions
├── requirements.txt          # 📦 Python dependencies
├── packages.txt             # 🖥️ System dependencies
├── weights/detection/        # 🧠 Pre-trained YOLOv8 models
│   ├── yolov8n.pt           # 📱 Nano model (fastest)
│   └── yolov8s.pt           # 🚀 Small model (balanced)
├── videos/                   # 🎬 Sample video files
│   └── videoplayback.mp4    # 📹 Demo video
├── images/                   # 🖼️ Sample images
└── README.md                # 📚 Documentation

🔧 How It Works

1. Object Detection Engine (YOLOv8)

Model Loading: Dynamically loads YOLOv8 models based on user selection
Real-time Inference: Processes frames at configurable confidence thresholds
Multi-class Detection: Identifies 80+ COCO dataset objects
Performance Optimization: Supports different model sizes (n, s, m, l, x)

2. Tracking System (DeepSORT)

Feature Extraction: Uses deep learning to extract appearance features
Kalman Filtering: Predicts object motion between frames
Data Association: Hungarian algorithm for optimal track assignment
Track Management: Handles track creation, deletion, and state updates

3. Counting Algorithm

Directional Analysis: Tracks object movement across virtual lines
In/Out Detection: Monitors entry and exit points
Multi-class Counting: Separate counters for different object types
Real-time Statistics: Live updates of object counts

4. Web Interface (Streamlit)

Responsive Design: Adapts to different screen sizes
Interactive Controls: Model selection, confidence adjustment
Media Upload: Drag-and-drop file upload interface
Live Preview: Real-time visualization of detection results

🎮 Features & Capabilities

📊 Detection Capabilities

80+ Object Classes: Person, car, truck, bicycle, dog, cat, etc.
Bounding Box Visualization: Color-coded detection boxes
Confidence Scores: Real-time confidence display
Class Labels: Automatic object classification

🎯 Tracking Features

Multi-Object Tracking: Simultaneous tracking of multiple objects
Track Persistence: Maintains object identity across frames
Occlusion Handling: Robust tracking during partial occlusions
Cross-Frame Tracking: Seamless tracking in video sequences

📈 Counting System

Directional Counting: Separate in/out counters
Class-specific Counting: Individual counts per object type
Real-time Statistics: Live count updates
Historical Data: Track counting over time

🎨 User Interface

Model Selection: Choose from YOLOv8 variants (n, s, m, l, x)
Confidence Control: Adjustable detection threshold (30-100%)
Source Selection: Switch between Image, Video, Webcam
Upload Interface: Easy file upload with preview

🚀 Supported Input Formats

📸 Image Processing

Formats: JPG, PNG, JPEG, BMP
Resolution: Automatic resizing for optimal performance
Batch Processing: Single image analysis
Result Display: Annotated image with detections

🎬 Video Analysis

Formats: MP4, AVI, MOV, MKV
Frame-by-Frame: Real-time video processing
Progress Tracking: Video processing progress bar
Export Options: Save annotated videos

📹 Webcam Integration

Live Feed: Real-time webcam processing
FPS Control: Adjustable frame rate
Continuous Tracking: Persistent object tracking
Interactive Mode: Live detection and tracking

🧠 YOLOv8 Model Variants

Model	Size (pixels)	mAP^val50-95	Speed (CPU)	Speed (GPU)	Parameters	FLOPs
YOLOv8n	640	37.3	80.4ms	0.99ms	3.2M	8.7B
YOLOv8s	640	44.9	128.4ms	1.20ms	11.2M	28.6B
YOLOv8m	640	50.2	234.7ms	1.83ms	25.9M	78.9B
YOLOv8l	640	52.9	375.2ms	2.39ms	43.7M	165.2B
YOLOv8x	640	53.9	479.1ms	3.53ms	68.2M	257.8B

🛠️ Technical Stack

Core Technologies

YOLOv8 - Object Detection
DeepSORT - Multi-Object Tracking
Streamlit - Web Framework
OpenCV - Computer Vision
PyTorch - Deep Learning Framework

Dependencies

Python 3.7+ - Core programming language
NumPy - Numerical computations
Pillow - Image processing
Matplotlib - Visualization
Requests - HTTP handling

📦 Installation & Setup

1. Clone Repository

git clone https://github.com/Shreyansh123185655/ai-vision-tracker.git
cd ai-vision-tracker

2. Create Virtual Environment

python -m venv ai-vision-env
source ai-vision-env/bin/activate  # On Windows: ai-vision-env\Scripts\activate

3. Install Dependencies

pip install -r requirements.txt
pip install streamlit ultralytics

4. Run Application

streamlit run app.py

🌐 Deployment Options

Streamlit Cloud (Recommended)

URL: ai-vision-tracker.streamlit.app
Features: Free hosting, auto-scaling, GitHub integration
Setup: Connect repository → Auto-deploy

Hugging Face Spaces

URL: hf.co/spaces/Shreyansh123185655/ai-vision-tracker
Features: Free GPU, ML community, easy sharing

Render

Features: Full-featured hosting, custom domains, SSL

Docker Deployment

docker build -t ai-vision-tracker .
docker run -p 8501:8501 ai-vision-tracker

🎯 Use Cases & Applications

🚗 Traffic Management

Vehicle counting and classification
Traffic flow analysis
Parking space monitoring
Speed estimation

🏪 Retail Analytics

Customer counting and tracking
Store layout optimization
Heat map generation
Dwell time analysis

🏭 Industrial Monitoring

Production line monitoring
Safety compliance
Equipment tracking
Quality control

🔒 Security & Surveillance

Intrusion detection
Perimeter monitoring
Crowd analysis
Anomaly detection

🎮 How to Use

1. Select Model

Choose YOLOv8 variant from sidebar
Adjust confidence threshold (30-100%)

2. Choose Input Source

Image: Upload image file
Video: Upload video file
Webcam: Use live camera feed

3. Configure Settings

Set detection confidence
Choose tracking parameters
Adjust counting regions

4. Run Analysis

Click “Start Detection”
View real-time results
Monitor object counts
Export results if needed

📊 Performance Metrics

Detection Accuracy

mAP@0.5: 50-95% (depending on model)
Processing Speed: 30-100 FPS (GPU)
Memory Usage: 2-8GB (model dependent)

Tracking Performance

Track Accuracy: 85-95%
ID Switch Rate: <5%
Tracking Duration: Persistent across frames

🔧 Configuration Options

Model Settings

# Model selection
MODEL_TYPE = "yolov8s"  # n, s, m, l, x

# Detection confidence
CONFIDENCE_THRESHOLD = 0.5  # 0.3 - 1.0

# Input resolution
INPUT_SIZE = 640  # 320, 416, 512, 640, 1280

Tracking Parameters

# DeepSORT settings
MAX_DISAPPEARANCE = 30  # frames
MAX_TRACK_AGE = 100     # frames
N_INIT = 3             # consecutive detections

🐛 Troubleshooting

Common Issues

1. Model Loading Error

Ensure model files exist in weights/detection/
Check file permissions
Verify model integrity

2. GPU Not Detected

Install CUDA-compatible PyTorch
Check GPU drivers
Verify CUDA availability

3. Webcam Access Denied

Check browser permissions
Ensure HTTPS connection
Test with different browsers

Performance Optimization

1. Increase Speed

Use smaller models (YOLOv8n/s)
Reduce input resolution
Lower confidence threshold

2. Improve Accuracy

Use larger models (YOLOv8m/l/x)
Increase input resolution
Fine-tune confidence threshold

🤝 Contributing

Development Setup

git clone https://github.com/Shreyansh123185655/ai-vision-tracker.git
cd ai-vision-tracker
git checkout -b feature/your-feature-name
# Make changes
git commit -m "Add your feature"
git push origin feature/your-feature-name

Code Style

Follow PEP 8 guidelines
Add docstrings to functions
Include type hints
Write unit tests

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Ultralytics - YOLOv8 framework
Streamlit - Web framework
DeepSORT - Tracking algorithm
OpenCV - Computer vision library
PyTorch - Deep learning framework

📞 Contact & Support

GitHub Issues: Report bugs
Discussions: Feature requests
Email: shreyanshgupta@example.com

🚀 Quick Start

# Clone and run in one command
git clone https://github.com/Shreyansh123185655/ai-vision-tracker.git && cd ai-vision-tracker && pip install -r requirements.txt && streamlit run app.py

🎉 Your AI Vision Tracker will be running at http://localhost:8501!

This site is open source. Improve this page.