Discover Computer Vision Projects Efficiently

Leverage Vollna to streamline your search for "Computer Vision" projects on Upwork. Use advanced filters, get instant updates, and monitor your performance to boost success.
Signup for free to get access to all filter attributes and instant notifications when new jobs are posted.
Setup filter



Get access to over 30+ filter attributes, setup instant notifications, integrate with your CRM and marketing tools, and more.
Start free trial
306 projects published for past 72 hours.
Job Title Budget
AI/ML/CV/OCR Model Development on Azure -- 2
~146 - 438 USD 1 day ago
Client Rank - Excellent

Payment method verified
$86 178 total spent
82 hires , 3 active
10 open job
5.00 of 14 reviews
Registered at: 14/10/2021
IN India
Excellent
I'm looking for a skilled data scientist or machine learning engineer to develop an AI/ML/CV/OCR model. This model will be deployed on Azure and will process pictures taken by a camera from different angles and sides of a screen. The screen displays a dashboard with varying orientations and layouts of the same values in different fonts, sizes, and locations.

Key Responsibilities:
- Data scraping for increasing model accuracy
- Pre-processing for real-life issues such as shadows, wide cam lights, glares, and movement obstructions
- Detecting values of the Regions of Interest (ROIs)
- Validating the detected values

The ideal candidate should:
- Be proficient in Python or any other cost-effective, scalable language suitable for handling multiple inputs
- Have experience with Azure cloud platform
- Have strong background in handling glare, shadow reduction, and movement obstruction correction

I'm looking for someone who can handle the project from start to finish, including deployment. My tech team will assist you along the way.

Skills: Python, Django, Machine Learning (ML), Computer Vision, Microsoft Azure
Fixed budget: 12,500 - 37,500 INR
1 day ago
  • Websites, IT & Software, Engineering & Science, Python, Django, Computer Vision, Machine Learning (ML), Microsoft Azure
AR Filter Needed
500 USD 1 day ago
Client Rank - Excellent

Payment method verified
$134 639 total spent
261 hires
136 jobs posted
100% hire rate, open job
4.67 of 104 reviews
US United States
Excellent
Overview:
Build a face-tracking AR filter that uses image recognition to measure eyes for lash application. The filter draws lines on the user’s face, identifies eye shape (almond, round, slim), plane (ascending, descending, even), set (wide, close, proportionate), and eyelid type, then recommends lash styles.
Responsibilities:

- **Filter Creation:** Implement real-time face tracking and overlay lines for eye measurements.
- **Image Recognition:** Detect eye landmarks, measure distances/angles, classify shapes, and eyelid types.
- **User Interface:** Design a clear, intuitive AR experience aligned with brand colors and fonts.
- **Optimization & Testing:** Ensure smooth performance on various devices and in different lighting.
- **Iteration:** Update features based on user feedback and maintain compatibility with AR platforms.
**Requirements:**
- Proficiency in AR development (Spark AR, Lens Studio, Unity) and computer vision (e.g., OpenCV).
- Experience translating facial measurements into on-screen overlays.
- Strong collaboration skills with design and beauty teams.

To apply, submit your resume and samples of relevant AR/computer vision working
Skills: Augmented Reality, Facial Recognition
Fixed budget: 500 USD
1 day ago
  • Web, Mobile & Software Dev, Other - Software Development
AI-Powered Cloud Inference Pipeline Development
~146 - 438 USD 1 day ago
Client Rank - Excellent

Payment method verified
$86 178 total spent
82 hires , 3 active
10 open job
5.00 of 14 reviews
Registered at: 14/10/2021
IN India
Excellent
Project Title:
Cloud-Based AI Pipeline (YOLO + OCR) for Extracting Data from Dynamic Screens via Image Input

Description:
We are developing a system that captures images of dashboards/screens and extracts key labeled values (e.g., “Label AB,” “Label XY”) using a camera.
Our internal team handles the hardware/camera system and uploads images to the cloud.

We are looking for an experienced AI/ML + Azure cloud developer to build the cloud-side inference pipeline using computer vision + OCR techniques. The entire system must be Dockerized and deployed on Azure.

Scope of Work:
☁️ 1. AI/ML Inference Pipeline on Azure
Receive uploaded screen images from Azure Blob or an API

Preprocess images for:

Blur detection

Glare/reflection handling

Shadow correction

Screen cutoff validation

Use object detection (e.g., YOLO) to detect key screen regions ("Label AB", "Label CD", etc.)

Use OCR to extract numeric or alphanumeric values next to those labels

Match labels → values dynamically (layout agnostic)

Return structured JSON output including:

Detected values

Confidence scores

Any errors or quality issues flagged

? 2. Error & Validation Handling
Flag and tag issues such as:

Image is too blurry

Value not detected

Label missing

Low OCR confidence

Screen not fully visible

Provide this metadata in a clean JSON format

Example:

json
Copy
Edit
{
"label_ab": {
"value": "97",
"confidence": 0.91
},
"label_cd": {
"value": null,
"error": "Label not detected"
},
"image_quality": {
"sharpness_score": 28.5,
"glare_detected": true,
"issues": ["Blurry", "Cutoff at top"]
}
}
? 3. API Development (for mobile app integration)
REST API endpoints to:

Receive an image from the device

Return extracted values + validation flags

(Optional) retrieve previous results by ID or timestamp

API must include image error feedback that our mobile app team can consume and display to users during or after upload.

? 4. Dockerization & Deployment
Package the full pipeline into a Docker container

Deploy to Azure Container Instance or similar

Provide documentation for:

Building and running the container

Updating the model or logic

Environment variables and settings

? 5. Feedback Loop + Retraining Folder Setup
Save flagged/failed cases in a retraining-friendly folder structure

bash
Copy
Edit
/flagged_cases/
/retrain_data/
(Optional) provide a script or structure for retraining the model with new labeled data later

Support manual model upgrading (drop-in new model + rebuild Docker)

?️ Tech Stack Required:
Python (preferred)

OpenCV, PyTorch or TensorFlow

YOLOv5/YOLOv8 or similar detection model

OCR: EasyOCR, Tesseract, or PaddleOCR

Azure Blob Storage, Azure Functions or HTTP Trigger

Azure Container Instance (ACI) or App Service

REST API: FastAPI or Flask

Docker

✅ What We Provide:
Sample labeled screen images (2 dashboard layouts)

Base model checkpoint (if needed)

Azure environment + access

Input/output format specs

Our team handles image capture and uploads

? Deliverables:
Dockerized AI pipeline

Deployed on Azure (ACI preferred)

REST API with:

JSON outputs for detected labels + values

Error tagging + image quality flags

Folder structure + simple script for future retraining

Clear deployment + update documentation

✨ Nice to Have (Bonus):
Experience with model versioning

CI/CD familiarity (Docker push → deploy flow)

Ability to collaborate on feedback-based model improvement

Skills: Machine Learning (ML), Docker, OpenCV, Computer Vision, Microsoft Azure
Fixed budget: 12,500 - 37,500 INR
1 day ago
  • Websites, IT & Software, Engineering & Science, Docker, OpenCV, Computer Vision, Machine Learning (ML), Microsoft Azure
Generative AI Developer for Healthcare Startup
15 - 30 USD / hr
1 day ago
Client Rank - Risky

Payment method verified
$1 266 total spent
1 hires
0.00 of 1 reviews
AU Australia
Risky
We are an Australian-based healthcare startup seeking a talented developer with experience in Generative AI, Supabase, and Flutterflow. The ideal candidate will help us enhance our platform by integrating AI-driven solutions and optimizing database queries. If you have a passion for innovative health tech and the skills to bring ideas to life, we want to hear from you. Collaboration and communication are key, as you'll be working closely with our team to create impactful solutions that improve patient care.
Skills: RESTful API, Artificial Intelligence, Supabase, Google Cloud Platform, FlutterFlow, NodeJS Framework, Computer Vision
Hourly rate: 15 - 30 USD
1 day ago
  • Web, Mobile & Software Dev, AI Apps & Integration
MapleStory Resource Farmer Needed
2 - 3 USD / hr
1 day ago
Client Rank - Risky

Payment method not verified
1 open job
no reviews
Registered at: 24/03/2025
US United States
Risky
I'm in need of an experienced MapleStory player to help me with farming resources in the game. The primary focus will be on gathering Gold/mesos and materials for crafting.

Ideal Skills and Experience:
- understanding of MapleStory and its mechanics
- Proven track record in resource farming
- Ability to play solo and efficiently
- Available for 40 hours a week
- Commitment to working for $1.50-2.00 an hour

Your tasks will not include joining specific parties or guilds, but rather playing solo and maximizing resource acquisition. This project offers a steady weekly playtime, with the potential for future work based on performance.

Skills: Time Management, Computer Vision
Hourly rate: 2 - 3 USD
1 day ago
  • Websites, IT & Software, Data Entry & Admin, Computer Vision, Time Management
Generative AI Expert on OPT - Looking for Urgent Contract
25 - 80 USD / hr
1 day ago
Client Rank - Medium

Payment method verified
no reviews
US United States
Medium
Only freelancers located in the U.S. may apply.
URGENTLY NEEDED! 🚨

I am on F-1 OPT and need to extend my employment through STEM OPT. My current employer is not E-Verified, and I need an E-Verified employer who can provide a contract within the next 10 days. Pay doesn't matter much. If you or anyone you know can help, please contact me immediately.

About Me: I am a machine learning scientist with a Ph.D. from UC Irvine, specializing in artificial intelligence. My expertise spans LLMs, VLMs, and multi-modal models, with hands-on experience in building scalable pipelines and infrastructure for research and production across various data modalities. At AlgoFace, I developed a facial attribute analysis and search system using CLIP, DALL·E, and GPT-4, with fine-tuning and multi-modal pipelines that outperformed all prior baselines. At Pandita AI, I designed all layers of the AI Stack (AI-focused IaaS and PaaS) and delivered the core technical strategy that led to a multi-million-dollar investment. At Rootick, I built a custom LLM pipeline for large-scale product classification and created the full dataset via web scraping and SQL-backed preprocessing.
I’m fluent in Python, PyTorch, Hugging Face, TensorFlow, and the OpenAI API, and experienced with RAG pipelines, prompt engineering, and model fine-tuning (LoRA, QLoRA). I’ve deployed across AWS, GCP, Azure, and Lambda Labs.



Thank you!
Skills: Diffusion Model, Large Language Model, Multimodal Large Language Model, AI Chatbot, AI Content Creation, ChatGPT, AI Text-to-Image, AI Content Writing, Facial Recognition, Image Processing, Image Recognition, Image-to-Image Translation, Natural Language Generation, Natural Language Understanding, Sequence Modeling, Synthetic Data Generation, Text Summarization, Generative Adversarial Network, Vision Transformer, Variational Autoencoder, Transformer Model, Autoencoder, CLIP Guidance, ChatGPT API Integration, AI Model Training, LLM Prompt Engineering, Generative AI Prompt Engineering, Generative AI, Retrieval Augmented Generation, BERT, Claude 3, Claude 2, DALL-E, DALL-E 2, DALL-E 3, Gemini, GPT-3.5, GPT-4, GPT-3, Artificial Intelligence, Natural Language Processing, Machine Learning, Python, Computer Vision, Deep Learning, Artificial Neural Network, Data Science, TensorFlow, JavaScript
Hourly rate: 25 - 80 USD
1 day ago
  • Data Science & Analytics, AI & Machine Learning
AI App Development with GPT, Voice Synthesis & Video Avatars
not specified 23 hours ago
Client Rank - Risky

Payment method not verified
no reviews
CA Canada
Risky
We are seeking an experienced developer to create an innovative AI application leveraging OpenAI's GPT for natural language processing, ElevenLabs for voice synthesis, and DeepBrain AI for video avatar integration. The ideal candidate should have a strong background in AI and deep learning technologies, along with a passion for building cutting-edge applications. If you have experience working with these tools and can deliver a seamless user experience, we want to hear from you!
Skills: Artificial Intelligence, Machine Learning, TensorFlow, Deep Learning, Computer Vision
Budget: not specified
23 hours ago
  • Data Science & Analytics, AI & Machine Learning
Siamese Network Python Application Development using Detectron2 and Pytorch
1,000 USD 23 hours ago
Client Rank - Good

Payment method verified
$6 050 total spent
2 hires
1 jobs posted
100% hire rate, open job
5.00 of 1 reviews
US United States
Good
**Contract Opportunity: Siamese Network Developer (short Project)**

We seek an experienced deep learning engineer for a computer vision project implementing a production-ready siamese network application. This contract role requires 30-40 hrs/week commitment with remote flexibility.

**Tech Stack Requirements:**
- Python (PyTorch, Detectron2)
- Tkinter GUI development
- Dataset curation/processing
- Model optimization & deployment

**Key Responsibilities:**
1. Develop image pair processing pipeline
2. Implement siamese architecture with custom similarity metrics
3. Create intuitive GUI for model training/inference
4. Achieve 85% (or higher) matching accuracy on test set
5. Document technical implementation

**Ideal Candidates Have:**
- 2+ years DL experience with published models
- Portfolio showing similarity learning projects
- Experience with metric learning techniques
- Familiarity with FaceScrub/LFW datasets
Skills: Python, TensorFlow, Machine Learning, Neural Network, Keras
Fixed budget: 1,000 USD
23 hours ago
  • Data Science & Analytics, AI & Machine Learning
Looking for AI Image Editing Expert for learning use of tools to alter anatomy/kinesiology images
300 USD 22 hours ago
Client Rank - Medium

Payment method verified
no reviews
US United States
Medium
I need to alter and edit anatomy/kinesiology/osteopathy images to suit my teaching and marketing needs. I want to be abble to use AI based tools for this. Looking for someone who can successfully do this using tools that they can then show me.
Skills: Adobe Photoshop, Adobe Illustrator, Machine Learning, Artificial Intelligence, Graphic Design, Computer Vision, Image Processing, Photo Editing, Image Editing
Fixed budget: 300 USD
22 hours ago
  • Design & Creative, Graphic, Editorial & Presentation Design
Senior AI/ML Engineer
20 - 30 USD / hr
21 hours ago
Client Rank - Medium

Payment method verified
no reviews
US United States
Medium
Only freelancers located in the U.S. may apply.
Job Description:
We’re looking for a Senior AI/ML Engineer to design, build, and optimize AI-driven solutions. You’ll work on machine learning models, fine-tune LLMs, and deploy scalable AI applications. If you have strong experience in Python, TensorFlow, PyTorch, OpenAI API, and cloud platforms (AWS, GCP, or Azure), we’d love to hear from you!

Responsibilities:
- Develop and optimize AI/ML models for real-world applications.
- Fine-tune and deploy large language models (LLMs) and deep learning solutions.
- Build scalable machine learning pipelines and integrate AI into applications.
- Work with NLP, computer vision, and recommendation systems.
- Optimize models for performance, accuracy, and cost efficiency.
- Deploy AI solutions using Docker, Kubernetes, and cloud services.
- Collaborate with developers, data scientists, and product teams.

Requirements:
- 5+ years of experience in AI/ML engineering.
- Strong expertise in Python, TensorFlow, PyTorch, or Scikit-learn.
- Experience with LLMs (GPT, BERT, etc.), NLP, and computer vision.
- Hands-on experience with AWS, GCP, or Azure for AI model deployment.
- Solid understanding of MLOps, CI/CD pipelines, and model optimization.
- Experience with big data tools like Spark, Dask, or Ray is a plus.
- Strong problem-solving skills and ability to work independently.
Skills: Machine Learning, TensorFlow, Artificial Intelligence
Hourly rate: 20 - 30 USD
21 hours ago
  • Data Science & Analytics, AI & Machine Learning
AI model developer
not specified 17 hours ago
Client Rank - Excellent

Payment method verified
$23 470 total spent
135 hires
83 jobs posted
100% hire rate, open job
4.31 of 58 reviews
US United States
Excellent
We are seeking a highly motivated AI Engineer – LLM Development to lead the design, training, and deployment of cutting-edge Large Language Models (LLMs). This role offers the opportunity to work at the forefront of Generative AI, developing language models that power intelligent applications, conversational agents, creative tools, and AI co-pilots.

More information will be given in next step.

Preferring whole team havingresearchers, data scientists, MLOps engineers
Skills: Artificial Intelligence, Machine Learning, Python, TensorFlow, Artificial Neural Network, Neural Network, Deep Learning, Natural Language Processing, Computer Vision
Budget: not specified
17 hours ago
  • Data Science & Analytics, AI & Machine Learning
Computer vision enginer
18 - 45 USD / hr
17 hours ago
Client Rank - Excellent

Payment method verified
$11 124 total spent
22 hires
1 jobs posted
100% hire rate, open job
4.93 of 14 reviews
US United States
Excellent
I'm looking for a skilled AI developer or machine learning engineer to help build a tool that can analyze a photo of an item and identify what it is.

The goal is to create an AI system that can take an image (uploaded by a user or captured via camera) and return a description, category, or even a specific product name. This could be something like:

User uploads a picture of a sneaker → Tool returns: "Nike Air Max 90" or "Running Shoe"

Picture of a mug → Tool returns: "Ceramic Coffee Mug" or "Drinkware"

🛠️ Skills & Experience Required:
Strong experience in Computer Vision and Image Recognition
Familiar with object detection and image classification
Proficiency in TensorFlow, PyTorch, or similar ML frameworks

Experience with pre-trained models (e.g. YOLO, ResNet, EfficientNet) and/or building custom models
Knowledge of OpenCV and image preprocessing techniques

Ability to work with labeled datasets or guide data collection/labeling

Bonus: Experience deploying models to the cloud or on mobile apps

🔍 What I Need Help With:
Consulting on the best approach for this tool

Model development (using existing models or training a custom one)

Model evaluation and refinement

(Optional) Basic prototype or UI to test the model

💬 To Apply:
Please share:

Relevant past projects (especially anything involving image classification or object detection)

Your preferred tech stack

Any thoughts on how you'd approach a project like this

Your availability and rough estimate of how long something like this might take

Looking forward to hearing from you!
Skills: AI Model Training, AI Model Development, Computer Vision, Machine Learning
Hourly rate: 18 - 45 USD
17 hours ago
  • Data Science & Analytics, AI & Machine Learning
Back-End Developer Needed for Object Recognition in Meta Ray-Ban Glasses Livestream
10 - 25 USD / hr
15 hours ago
Client Rank - Good

Payment method verified
$1 767 total spent
6 hires
4 jobs posted
100% hire rate, open job
3.66 of 4 reviews
AE United Arab Emirates
Good
Description:
We are looking for an experienced AI developer to help us integrate a real-time object recognition system using Meta Ray-Ban Smart Glasses. The goal is to livestream to Instagram while analyzing the feed to recognize and identify objects in real-time, displaying the results on a connected mobile phone.

Key Requirements:
- Experience with computer vision and real-time object recognition
- Proficiency in Python, OpenCV, TensorFlow, or similar AI/ML frameworks
- Familiarity with Meta Ray-Ban Glasses SDK and Instagram Live API
- Ability to process live video streams and overlay recognized objects
- Experience with API integration for real-time data transfer to mobile

Project Scope:
- Enable Instagram Live streaming from the Meta Ray-Ban Glasses
- Process the live video feed to detect and recognize objects
- Stream real-time object recognition results to a mobile app or overlay on the video
Skills: API, Computer Vision, Real Time Stream Processing
Hourly rate: 10 - 25 USD
15 hours ago
  • Web, Mobile & Software Dev, Web Development
Sensor fusion machine vision
150 USD 15 hours ago
Client Rank - Medium

Payment method verified
$400 total spent
1 hires
1 jobs posted
100% hire rate, open job
5.00 of 1 reviews
ZA South Africa
Medium
This project aims to implement a robust object detection system utilizing the MobileNet SSD model
combined with automotive radar for sensor fusion. This system will detect, track, and count objects (mining
machines and pedestrians) while ensuring no objects are lost during detection and tracking. The system will
also predict the movement paths of detected objects to enhance situational awareness in dynamic
environments.
Skills: Deep Learning, Edge AI, Computer Vision, Python, Machine Learning, CAN Bus
Fixed budget: 150 USD
15 hours ago
  • Data Science & Analytics, AI & Machine Learning
AI Model Trainer and Augmented Reality Software Developer
18 - 35 USD / hr
14 hours ago
Client Rank - Risky

Payment method not verified
no reviews
SG Singapore
Risky
About Us:

We are an innovative team passionate about creating cutting-edge augmented reality experiences. We're developing a groundbreaking AR application that will revolutionize how users interact with their environment. Our app will utilize smartphone camera input to detect and recognize real-world objects, providing dynamic visual overlays and interactive information. We are seeking a talented and driven AR App Developer to bring this vision to life.

Our project involves developing an AR application that:

Uses the smartphone's camera feed to detect and recognize objects in real-time.

Employs computer vision and machine learning techniques for accurate object identification. (Knowing how to train or pull from YOLO websockets will be a plus)

Generates and displays dynamic visual overlays based on the detected objects.

Potentially integrates interactive elements and user input.

Optimizes performance for smooth and responsive AR experiences on mobile devices, with specific emphasis on Android.

Required Skills and Qualifications:

Proven experience in developing AR applications for mobile platforms (Android).

Strong proficiency in Unity (or other relevant game engine) and C# (or similar).

Experience with AR SDKs such as ARKit, ARCore, or Vuforia.

Understanding of computer vision and machine learning concepts.

Experience with object detection and recognition libraries (e.g., TensorFlow Lite, OpenCV).

Experience with the Android SDK, including configuration, building, and deployment.

Strong understanding of mobile app development principles and best practices.

Excellent problem-solving and debugging skills.

Ability to work independently and collaboratively with non-technical stakeholders to understand their requirements from a more layman perspective.

Experience with optimizing mobile applications, writing code with the ability to be cross-platform down the line and code reusability by coding onto standardised languages.

Preferred Skills:

Knowledge of cloud-based services for data storage and processing.

Experience with cross-platform development.

Experience with machine learning model creation.

Experience in creating AR app that was focused for Education and Training purposes

Experience in creating customised 3D art to be used in AR

Communication of technical terms and code to provide an efficient and lean coding solution, providing layman explanations on how to reconfigure app and putting in brand new plugins down the line on what was built.


Disclaimer: "All code, assets, and deliverables created by the developer as part of this project will be the sole and exclusive property of us. The developer agrees not to use, distribute, or reproduce any of the code or assets created for this project for any other purpose, including personal or commercial projects, without the express written consent from us. This includes any algorithms, machine learning models, UI elements, or other software components developed during the term of this contract."
Skills: Artificial Intelligence, Machine Learning, Augmented Reality, TensorFlow, Virtual Reality
Hourly rate: 18 - 35 USD
14 hours ago
  • Web, Mobile & Software Dev, Other - Software Development
Computer Vision Project
50 USD / hr
9 hours ago
Client Rank - Risky

Payment method not verified
1 open job
no reviews
Registered at: 04/03/2025
IL Israel
Risky
I'm looking for an expert in computer vision for a project focused on image recognition of vehicles from traffic camera footage. This project will involve developing and implementing algorithms to accurately identify and classify vehicles in real-time.

Ideal skills and experience for the job include:
- Proficiency in computer vision and image processing techniques
- Experience with traffic camera footage analysis
- Strong background in machine learning and algorithm development
- Ability to work on real-time image recognition tasks

Please provide examples of similar projects you've completed in your proposal.

Skills: Python, Matlab and Mathematica, Algorithm, C++ Programming, Mathematics
Hourly rate: 50 USD
9 hours ago
  • Websites, IT & Software, Engineering & Science, Python, C++ Programming, Matlab and Mathematica, Algorithm, Mathematics
Completion and optimization of our system
not specified 8 hours ago
Client Rank - Excellent

Payment method verified
$13 456 total spent
11 hires
1 jobs posted
100% hire rate, open job
5.00 of 8 reviews
LV Latvia
Excellent
These are acceptance criteria for backend code

1) Run 30+ cameras on one t4 instance
2) Make real-time streaming stable, without flickering and delays
3) Check mongoDB usage and optimize it
4) Fix bottlenecks
5) These changes cannot affect current system functions and should work as it was but with improvements (so nothing breaks in functionalities when implementing updates)
6) Launchable tool after improvements
Skills: Artificial Intelligence, Machine Learning, Python, Computer Vision, Natural Language Processing, Large Language Model, Generative AI, Digital Signal Processing, Deep Learning, Time Series Analysis, Chatbot Development, OpenCV, PyTorch, TensorFlow, Data Preprocessing
Budget: not specified
8 hours ago
  • Data Science & Analytics, AI & Machine Learning
Aircraft dataset annotation
not specified 7 hours ago
Client Rank - Medium

Payment method verified
no reviews
UA Ukraine
Medium
dataset is available here: https://github.com/hapless19/UAV2UAV-dataset.
direct download link is: https://drive.google.com/drive/folders/1ZPmR2jIAetkypgrc6VurUXxqE7_LUxFK (2.6 Gb archive).

I need each aircraft in each sequence to be annotated with a bounding box, annotation format should be Yolo, each sequence should be kept in a separate folder like it is now.
Skills: Data Entry, CVAT, DICOM, Computer Vision, Adobe Photoshop, Data Annotation, Machine Learning, Fact-Checking, Data Segmentation, Image Editing, Image Processing, Data Labeling, Video Annotation, Image Annotation, GIMP
Budget: not specified
7 hours ago
  • Admin Support, Virtual Assistance
Implement API for skeleton model
2,500 USD 7 hours ago
Client Rank - Good

Payment method verified
$4 929 total spent
7 hires
4 jobs posted
100% hire rate, open job
no reviews
HK Hong Kong
Good
Tasks
1. Integrate VitPose Model: We will replace the existing MediaPipe model with the VitPose model in our codebase. This transition aims to leverage VitPose's advanced capabilities for pose estimation, leading to improved accuracy and performance in analyzing golf swings.

2.Implement Golf Club Tracking API: The current implementation using YOLO v5 will be substituted with a more advanced golf club tracking API, which utilizes Grounding DINO. This upgrade is expected to enhance the precision of golf club detection, allowing for more detailed analytics and insights into the player's performance.

3. Transform Functionality into API: We will revamp the entire analytics functionality along with the existing code, converting it into a well-structured API. This transformation will facilitate seamless integration for future enhancements, making it easier to access and utilize the analytics features across various platforms.

Current State of the Code
Currently, our codebase includes an existing model for golf club tracking and pose estimation, which generates output based on video input. We also have an API that features an updated golf club tracking API; however, this has yet to be integrated into the system. A video demonstration showcasing the current output is available, highlighting the effectiveness of our existing model and the areas where improvements can be made.
Skills: Python, API, Java, Computer Vision
Fixed budget: 2,500 USD
7 hours ago
  • Web, Mobile & Software Dev, Web Development
Python Code Review for Object Detection
30 - 250 USD 7 hours ago
Client Rank - Risky

Payment method not verified
1 open job
no reviews
Registered at: 25/03/2025
SA Saudi Arabia
Risky
I'm looking for an experienced Python developer to proofread my code running on a Linux server.

Key tasks include:
- Identifying and correcting syntax errors
- Debugging any logical errors
- Optimizing for performance issues

The Python code is designed for object detection on a camera, so experience with image processing or computer vision libraries would be highly beneficial. Please ensure that you're comfortable working on a Linux server.

Skills: Python, Linux, Artificial Intelligence
Fixed budget: 30 - 250 USD
7 hours ago
  • Websites, IT & Software, Python, Linux, Artificial Intelligence
Gohighlevel work
not specified 5 hours ago
Client Rank - Medium

Payment method verified
no reviews
US United States
Medium
need help on
Automation
Workflow
Phone setup
Website setup
Skills: Computer Vision, Machine Learning Model, Machine Learning, Natural Language Processing, Python, Docker, Product Development, Engineering & Architecture, Software Development, Artificial Intelligence, Data Visualization, GitHub, GitLab, Jupyter Notebook, Raspberry Pi
Budget: not specified
5 hours ago
  • Web, Mobile & Software Dev, Scripts & Utilities
AI GPU Optimization Engineer
not specified 4 hours ago
Client Rank - Good

Payment method verified
$9 318 total spent
16 hires
11 jobs posted
100% hire rate, open job
4.99 of 5 reviews
US United States
Good
We are seeking a highly skilled developer to create an advanced system for optimizing AI workloads across various GPU platforms.
This role focuses on designing efficient load-balancing techniques using C++ and Python to maximize computational performance.
The ideal candidate should have a deep understanding of GPU architectures and experience in AI model acceleration.
Join us to develop cutting-edge solutions that push the boundaries of AI efficiency.
Skills: C++, TensorFlow, Machine Learning, Artificial Intelligence, Python, Computer Vision, GPU
Budget: not specified
4 hours ago
  • Data Science & Analytics, AI & Machine Learning
**Investment & Market Analysis Consultant**
8 - 15 USD / hr
3 hours ago
Client Rank - Risky

Payment method not verified
1 open job
no reviews
Registered at: 25/03/2025
AE United Arab Emirates
Risky
I am looking for a seasoned professional well-versed in Investment and Market Analysis. My primary focus is on an AI project in the VIP sector, specifically utilizing computer vision for object detection.

Skills and experience needed:
- Expertise in investment and market analysis
- Extensive knowledge in AI, particularly in machine learning and computer vision
- Proven track record in developing strategies for the VIP sector
- Excellent understanding of object detection within AI projects

I need someone who can provide tailored strategies to help navigate this complex project, and ultimately lead it towards success. Please reach out if you possess these qualifications.

Skills: Real Estate, AI (Artificial Intelligence) HW/SW, Business Strategy, Business Consulting, FinTech
Hourly rate: 8 - 15 USD
3 hours ago
  • Websites, IT & Software, Engineering & Science, Business, Accounting, Human Resources & Legal, AI (Artificial Intelligence) HW/SW, Real Estate, Business Strategy, Business Consulting, FinTech
Block ai image processing
not specified 3 hours ago
Client Rank - Risky

Payment method not verified
no reviews
US United States
Risky
Hi
I am looking for someone that can help block image recognition of a digital picture that I upload to the internet on a specific platform I have many pictures that need that i would a program done that a laymen should be able to operate it.
I need the pictures still be visible to the eye but not be able to have image processing to extract any info of it or have the picture match up to other pictures etc
Skills: Computer Vision, Machine Learning, Feature Extraction, Deep Learning, Image Processing, Digital Signal Processing, Artificial Intelligence, Augmented Reality, Pattern Recognition, Mixed Reality, AI Development, AI App Development, AI Consulting, Generative AI, AI Text-to-Image
Budget: not specified
3 hours ago
  • Data Science & Analytics, AI & Machine Learning
AI/ML Engineer – Model Training, Optimization & Deployment
not specified 3 hours ago
Client Rank - Good

Payment method verified
$5 196 total spent
14 hires
5 jobs posted
100% hire rate, open job
5.00 of 1 reviews
CA Canada
Good
Role Overview:
We’re looking for an AI/ML Engineer who can take over the existing AI model, train it further, improve performance, and prepare it for scalable deployment. You’ll be working closely with our team to shape the backbone of SoleBot AI—our intelligent shoe assistant.

Responsibilities:

Review and understand our current AI/ML model and training data.

Enhance and fine-tune the model for better performance, accuracy, and speed.

Expand the model’s capabilities based on our roadmap (e.g., shoe recognition, smart recommendations, visual ID, conversational assistance).

Optimize model architecture for low-latency usage within our app environment.

Handle deployment (cloud-based or edge) and integration with our existing system (mobile + backend).

Collaborate with our software and product team for seamless integration and testing.

Assist in establishing a feedback loop to continuously improve model performance over time.

Required Skills & Experience:

Proven experience developing, training, and deploying AI/ML models.

Proficiency in Python, TensorFlow, PyTorch, or similar frameworks.

Experience with computer vision models (image classification, object detection, etc.).

Strong background in NLP is a bonus, especially for conversational AI integration.

Familiarity with model optimization techniques for mobile and real-time applications.

Experience with model deployment via cloud platforms (AWS, GCP, Azure) or on-device.

Ability to write clean, well-documented code and collaborate with cross-functional teams.

Nice to Have:

Experience in sneaker or fashion tech (or interest in the culture).

Familiarity with AI-driven chat interfaces.

AR/AI integration experience is a big plus
Skills: AI Bot, AI Development, AI App Development, AI Implementation, AI Model Development, Machine Learning, Artificial Intelligence, Deep Learning, Data Science, Natural Language Processing
Budget: not specified
3 hours ago
  • Data Science & Analytics, AI & Machine Learning
Body Scanning & AI Integration for E-commerce
~146 - 438 USD 2 hours ago
Client Rank - Risky

Payment method not verified
1 open job
no reviews
Registered at: 25/03/2025
IN India
Risky
I'm looking for an expert in body scanning and computer vision AI technology to enhance my existing e-commerce site. The main goal is to provide accurate sizing and fitting recommendations for clothing.

Key Requirements:
- Develop a body scanning technology that can be accessed through a web-based tool.
- Integrate this technology seamlessly into my existing e-commerce platform.
- Ensure the system is capable of providing sizing and fitting recommendations based on the body scans.

Ideal Skills:
- Proven experience with body scanning technology and computer vision AI.
- E-commerce platform development and integration expertise.
- Strong understanding of clothing sizing and fitting processes.

This project is aimed at improving customer satisfaction and reducing return rates due to sizing issues.

Skills: PHP, Python, Shopping Cart Integration, eCommerce, HTML
Fixed budget: 12,500 - 37,500 INR
2 hours ago
  • Websites, IT & Software, Design, Media & Architecture, Python, Shopping Cart Integration, eCommerce, HTML
AI-Powered Video Analytics Software Developer
not specified 1 hour ago
Client Rank - Excellent

Payment method verified
$89 167 total spent
393 hires
192 jobs posted
100% hire rate, open job
4.87 of 210 reviews
IN India
Excellent
We are looking for an experienced developer (or team) to build an AI-powered video analytics software similar to [Veesion](https://veesion.com/).

The software should use computer vision and deep learning to detect anomalies and suspicious activities in real-time video streams.

**Key Responsibilities:**
- Develop and train AI models for real-time object detection and anomaly recognition.
- Integrate machine learning models with a video processing pipeline.
- Optimize performance for real-time analysis with minimal latency.
- Implement a web-based dashboard for monitoring and alerts.
- Ensure scalability and compatibility with multiple camera feeds.
- Maintain security and data privacy best practices.

**Required Skills & Experience:**
- Strong experience in AI/ML, particularly in computer vision (YOLO, OpenCV, TensorFlow, PyTorch).
- Proficiency in programming languages like Python and C++.
- Experience in handling real-time video processing.
- Knowledge of cloud computing (AWS, GCP, or Azure) for AI model deployment.
- Frontend/backend development skills for web-based monitoring tools (React, Node.js, Django, or Flask).
- Prior experience working on video analytics or surveillance systems is a plus.

**Additional Information:**
- The project may evolve into a long-term collaboration.
- Developers with past experience in anomaly detection or similar projects will be preferred.
- Please share links to your previous work or relevant case studies.

Looking forward to working with talented developers who can bring this vision to life!
Skills: Artificial Intelligence, Machine Learning, TensorFlow
Budget: not specified
1 hour ago
  • Web, Mobile & Software Dev, Web Development
3D Reconstruction of Indoor Environments from RGB-D Images
not specified 1 hour ago
Client Rank - Risky

Payment method not verified
no reviews
UZ Uzbekistan
Risky
I am in my final year of a BA degree in Software Engineering at New Uzbekistan University, and I am currently working on my graduation research paper. My topic is "3D Reconstruction of Indoor Environments from RGB-D Images."

My supervisor has advised me to take open-source research papers on this topic and experiment with different algorithms and methods to improve the reconstruction results. I am looking for someone who can assist me with this—someone who can explain the concepts, guide me through the implementation, and help me understand how everything works.
Skills: 3D Modeling, Computer Vision
Budget: not specified
1 hour ago
  • Engineering & Architecture, 3D Modeling & CAD
Help with back testing a trading strategy!
not specified 1 hour ago
Client Rank - Risky

Payment method not verified
no reviews
GB United Kingdom
Risky
Looking for a talented Quant/coder who is able to back test a variety of strategies based on us equities.
I have a manually back tested hours worth of data but looking for someone with the knowledge of modern API integration and coding that can fast track the back testing and give me accurate results and fine tune the strategy.

Must have knowledge and experience in the financial sector and familiar with stock trading strategies.
Skills: Artificial Neural Network, Convolutional Neural Network, Python, Deep Learning, Computer Vision, Data Science, Machine Learning, Quantitative Finance, Data Visualization, Financial Analysis, Automation, Stock Option Agreement, Microsoft Excel, Quantitative Analysis, Artificial Intelligence
Budget: not specified
1 hour ago
  • Web, Mobile & Software Dev, Scripts & Utilities
Vapi
not specified 21 minutes ago
Client Rank - Risky

Payment method not verified
no reviews
US United States
Risky
I’m looking for assistance with setting up my Vapi Cold Calling Service
Skills: Computer Vision, Machine Learning Model, Machine Learning, Natural Language Processing, Python, Docker, Product Development, Engineering & Architecture, Software Development, Artificial Intelligence, Data Visualization, GitHub, GitLab, Jupyter Notebook, Raspberry Pi
Budget: not specified
21 minutes ago
  • Web, Mobile & Software Dev, Scripts & Utilities
"Sensei" The King of Games
not specified 20 minutes ago
Client Rank - Medium

Payment method verified
no reviews
US United States
Medium
Proposal for Sensei Trading Card AI Development
Project Overview:
We are developing Sensei, an AI system designed to be the best trading card investor. It will collect and analyze trading card data (e.g., prices, sales trends) from sources like TCGPlayer and PriceCharting. Over time, Sensei will learn to predict which cards are good investments and automate buying/selling decisions.

Goals:
Collect Card Data:

Pull current price and sales data for cards from platforms like TCGPlayer and PriceCharting.

Collect historical data for the past year or more, such as price trends, volume of sales, and transaction history.

Analyze Data:

Use machine learning to predict future card prices and identify good investment opportunities.

Alert System:

Build an alert system that notifies when it’s a good time to buy or sell a card.

Scope of Work:
Data Collection:

Write scripts to gather data from PriceCharting and TCGPlayer APIs, focusing on both current prices and historical data for each product.

Historical data includes price trends over time, transaction volumes, and price variations.

Data Analysis:

Process the collected data and develop models to predict future card prices.

Alerts & Notifications:

Set up a basic alert system to notify users when there’s a good buy or sell opportunity.

Deliverables:
Working Data Scraper:

A script that pulls historical data and current pricing data from PriceCharting and TCGPlayer.

Predictive Model:

A basic model that predicts future card prices based on the historical data collected.

Alert System:

A simple alert system to notify users of opportunities.
Skills: Full-Stack Development, AI Agent Development, AI Chatbot, Machine Learning, AI Model Training, Automation, SaaS Development, AI Development, Computer Vision, AI Platform, Deep Learning, Data Analysis, OpenAI API, API Integration, AI App Development
Budget: not specified
20 minutes ago
  • Data Science & Analytics, AI & Machine Learning
Call to action
Freelancing is a business
Make it more profitable with Vollna

Streamline your Upwork workflow and boost your earnings with our smart job search and filtering tools. Find better clients and land more contracts.