Discover Computer Vision Projects Efficiently

Leverage Vollna to streamline your search for "Computer Vision" projects on Upwork. Use advanced filters, get instant updates, and monitor your performance to boost success.
Signup for free to get access to all filter attributes and instant notifications when new jobs are posted.
Setup filter



Get access to over 30+ filter attributes, setup instant notifications, integrate with your CRM and marketing tools, and more.
Start free trial
322 projects published for past 72 hours.
Job Title Budget
AI Image Combination Expert Needed
500 USD 1 day ago
Client Rank - Medium

Payment method verified
Phone number verified
1 jobs posted
1 open job
no reviews
Registered: Apr 23, 2025
United States
United States
7:25 AM
3
We are seeking a talented freelancer to leverage AI technology to merge three distinct images into a cohesive single picture. The project requires creativity and a strong understanding of AI tools that can manipulate and combine visual elements effectively. The ideal candidate will have experience in image processing and a keen eye for detail. If you have the skills to integrate features from multiple images seamlessly, we would love to see your portfolio and discuss this exciting opportunity further.
Fixed budget: 500 USD
1 day ago
  • Data Science & Analytics, AI & Machine Learning
Medical Image Classification Expert
30 - 250 USD 1 day ago
Client Rank - Medium

Payment method verified
1 open job
no reviews
Registered: Nov 18, 2022
Pakistan
Pakistan
3
I need an expert in computer vision, particularly in medical image classification. The project will involve classifying radiology images, including X-ray, CT, and MRI scans.

Ideal skills for this project will include:
- Strong background in computer vision and image processing
- Experience working with medical images, especially Radiology Images (X-ray, CT, MRI)
- Proficiency in current image classification techniques

The goal is to accurately classify different types of radiology images. Therefore, a track record of projects with high precision in classifications tasks is desirable. Please be prepared to provide examples of your work in medical image classification.

Skills: Python, Machine Learning (ML), Artificial Intelligence, Computer Vision
Fixed budget: 30 - 250 USD
1 day ago
  • Websites, IT & Software, Engineering & Science, Python, Artificial Intelligence, Computer Vision, Machine Learning (ML)
Mobile App Developer (LiDAR + AI) for Property Scanning & Report Generation
1,000 USD 1 day ago
Client Rank - Good

Payment method verified
$3 404 total spent
7 hires, 2 active
47 jobs posted
15% hire rate, 2 open job
4.25 /hr avg hourly rate paid
719 hours paid
5.00 of 3 reviews
Company size: 10
Registered: Dec 11, 2017
United States
United States
Rowlett 8:25 AM
4
I'm looking to hire a skilled mobile app developer (or development team) to help build an advanced property inspection and reporting app. The core of the app involves using smartphone LiDAR (iOS/Android) to capture interior measurements and generate basic floor plans. Users will also take photos throughout the home, which the app will process to identify certain features and conditions. This data will then be used to generate a clean, professional report. Experience with ARKit, ARCore, or similar spatial scanning technologies is strongly preferred.

The backend will require AI/ML integration to analyze the captured data and images, provide structured recommendations, and estimate two types of values based on condition. This will be a multi-stage project, so I’m looking for someone who can help scope and prioritize the MVP — with potential to expand into e-commerce integration, valuation tools, and smart reporting features. If you have experience working on AR, AI/computer vision, and real estate or inspection apps, I’d love to discuss the project in more detail.
Fixed budget: 1,000 USD
1 day ago
  • Data Science & Analytics, AI & Machine Learning
AI Discussion Platform Development
750 - 1,500 USD 1 day ago
Client Rank - Medium

Payment method verified
1 open job
no reviews
Registered: May 14, 2024
Philippines
Philippines
3
I'm looking to build an interactive platform where students and scientists can engage in discussions about AI topics such as machine learning, NLP, and computer vision.

Key Features:
- Discussion forums for topic-based conversations
- Live chat for real-time interactions
- Content sharing for articles, papers, and resources
- User authentication and account management via email/password login
- Accessible on both web and mobile platforms

Ideal Skills & Experience:
- Experience in building discussion forums and chat functionalities
- Strong background in user authentication systems
- Proficient in developing responsive platforms for both web and mobile
- Familiarity with AI topics is a plus

Immediate start required. Please provide your portfolio and relevant experience in your bid.

Skills: Website Design, Machine Learning (ML), Web Development, Chatbot, Database Management
Fixed budget: 750 - 1,500 USD
1 day ago
  • Websites, IT & Software, Design, Media & Architecture, Engineering & Science, Business, Accounting, Human Resources & Legal, Web Development, Website Design, Machine Learning (ML), Chatbot, Database Management
Lidar and Floor Plan Specialist Needed for Vision Framework Project
2,000 USD 1 day ago
Client Rank - Excellent

Payment method verified
$383 373 total spent
98 hires, 31 active
159 jobs posted
62% hire rate, 2 open job
31.37 /hr avg hourly rate paid
9 659 hours paid
4.75 of 102 reviews
Company size: 100
Registered: Mar 23, 2008
United States
United States
Tampa 7:25 AM
5
We are seeking a skilled professional with expertise in Lidar technology to assist in creating detailed room and floor plans for our ongoing project. Your role will involve utilizing Lidar data to generate accurate representations of indoor spaces, ensuring high precision in the floor plan designs. Familiarity with vision frameworks is essential as you will integrate your outputs within our existing systems. If you have a strong background in architectural planning or spatial data analysis, we'd love to hear from you.
Fixed budget: 2,000 USD
1 day ago
  • Data Science & Analytics, AI & Machine Learning
GPT SaaS Platform (Arabic + Global)
not specified 1 day ago
Client Rank - Risky

Payment method not verified
Phone number verified
1 open job
no reviews
Registered: Nov 5, 2022
Türkiye
Turkey
2:25 PM
1
need to launch GPT SaaS Platform (Arabic + Global)
Budget: not specified
1 day ago
  • Data Science & Analytics, AI & Machine Learning
UI Path - Computer Vision Specialist needed URGENTLY
5 - 30 USD / hr
1 day ago
Client Rank - Medium

Payment method verified
Phone number verified
1 open job
United States
United States
West Lafayette 7:25 AM
3
I need to create a workflow that detects the coordinates of the specified ui element using UI path inbuild Computer Vision Library. example : find safari icon - program returns the coordinates of the safari icon on the screen.

Please respond ASAP if you can do this.

Thanks,
Eshaan
Hourly rate: 5 - 30 USD
1 day ago
  • Web, Mobile & Software Dev, Web & Mobile Design
AI-Powered Stock Counting from Videos
250 - 750 USD 1 day ago
Client Rank - Risky

Payment method not verified
1 open job
no reviews
Registered: May 21, 2024
Vietnam
Vietnam
1
I need help with a video analysis task. We have several MP4 videos showing stock keeping units (SKUs) being moved by a reach truck in a warehouse. Your job will be to count the number of boxes on each pallet.

Requirements:
- Extract 2 images (front and top views) of each pallet from the videos.
- Use an AI model to count the number of boxes at each pallet location. Note: Labels are on the shelf, not on the pallet.

Ideal Skills and Experience:
- Proficiency in video processing and image extraction.
- Experience with AI models for object detection or segmentation and counting.
- Familiarity with warehouse operations and pallet configurations.

Please provide a deadline with your bid.

Skills: Data Processing, Machine Learning (ML), Image Processing, Video Processing, Computer Vision, Deep Learning, Image Recognition, Image Analysis, AI Model Development, Object Detection
Fixed budget: 250 - 750 USD
1 day ago
  • Websites, IT & Software, Design, Media & Architecture, Data Entry & Admin, Engineering & Science, Computer Vision, Editing, Image Processing, Data Processing, Machine Learning (ML), Video Processing, Deep Learning, Image Recognition, Image Analysis, AI Model Development, Object Detection
Backend AI ML Engineer needed
not specified 1 day ago
Client Rank - Good

Payment method verified
$4 540 total spent
7 hires, 3 active
10 jobs posted
70% hire rate, 3 open job
21.63 /hr avg hourly rate paid
94 hours paid
5.00 of 4 reviews
Industry: Art & Design
Company size: 2
Registered: May 20, 2022
India
India
Bandra w Mumbai India 4:55 PM
4
Phenomenal AI is a pioneering text-to-video platform building state-of-the-art generative AI systems to convert natural language into dynamic, engaging video content. Our mission is to simplify storytelling at scale, enabling creators, educators, and enterprises to generate high-quality videos directly from text prompts.
We are a fast-growing team of engineers, researchers, and designers passionate about shaping the future of content creation using AI.

Job Description:
We are looking for a skilled ML/AI Backend Engineer with 2–3 years of experience to join our core development team. The ideal candidate will be responsible for building the backend infrastructure that supports large generative models and scales the delivery of AI-generated videos to thousands of users.

Key Responsibilities:
• Develop and maintain robust backend systems for large-scale generative AI models.
• Manage and process large datasets, including scraping, cleaning, and versioning.
• Design and optimize inference pipelines for transformer or diffusion-based models.
• Deploy and monitor AI models using cloud services (AWS, Azure, GCP).
• Collaborate closely with ML researchers to integrate and scale new models in production.
• Ensure security, reliability, and performance of backend services.
• Handle containerization and infrastructure-as-code setups (e.g., Docker, Terraform).

What We’re Looking For:
• 2–3 years of experience in backend development with exposure to AI/ML systems.
• Proficiency in Python and backend frameworks (e.g., FastAPI, Flask).
• Experience with cloud platforms like AWS, Azure, or GCP.
• Understanding of deploying and scaling ML models in production.
• Familiarity with MLOps practices is a strong plus.
• Experience working in fast-paced startup environments is a bonus.
Budget: not specified
1 day ago
  • Data Science & Analytics, AI & Machine Learning
AI ML Technical Lead for Text to Video Platform
not specified 1 day ago
Client Rank - Good

Payment method verified
$4 540 total spent
7 hires, 3 active
10 jobs posted
70% hire rate, 4 open job
21.63 /hr avg hourly rate paid
94 hours paid
5.00 of 4 reviews
Industry: Art & Design
Company size: 2
Registered: May 20, 2022
India
India
Bandra w Mumbai India 4:55 PM
4
About the Role
We are looking for a highly experienced and visionary AI/ML Technical Lead to spearhead the development of our text-to-video platform. You will be responsible for leading the design, architecture, development, and deployment of cutting-edge machine learning systems, with a focus on natural language processing (NLP), computer vision, and generative media models. This is a strategic leadership role where you'll work closely with product, research, and engineering teams to deliver a scalable and intelligent platform that transforms text into high-quality video content.

Key Responsibilities
Lead the end-to-end technical architecture for AI/ML systems powering our text-to-video platform.


Experience: 5+ years in AI/ML development, with at least 2 years in a technical leadership role.

-Design and implement ML pipelines for text analysis, video generation, voice synthesis, and scene composition.

-Collaborate with product managers and designers to align AI capabilities with business and user needs.

-Drive model training, fine-tuning, and evaluation of large language models (LLMs), diffusion models, transformers, and multimodal models.

-Build scalable infrastructure for inference, training, and experimentation (leveraging frameworks like PyTorch, TensorFlow, Ray, or Hugging Face).

-Mentor and manage a team of ML engineers, ensuring high code quality and reproducible experimentation.

-Stay updated with advancements in generative AI (e.g., Sora, Runway, Pika, OpenAI, Stability) and guide strategic technology adoption.

-Ensure the ethical use of AI and enforce content safety standards through responsible AI practices.

-Own performance optimization, latency reduction, and cost-efficient deployment of AI models in production.

-Work with backend and DevOps teams to scale AI systems to support large volumes of user-generated content.

Requirements
Must-Have:
Strong background in Machine Learning, Deep Learning, NLP, or Computer Vision.

Hands-on experience building and deploying generative models (e.g., GANs, VAEs, Diffusion Models, LLMs).

Expertise in Python and ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face Transformers).

Experience with prompt engineering and fine-tuning LLMs or multimodal models.

Deep understanding of data preprocessing, tokenization, embeddings, and video/audio synthesis.

Experience with cloud platforms (AWS, GCP, Azure) and distributed training frameworks.

Proven ability to lead technical teams and manage research-to-production pipelines.

Nice-to-Have:
Experience with video generation platforms, media pipelines (FFmpeg, WebRTC, etc.), or synthetic video datasets.

Familiarity with reinforcement learning, self-supervised learning, or retrieval-augmented generation (RAG).

Contributions to open-source ML libraries or published research in top-tier conferences (NeurIPS, CVPR, ICML, etc.).
Budget: not specified
1 day ago
  • Data Science & Analytics, AI & Machine Learning
Python Script for Stipple Art (Weighted Voronoi Dot Placement Based on Image Brightness)
20 USD 1 day ago
Client Rank - Risky

Payment method not verified
Phone number verified
1 open job
India
India
4:55 PM
1
I need a Python script that can generate stipple art (dot-based portrait) from an input image, using brightness values to control dot density.

🎯 Goal:
- Place more dots in *dark areas* (like hair), fewer in light areas (like face/skin)
- Dots must be *uniform in size* (0.9mm) — no variable dot sizes
- Dot placement should follow a *weighted Voronoi stippling* method (Lloyd’s relaxation or similar)
- The result must look balanced and *face must be clearly recognizable*
- Avoid equal dot density between hair and face

🖼 Input:
- PNG or JPG image
- Size: 1169 × 1190 px

📤 Output:
- PNG image with black dots on white background (no image behind)
- Or CSV file with (x, y) dot coordinates
- Adjustable: dot count (~15,000 -20,000) and image input

💻 Requirements:
- Pure Python script (preferred without GPU libraries)
- Efficient, clean code (usable on basic Windows laptop)
- Bonus: ability to preview result with optional GUI or save snapshots

📌 Reference:
I’m trying to achieve a similar result as shown here:
https://observablehq.com/@mbostock/voronoi-stippling

This is a good example of the output I’m aiming for:
[Attached: obama_stipple_reference.jpg]

💰 Budget: $18 – $24 USD
I'm open to negotiation depending on your experience and output quality.

📦 Files: I will provide a sample input image (sham.png) after contract starts.

If this sounds like your kind of project, I’d love to work with you!
1. Milestone 1: Basic Prototype (Preview with fixed image & dot placement)

Generate PNG with uniform dots

Density follows brightness

No background image — only dots on white
💵 Budget: $8



2. Milestone 2: Adjustable Parameters (dot count, input image)

Accept external images

Let user set number of dots

Optionally export to CSV
💵 Budget: $10-$12
Fixed budget: 20 USD
1 day ago
  • Web, Mobile & Software Dev, Scripts & Utilities
AI Developer for Bulk Background Removal from Product Photography
19 - 40 USD / hr
1 day ago
Client Rank - Medium

Payment method verified
Phone number verified
2 jobs posted
3 open job
no reviews
Industry: Retail & Consumer Goods
Company size: 10
Registered: Jun 18, 2025
United Kingdom
United Kingdom
12:25 PM
3
Responsibilities:
Develop or fine-tune an AI/ML model to identify and remove backgrounds from product images.

Support batch image processing (hundreds of images at once).

Ensure high-quality output with crisp, clean edges around products.

Optionally, integrate a simple interface or script for internal use.

Optimize for speed and accuracy.

Requirements:
Strong experience with computer vision, deep learning, and image segmentation.

Familiarity with tools like TensorFlow, PyTorch, OpenCV, or similar.

Previous work with background removal or semantic segmentation is a big plus.

Ability to work independently and deliver production-ready code.

Good communication and documentation skills.

Preferred Qualifications:

Familiarity with U-2-Net, MODNet, or other background-removal models.

Ability to suggest improvements and best practices for photo quality and lighting.

Project Type:
One-time project with potential for follow-up support or enhancement.

Deliverables:
A trained model and/or script for batch background removal.

Documentation for running the process on our internal machines.

Sample results on a subset of our photos.


To Apply:
Please share:

Examples of similar projects you’ve worked on.

Which frameworks/tools you'd use for this task.

Estimated timeline for delivery.
Hourly rate: 19 - 40 USD
1 day ago
  • Data Science & Analytics, AI & Machine Learning
Vision AI Engineer
18 - 25 USD / hr
1 day ago
Client Rank - Risky

Payment method not verified
Phone number verified
1 open job
India
India
4:55 PM
1
We are seeking a reliable and proactive freelancer based in Japan to support a cutting-edge Vision AI project focused on real-world data collection. This work is critical to building large-scale training datasets for next-generation AI models.

This is a hands-on, field-based freelance role where AI meets the streets — ideal for individuals with a technical background who can manage edge systems independently and respond quickly to operational issues.

Responsibilities
* Oversee and monitor the Vision AI edge system daily (approx. 5–6 hours/day)
* Ensure uninterrupted and effective data collection
* Troubleshoot hardware and software issues in real time
* Maintain clear, simple logs and coordinate remotely with the technical team

Location:
On-site in Atsugi, Japan
(Applicants must currently reside in Japan)

Duration:
1 month (extendable to 2 months)

Start Date: Immediate

Who We’re Looking For
* Based in Japan and available for daily on-site work
* Background in engineering, robotics, computer science, or related fields
* Strong problem-solving skills and hands-on experience with hardware/software systems
* Reliable, detail-oriented, and able to work independently with remote guidance

Why Join Us?
* Work at the intersection of AI and real-world deployment
* Contribute to the development of next-gen computer vision datasets
* Collaborate with a global team working on advanced AI applications

If you're excited about contributing to meaningful AI innovation on the ground — we want to hear from you.
Hourly rate: 18 - 25 USD
1 day ago
  • Data Science & Analytics, AI & Machine Learning
AI Data Labeling for Object Detection
30 - 250 USD 1 day ago
Client Rank - Risky

Payment method not verified
1 open job
no reviews
Registered: Nov 10, 2009
China
China
1
Label piano or drawing pictures.
The ideal candidate will be a female, live in Vietnam, Laos or Myanmar and familiar with oral or written Chinese.

Skills: Data Processing, Data Entry, Machine Learning (ML), Data Science, Computer Vision, Data Management, Machine Learning Algorithms, AI Model Development, AI Research, Object Detection
Fixed budget: 30 - 250 USD
1 day ago
  • Websites, IT & Software, Data Entry & Admin, Engineering & Science, Computer Vision, Editing, Data Processing, Data Entry, Machine Learning (ML), Data Science, Data Management, Machine Learning Algorithms, AI Model Development, AI Research, Object Detection
Generative Video AI Expert for Watch Advertisement
10 - 30 USD / hr
1 day ago
Client Rank - Risky

Payment method verified
Phone number verified
$412 total spent
2 hires
6 jobs posted
33% hire rate, 1 open job
7.91 /hr avg hourly rate paid
46 hours paid
2.92 of 3 reviews
Registered: Oct 7, 2024
South Korea
Korea, Republic of
Paju 7:25 PM
1
**Job Description:**

We are looking for a Generative Video AI Expert to create innovative advertisements for our luxury watch collection. This role combines cutting-edge technology with creative storytelling to shape our brand's visual identity.

**Key Responsibilities:**
1. Develop high-quality video advertisements using generative AI technologies, transforming static frames into captivating sequences.
2. Collaborate with the creative team to ensure brand consistency and manage the entire video creation process, from concept to final editing.
3. Stay updated on generative AI tools and techniques to innovate our advertising efforts.

**Qualifications:**
1. Proven experience in generative video creation with a strong portfolio.
2. Strong understanding of visual storytelling, cinematography, and post-production.
3. Ability to translate brand concepts into engaging video narratives and collaborate effectively.
4. Passion for innovation in AI advertising.

**Portfolio Requirements:**
Your portfolio should showcase:
1. Creative storytelling with a narrative developed through generative video.
2. Visual excellence in aesthetics and high-fidelity content.
3. Technical proficiency with generative AI tools, including project descriptions.
4. Originality and innovation in pushing the boundaries of generative AI.

We want to see your best work!
Client's questions:
  • Describe your recent experience with similar projects
  • Describe your typical design process and methods
  • Where do you get inspiration from?
Hourly rate: 10 - 30 USD
1 day ago
  • Design & Creative, Video & Animation
Fiber-Reinforced Composite Image Segmentation
~7 - 17 USD 1 day ago
Client Rank - Medium

Payment method verified
1 open job
no reviews
Registered: Mar 23, 2024
United Kingdom
United Kingdom
3
Project Objective: Segmenting Voids and Resin-Rich Areas in Fiber-Reinforced Composites

Goal
I am working with microscopy images of carbon fiber resin composites. As a beginner in image processing and MATLAB (or ImageJ), my main goal is to accurately segment voids and resin-rich areas in the composite microstructure through a clear, step-by-step, beginner-friendly workflow.

End Objective
To process and segment the image into two distinct regions:

Resin-rich areas

Voids

Tasks to Perform in ImageJ OR Matlab
1. Image Preprocessing
Load the microscopy image.

Apply basic preprocessing techniques such as:

Noise reduction

Contrast enhancement or histogram equalization

2. Segmentation
Apply image processing techniques such as:

Thresholding (manual or adaptive)

Clustering (e.g., k-means)

Morphological operations (e.g., opening, closing)

Distinguish voids from resin-rich regions based on pixel intensity and texture

Requirements
The solution must be beginner-friendly and clearly explained.

Include step-by-step MATLAB code or ImageJ macro, with comments to help understand the logic.

Make the code or workflow modular and easy to adapt to different image sets.

Skills: Matlab and Mathematica, Mathematics, Image Processing, Computer Vision, MATLAB, Image Analysis
Fixed budget: 600 - 1,500 INR
1 day ago
  • Websites, IT & Software, Design, Media & Architecture, Engineering & Science, Computer Vision, Editing, Image Processing, Matlab and Mathematica, Mathematics, MATLAB, Image Analysis
Recherche experts en automatisation & IA pour missions récurrentes (Make, agents vocaux, IA gen)
not specified 1 day ago
Client Rank - Risky

Payment method not verified
Phone number verified
1 open job
no reviews
Registered: Sep 25, 2023
Switzerland
Switzerland
1:25 PM
1
Bonjour,
Je suis étudiant en informatique et à côté j’ai lancé une agence spécialisée dans l’optimisation des PME à travers l’automatisation et l’IA.
Nous aidons nos clients à gagner du temps, réduire leurs coûts et automatiser leurs processus répétitifs (emails, CRM, facturation, service client, etc.).

Nous recherchons actuellement des prestataires de confiance pour rejoindre notre réseau de partenaires sur des missions régulières.

🔧 Compétences recherchées (un ou plusieurs domaines) :
• Automatisation avec Make (Integromat), Zapier, n8n, etc.
• Création d’agents IA (chat ou vocal) : Voiceflow, Twilio, Whisper, ElevenLabs, OpenAI, etc.
• Utilisation de ChatGPT / GPT-4 / Claude dans des workflows métiers
• Intégrations API no-code ou low-code
• Création de dashboards et outils internes (Airtable, Notion, Retool, Coda…)
• Web scraping / automatisation de prospection
• Expertise en IA générative (textes, emails, résumés, classification, etc.)

💼 Ce que je cherche :
• Des freelances fiables, proactifs, avec de bonnes compétences en communication
• Capables de livrer rapidement des solutions fonctionnelles, simples et robustes
• Ouverts à une collaboration sur le long terme, en marque blanche

⏱️Exemples de missions :
• Automatiser le tri de mails pour une PME
• Créer un agent vocal qui appelle des clients pour obtenir des infos
• Mettre en place un système de suivi client automatisé
• Implémenter un bot qui résume des appels ou des documents

👉 Si tu es intéressé, merci de répondre à ce message avec :
1. Ton expérience pertinente (liens, portfolio, cas concrets)
2. Tes disponibilités
3. Ton TJM (tarif journalier moyen)
4. Et ce que tu préfères automatiser ou construire :)

À très bientôt,
Luc
Budget: not specified
1 day ago
  • Web, Mobile & Software Dev, Scripts & Utilities
BSL Video Generation System Training
~27 - 336 USD 23 hours ago
Client Rank - Excellent

Payment method verified
$34 619 total spent
41 hires
3 open job
5.00 of 24 reviews
Registered: Mar 13, 2019
United Kingdom
United Kingdom
5
I need a system trained to generate accurate BSL videos from text input. The videos should feature a person signing BSL and will be used for educational content.

The training data should come from:
- YouTube
- Signworld
- bslsignbank.ucl.ac.uk
- many other sources I can send

The main outcomes I'm looking for are:
- Accurate translation of text to BSL videos
- Fast processing time

Ideal Skills & Experience:
- Experience with video generation models (e.g., Veo3)
- Familiarity with BSL and its educational content
- Proficiency in data sourcing and training machine learning models
- Ability to ensure high accuracy and efficiency in video output

Please provide examples of relevant work.

Skills: Video Services, Video Production, Video Editing, Video Processing, Computer Vision, Deep Learning, Natural Language Processing, AI Text-to-video, AI Model Development, AI Development
Fixed budget: 20 - 250 GBP
23 hours ago
  • Websites, IT & Software, Design, Media & Architecture, Engineering & Science, Computer Vision, Editing, Video Services, Video Production, Video Editing, Video Processing, Deep Learning, Natural Language Processing, AI Text-to-video, AI Model Development, AI Development
Senior Full Stack Engineer with Relevant experience with AI Computer Vision
25 USD / hr
23 hours ago
Client Rank - Medium

Payment method verified
Phone number verified
7 jobs posted
7 open job
no reviews
Industry: Energy & Utilities
Company size: 2
Registered: Jul 15, 2025
Poland
Poland
1:25 PM
3
Position Senior Full Stack Engineer with Relevant experience with AI Computer Vision.
Must to have 5 years of totaly experience
Preference for profiles between 5 and 10 years of total experience
Must to have Senior experience with React
Must to have Senior experience with Node.js
Must to have Senior experience with Nest.js
Must to have relevant experience with Fabric.js
Must to have relevant experience with TypeScript
Must to have relevant experience with Python
Must to have relevant experience with PostgreSQL
Must to have relevant experience with AWS S3 / Python-based AWS Lambda
Must to have relevant experience with Computer vision with Python
Must to have relevant experience with Computer vision with geometry, canvas transformations, image recognition of engineering drawings
Must to have relevant experience with mathematical concepts and coordinate systems
Must to have relevant experience with OpenCV
Must to have relevant experience with ClaudeAI
Must to have relevant experience with GitHub


Good level of english and good communication
CET Time zone
100 % remote
ASAP
Long term period
Must to have included the Github profile

Project
Functioning SaaS product in the manufacturing industry,
Opportunity to get hands-on experience with AI in the computer vision field, having the most variate challenges
Technologies, Programming Languages, and Frameworks:
The project utilized TypeScript and Python across different layers.
The frontend was built using ReactJS, Typescript, with Material UI for component styling and FabricJS for advanced canvas interactions.
The backend APIs were developed using NestJS with TypeScript and PostgreSQL for structured data storage.
For AI and image recognition tasks, we used Python-based AWS Lambda functions.
Tech Stack Overview:
- Frontend: ReactJS + Material UI + FabricJS (for dynamic drawing, annotations, and dimension editing)
- Backend: NestJS (TypeScript), PostgreSQL- AI/Computer Vision Layer: Python-based AWS Lambda functions- Infrastructure: AWS-native (S3 for file storage, Lambda for computer
AI & Computer Vision Approaches:
The AI component involves image recognition of engineering drawings and extracting dimension tolerances from visual data.
A combination of OpenCV for image preprocessing and prompt-driven inference (via ClaudeAI) for interpreting annotations.
The architecture supports both automated and manual workflows for annotation verification.
AI/ML Libraries & Tools:
OpenCV for core image processing tasks, and in some flows, the ClaudeAI API was leveraged to extract semantic meaning from structured or semi-structured drawing annotations
RESPONSABILITIES
Designing a high-precision, user-friendly canvas tool with FabricJS
Building a modular, scalable backend with NestJS and clean architecture principles
Working with cross-disciplinary challenges at the intersection of mechanical engineering and AI
Delivering a tightly integrated workflow between manual annotation and AI-assisted recognition
The codebase is large and deeply interconnected, with individual frontend and backend files running into thousands of lines and covering numerous conditions and edge cases. Needs to add task boards or written acceptance criterias,
The project involves advanced geometry, canvas transformations, and image recognition, which required a solid grasp of mathematical concepts and coordinate systems.
on a computer vision-heavy project at this level must to have interpret what the code was doing, particularly in areas involving tolerance extraction, bounding boxes, and annotation mapping.

PLEASE attach complete matrix and your cv to your application, thank you!!
Hourly rate: 25 USD
23 hours ago
  • Web, Mobile & Software Dev, Web Development
Educational Robot for Interactive Learning
~2,904 - 5,807 USD 22 hours ago
Client Rank - Risky

Payment method not verified
3 open job
no reviews
Registered: Jul 17, 2025
India
India
1
Job Posting: Robotics Engineer for Educational AI Robot

Project Overview

We are seeking a skilled Robotics Engineer to design and develop a simple, user-friendly, and safe educational robot for interactive learning. The robot should engage users in dynamic, fun, and effective learning experiences across a broad age range. It will incorporate AI (e.g., ChatGPT , gemini and many such via API) for voice interaction and educational content delivery, running on affordable hardware .

Key Requirements

Develop a Simple Educational Robot: Build a minimal, cost-effective robot focused on interactive learning, with basic movement (e.g., wheels screens), voice interaction, and educational features.

User-Friendly and Safe: Ensure the robot is intuitive to use and safe for users of all ages (e.g., rounded edges, low-power components, and simple controls).

Incorporate Engagement Features: Add interactive elements like images on screen using AI (e.g., AI API for natural language processing).

Adaptable Design: Create a robot that can be used in various learning environments (e.g., classrooms, homes, or workshops) with modular or customizable features.

API Integration: Integrate AI APIs (e.g., ChatGPT) for real-time voice interaction and educational content delivery.

Responsibilities

Design and prototype a simple robot chassis using CAD software (e.g., Fusion 360 or SolidWorks).

Program a Raspberry Pi (or similar microcontroller) for robot control, movement, and AI integration.

Integrate sensors (e.g., ultrasonic or basic camera) for basic environmental interaction or navigation.

Implement voice input/output using microphones and speakers, connected to AI APIs for interactive learning.

Ensure safety features (e.g., emergency stop, durable materials) and user-friendly controls (e.g., mobile app or simple buttons).


Test and iterate the robot to ensure reliability and educational value in diverse settings.

Ideal Skills and Experience

Robotics Engineering:

Proficiency in CAD design for simple robot chassis (e.g., Fusion 360, SolidWorks).

Experience with motor control (e.g., DC motors or servos) for basic movement.

Knowledge of sensor integration (e.g., ultrasonic sensors, good cameras) for environmental awareness.

Embedded Systems:

Expertise in programming Raspberry Pi or any microcontrolar using a any coding language

Familiarity with Robot Operating System (ROS) or similar frameworks for robot control.

Experience with GPIO programming for sensor and actuator integration.

API Integration:

Strong experience with ChatGPT API or similar NLP APIs for voice-based interaction.

Ability to integrate and optimize AI models for real-time educational content delivery.

Educational Technology:

Experience designing interactive learning tools or edtech products.

Understanding of interactive learning principles (e.g., gamification, adaptive learning).

Safety and User Experience:

Skills in designing safe, durable, and user-friendly robotic systems for diverse age groups.

Familiarity with UX principles for intuitive controls (e.g., mobile app or voice commands).

Additional Skills (preferred):

Knowledge of lightweight AI model deployment (e.g., LLaMA) on edge devices like Raspberry Pi.

Experience with audio processing for voice input/output.

Basic computer vision (e.g., OpenCV) for visual engagement features (if applicable).

Preferred Qualifications

Proven experience building autonomous or educational robots (e.g., Raspberry Pi-based projects).

Portfolio showcasing robotics or edtech projects, ideally with AI integration.

Familiarity with safety standards for educational products (e.g., CE or UL compliance).

Strong communication skills to collaborate on project goals and provide updates.

Application Process

Please submit:


A resume or portfolio highlighting relevant robotics, edtech, or AI integration projects.



Examples of similar projects (e.g., autonomous robots, voice-controlled systems).



A brief explanation of your approach to building a simple, safe, and engaging educational robot.


Familiarity with the referenced YouTube video (https://www.youtube.com/watch?v=e-nbSGRFP4Q&t=15s) is a plus.

Budget and Timeline

Budget: Flexible, based on experience (250k-500k inr).

Timeline: Prototype within 6 months, with iterative testing.

We’re excited to collaborate with a passionate Robotics Engineer to create an innovative educational tool that makes learning fun and effective!

Skills: Mechanical Engineering, Robotics, Software Development, Artificial Intelligence, Embedded Systems, Robotic Process Automation, Prototyping, Mechanical Design, Robot Operating System (ROS), AI Development
Fixed budget: 250,000 - 500,000 INR
22 hours ago
  • Websites, IT & Software, Engineering & Science, Software Development, Artificial Intelligence, Editing, Mechanical Engineering, Robotics, Embedded Systems, Robotic Process Automation, Prototyping, Mechanical Design, Robot Operating System (ROS), AI Development
AI/ML Web and Mobile Developer Needed
45 - 70 USD / hr
22 hours ago
Client Rank - Risky

Payment method not verified
Phone number verified
1 open job
United States
United States
6:25 AM
1
We’re looking for a freelancer or agency with experience in AI/ML, web, and mobile app development to help us build a smart, user-friendly product. This project involves integrating machine learning features (e.g., recommendations, computer vision, NLP, or predictive analytics) into modern web and mobile platforms.

✅ Key Responsibilities:
Design and implement AI/ML models for real-world use cases
Build or support development of web and mobile interfaces (React, Flutter, or similar)
Develop RESTful APIs or backend services to serve AI results
Ensure smooth deployment and integration of ML models in apps
Collaborate on UI/UX logic that aligns with the AI features
Optimize performance across platforms (web & mobile)

🎯 Ideal Skills and Experience:
Strong proficiency in Python and ML frameworks (e.g. TensorFlow, PyTorch, scikit-learn)
Experience with React.js, Next.js, or Vue.js for web development
Experience with Flutter, React Native, or Swift/Kotlin for mobile apps
Backend knowledge: Node.js, Django, FastAPI, or similar
Familiarity with cloud deployment (AWS/GCP/Azure) and Docker
Experience connecting AI pipelines to real-time or user-facing applications
Clear, proactive communication and documentation

💼 Project Scope:
Project duration: ~6–12 weeks (potential for ongoing work)
Deliverables: Functional AI-powered web/mobile app (MVP stage)
Collaboration: Weekly updates and task tracking via Trello, Notion, or Jira
Timezone overlap: Preferred but not required

💬 To Apply:
Please include:
A brief summary of relevant AI, web, and mobile projects you’ve done
Links to portfolio, GitHub, or live apps (if available)
Your availability and estimated hourly or fixed rate
Any questions or suggestions you have for our project
Hourly rate: 45 - 70 USD
22 hours ago
  • Data Science & Analytics, AI & Machine Learning
Computer Vision Specialist for Droplet Freezing Analysis in Video
30 - 60 USD / hr
21 hours ago
Client Rank - Good

Payment method verified
$1 117 total spent
4 hires, 3 active
4 jobs posted
100% hire rate, 1 open job
24.26 /hr avg hourly rate paid
43 hours paid
5.00 of 2 reviews
Registered: Feb 18, 2021
Canada
Canada
Yellowknife 1:25 PM
4
We are seeking an experienced computer vision and data analysis expert to develop a script that automates the detection of freezing events for an array of water droplets from video recordings. The goal is to determine the precise temperature at which each individual droplet freezes.

We have video footage of an array of small droplets on a surface that is being cooled. As the temperature drops, the droplets freeze, causing a distinct change in their appearance from transparent to opaque. We also have a corresponding data log file that records the temperature at specific video frame intervals.

Project Description & Workflow:

The required script should perform the following sequence of tasks:

Droplet Detection & Tracking: The code must first identify and locate each individual droplet in the video frames. The droplets are arranged in a grid-like pattern, which should simplify detection. It should be able to track each droplet throughout the video.

Freezing Event Detection: For each droplet, the script needs to monitor its appearance frame by frame. The key indicator of freezing is a significant change in opacity. The droplet will transition from being clear (showing the metallic surface underneath) to a dark, opaque solid. The code must accurately detect the exact frame in which this transition occurs for each droplet.

Data Correlation: Once the freezing frame for a droplet is identified, the script must look up the corresponding timestamp or frame number in our provided data log file (e.g., a CSV file).

Temperature Extraction: From the data log, the script will extract the temperature associated with that specific frame.

Output: The final output should be a structured data file (e.g., CSV) that lists each droplet (e.g., by its coordinates or an assigned ID) and its corresponding freezing temperature.


Relevant Skills:
- Computer Vision
- Image Processing
- Algorithm Development
- Video Analysis
- Programming (Python, OpenCV, etc.)
- Data Analysis
Hourly rate: 30 - 60 USD
21 hours ago
  • Data Science & Analytics, AI & Machine Learning
Computer Vision and 3d reconstruction
40,000 USD 14 hours ago
Client Rank - Risky

Payment method not verified
1 jobs posted
1 open job
no reviews
Company size: 10
Registered: Jan 6, 2019
United States
United States
8:25 AM
1
We are looking for expertise in Stereo Vision, IPC, Structure from Motion, Scene Graphs etc. - using both classical and deep learning based approaches. C++ and Python programming skills are required. Familiarity with deep learning using PyTorch is important. The work will require exploring existing frameworks and understanding recent developments. Good written english skills are necessary.

The goal is to develop software modules that use commonly available hardware (Livox/Ouster/Realsense etc.) and a combination of edge and cloud based compute (e.g. Jetson + AWS) to produce 3D models in real-time at a minimal computational cost.

This is a remote positionRequirements:

**- Bachelor's degree in Computer Science, Electrical Engineering, or related field; Master's degree preferred.- Proven experience working with computer vision libraries such as OpenCV, TensorFlow, or PyTorch.- Strong programming skills in languages like Python, C++, or MATLAB.- Familiarity with 3D reconstruction techniques like structure-from-motion (SfM), stereo matching, point cloud processing.- Experience with deep learning frameworks for image analysis tasks is a plus.**

Qualifications:

**- Excellent problem-solving abilities and analytical thinking skills.- Ability to work independently as well as part of a team in a fast-paced environment.- Strong communication skills to collaborate effectively with colleagues from diverse backgrounds.
Fixed budget: 40,000 USD
14 hours ago
  • Engineering & Architecture, 3D Modeling & CAD
AI/Computer Vision Developer for Real-Time Image Data Measurement Tool
19 - 40 USD / hr
12 hours ago
Client Rank - Medium

Payment method verified
Phone number verified
2 jobs posted
50% hire rate, 2 open job
no reviews
Registered: Jun 17, 2025
United States
United States
Los Angeles 6:25 AM
3
I'm seeking an experienced AI or computer vision developer to help in building a web-based tool capable of measuring and extracting precise measurement data from real-time images or video.

The ideal candidate will have a strong background in image processing and machine learning, as well as experience in developing applications that handle and analyze live video or webcam feeds.

Your work will directly support the creation of a high-accuracy measurement system, similar in concept to pupillary distance scanners used by eyewear apps like Warby Parker.

Ideal skills:
Facial landmark detection (OpenCV, MediaPipe, Dlib, etc.)
Depth estimation or 3D facial modeling
Experience with precision measurement from 2D/3D input
Bonus: background in beauty tech, biometrics, or mobile camera scanning tools
Client's questions:
  • Please list any certifications related to this project
  • Describe your recent experience with similar projects
Hourly rate: 19 - 40 USD
12 hours ago
  • Web, Mobile & Software Dev, AI Apps & Integration
Fullstack Developer for Cutting-Edge AI Applications
25 - 45 USD / hr
12 hours ago
Client Rank - Excellent

Payment method verified
$10 286 total spent
12 hires, 5 active
21 jobs posted
57% hire rate, 3 open job
18.03 /hr avg hourly rate paid
415 hours paid
4.98 of 6 reviews
Industry: Tech & IT
Company size: 2
Registered: Apr 8, 2022
United States
United States
San Fransisco 4:25 AM
5
Hello,

We are a San Francisco based Startup looking for an experienced Full-Stack Developer to join our team and help build the world's most cutting-edge AI applications. The ideal candidate will have expertise in both frontend and backend development and experience integrating AI/ML models into web applications.

Opportunity to grow into full-time role with potential for visa for USA.

Responsibilities:
- Design, develop, and maintain scalable web applications with AI-driven functionalities.
- Collaborate with ex-AI researchers.
- Develop responsive and intuitive front-end interfaces using modern frameworks.
- Implement secure, efficient, and scalable backend services and APIs.
- Optimize applications for performance, reliability, and scalability.
- Conduct testing and debugging to ensure smooth functionality.
- Stay updated with the latest AI, ML, and full-stack development trends.

Requirements:
- Proven experience as a Full-Stack Developer (3+ years preferred).
- Strong proficiency in frontend technologies such as React.js, Vue.js, or Angular.
- Expertise in backend development with Node.js, Django, Flask, or Ruby on Rails.
- Experience working with AI/ML libraries and frameworks (OpenAI API, etc.).
- Proficiency in cloud services such as AWS, GCP, or Azure.
- Experience with databases like PostgreSQL, MySQL, or MongoDB.
- Knowledge of API design, authentication, and security best practices.
- Strong problem-solving and debugging skills.
- Excellent communication and collaboration skills.

Nice to Have:
- Experience with DevOps practices and CI/CD pipelines.
- Background in Natural Language Processing (NLP) or Computer Vision.
- Familiarity with MLOps and deploying AI models into production.

How to Apply:
If you're passionate about building AI-powered applications and have the required skill set, please submit your proposal with:
- Your resume and portfolio showcasing relevant projects.
- A brief cover letter explaining your experience and approach to AI integration.
- Your availability and hourly/fixed-rate expectations.

We look forward to working with talented developers who want to push the boundaries of AI applications.
P.S. Put "SFAI" in your the first row of your application if you read this.
Hourly rate: 25 - 45 USD
12 hours ago
  • Web, Mobile & Software Dev, Web Development
CV Developer for 2D to 3D Conversion using Python/OpenCV
25,000 USD 11 hours ago
Client Rank - Risky

Payment method not verified
Phone number verified
1 open job
Canada
Canada
8:25 AM
1
We are seeking a skilled CV Developer to convert 2D images into 3D models using Python and OpenCV. The ideal candidate will have a strong background in computer vision and experience with 3D modeling. You will work closely with our team to ensure high-quality outputs and timely delivery. This project is expected to last for four months, with a budget in the range of $15,000 to $25,000. If you are passionate about computer vision and ready to tackle challenging tasks, we would love to hear from you!
Client's questions:
  • Describe a complex computer vision project you've completed. What made it technically challenging?
  • What's your experience with document processing and extracting structured data from 2D images?
  • Have you worked on projects involving 3D coordinate generation or mesh creation? Describe your approach.
  • How do you typically approach a computer vision problem where you need high accuracy on varied input data?
  • Rate your experience level (1-10) with: OpenCV, NumPy, 3D file formats, and Python optimization techniques.
Fixed budget: 25,000 USD
11 hours ago
  • Design & Creative, Video & Animation
CVAT Annotation Expert Needed for Blueprint Symbol Annotation
10 - 25 USD / hr
9 hours ago
Client Rank - Good

Payment method verified
Phone number verified
$4 580 total spent
9 hires, 2 active
8 jobs posted
100% hire rate, 1 open job
32.89 /hr avg hourly rate paid
106 hours paid
5.00 of 8 reviews
Registered: Dec 23, 2024
United States
United States
Davenport, IA, 52803 6:25 AM
4
We are seeking a CVAT Annotation Expert to assist with annotating blueprint symbols from a collection of commercial blueprints for an important machine learning project. The ideal candidate will have experience in computer vision annotation using CVAT and an understanding of architectural or engineering blueprints. Your contributions will be essential in training our machine learning models effectively. If you are detail-oriented and have a passion for data annotation, we would love to hear from you!
Hourly rate: 10 - 25 USD
9 hours ago
  • Admin Support, Data Entry & Transcription Services
Image Classification and Detection Models
80 USD 9 hours ago
Client Rank - Medium

Payment method verified
Phone number verified
1 open job
Pakistan
Pakistan
4:25 PM
3
I need computer vision models trained for image classification and detection using real-world datasets..

Key Requirements:
- Train model on specified datasets.
- Classify and detect various objects/features.
- Ensure high accuracy and reliability.
- Flask API development for trained models.
- Explainable AI (XAI)

Ideal Skills & Experience:
- Expertise in computer vision, especially with training models.
- Strong background in handling and processing real-world and microscopic images.
- Proficiency in relevant programming languages and frameworks (e.g., Python, TensorFlow, PyTorch).
- Experience in model evaluation and optimization.

Please provide a portfolio showcasing relevant projects and experience.
Fixed budget: 80 USD
9 hours ago
  • Data Science & Analytics, AI & Machine Learning
Computer Vision Developer – Real-Time Firearm Detection with YOLO & OpenCV
not specified 1 hour ago
Client Rank - Good

Payment method verified
Phone number verified
$7 120 total spent
10 hires, 9 active
18 jobs posted
56% hire rate, 7 open job
19.99 /hr avg hourly rate paid
359 hours paid
4.67 of 2 reviews
Industry: Sales & Marketing
Company size: 2
Registered: May 15, 2025
Colombia
Colombia
Bogotá 5:25 AM
4
We’re looking for a computer vision developer to help build an AI-powered surveillance system that can detect firearms, track individuals in real time, and trigger automated responses through DJI drones.

The goal is to process video feeds from CCTV or drones, identify when a firearm is present, follow the subject, and send alerts with tracking data to external systems or law enforcement dispatch centers. This system will eventually connect to DJI Matrice drones using the Dock system.

Scope of work includes:
- Integrating or fine-tuning a pre-trained firearm detection model (YOLOv8, Roboflow, or similar)
- Connecting to RTSP or RTMP camera feeds
- Overlaying bounding boxes and detection confidence on live video
- Adding real-time tracking of individuals once a firearm is detected
- Outputting position data (pixel or GPS) to be used by a drone or mapping system
- Triggering DJI drone responses via SDK (FlightHub 2 or OSDK)
- Sending detection alerts via webhook or REST API
- (Optional) Building a lightweight dashboard that shows alerts and map-based tracking

Ideal candidate should have:
- Strong experience with Python, OpenCV, and PyTorch or TensorFlow
- Familiarity with YOLOv5 or YOLOv8, Ultralytics, or Roboflow pipelines
- Experience working with RTSP/RTMP streams
- Experience with object tracking (DeepSORT, ByteTrack, or similar)
- Knowledge of DJI SDKs and drone integration is a big plus
- Background in surveillance, security, or similar AI applications is preferred

We’re aiming to complete the first build in next few weeks. There is potential for long-term work as the system evolves.

If this sounds like something you’ve built before, please apply with a few examples of related work.
Budget: not specified
1 hour ago
  • Data Science & Analytics, AI & Machine Learning
Video Processing Optimization Expert
30 - 250 USD 50 minutes ago
Client Rank - Good

Payment method verified
$2 865 total spent
5 hires
1 open job
5.00 of 4 reviews
Registered: Dec 29, 2016
Pakistan
Pakistan
4
I'm looking for a computer vision expert to optimize a video processing script. The primary goal is to increase speed.

Key focus areas include:
- Optimise video processing (right now it takes 3 minutes to process 1 second video, should get it down to almost 30 second)
- Algorithm efficiency

The current code is written in Python.

Ideal skills and experience:
- Proficiency in Python
- Strong background in computer vision
- Experience with video processing and optimization techniques
- Familiarity with handling frame rates and improving algorithm efficiency
- Expert in multi-processing / multi-threading
- Expert in openCV
- Expert in AI models like Roboflow, Yolo, Kmeans and other models.

Please provide examples of similar work done.

Skills: Python, Matlab and Mathematica, Software Architecture, Machine Learning (ML), Image Processing, OpenCV, Video Processing, Computer Vision, Deep Learning, YOLO
Fixed budget: 30 - 250 USD
50 minutes ago
  • Websites, IT & Software, Design, Media & Architecture, Engineering & Science, Python, Software Architecture, OpenCV, Computer Vision, Editing, Image Processing, Matlab and Mathematica, Machine Learning (ML), Video Processing, Deep Learning, YOLO
Expert Needed for Watermarking and Computer Vision Projects
30 - 60 USD / hr
40 minutes ago
Client Rank - Excellent

Payment method verified
Phone number verified
$10 750 total spent
13 hires, 5 active
23 jobs posted
57% hire rate, 1 open job
63.53 /hr avg hourly rate paid
71 hours paid
4.92 of 8 reviews
Industry: Tech & IT
Company size: 2
Registered: Aug 14, 2023
France
France
Paris 1:25 PM
5
We are seeking an expert in watermarking, forensics, and computer vision to assist with specialized projects. Your role will involve developing, implementing, and analyzing watermarking techniques to protect digital content. You should have a strong background in forensic analysis and the ability to apply computer vision methodologies to enhance our current systems. If you have a proven track record in these areas and are passionate about digital rights management, we want to hear from you!
Hourly rate: 30 - 60 USD
40 minutes ago
  • Data Science & Analytics, AI & Machine Learning
Call to action
Freelancing is a business
Make it more profitable with Vollna

Streamline your Upwork workflow and boost your earnings with our smart job search and filtering tools. Find better clients and land more contracts.