MulticoreWare is a global software solutions & products company with its HQ in San Jose, CA, USA. With worldwide offices, it serves its clients and partners in North America, EMEA and APAC regions. Started by a group of researchers, MulticoreWare has grown to serve its clients and partners on HPC & Cloud computing, GPUs, Multicore & Multithread CPUS, DSPs, FPGAs and a variety of AI hardware accelerators.
MulticoreWare was founded by a team of researchers that wanted a better way to program for heterogeneous architectures. With the advent of GPUs and the increasing prevalence of multi-core, multi-architecture platforms, our clients were struggling with the difficulties of using these platforms efficiently.
We started as a boot-strapped services company and have since expanded our portfolio to span products and services related to compilers, machine learning, video codecs, image processing and augmented/virtual reality. Our hardware expertise has also expanded with our team; we now employ experts on HPC and Cloud Computing, GPUs, DSPs, FPGAs, and mobile and embedded platforms. We specialize in accelerating software and algorithms, so if your code targets a multi-core, heterogeneous platform, we can help.
Job Title: AI/ML Engineer (Computer Vision)
Job Location: Chennai
Job Type: Full-Time
Experience: 4+ years
Job Description
We’re looking for a high skilled AI/ML Engineer with strong grounding in deep learning and computer vision systems. The role focuses on building, optimizing, and deploying models that work reliably in real-world environments.
The role will involve data handling, model training with optimization and deployment on edge or production systems. If you also have exposure to NLP, that’s a plus, but the core expectation is solid computer vision expertise.
Responsibilities
- Design, train, and evaluate deep learning models for computer vision tasks (classification, detection, segmentation)
• Work with advance CNN architectures and optimize models for performance, latency, and memory (quantization, pruning, fp16, batching strategies) • Handle end-to-end pipelines including data preprocessing, augmentation, training, validation, and testing
- Port and deploy models across platforms (CPU, GPU, edge devices using ONNX, TensorRT, TFLite, etc.)
- Benchmark models and analyze trade-o s between accuracy, speed, and resource usage
- Debug model behavior and improve robustness in real-world scenarios to integrate models into production systems
- Maintain clear documentation of experiments, model versions, and performance metrics
- Stay updated with advancements in computer vision and applied ML
- Design, train, and evaluate deep learning models for computer vision tasks (classification, detection, segmentation)
• Work with advance CNN architectures and optimize models for performance, latency, and memory (quantization, pruning, fp16, batching strategies) • Handle end-to-end pipelines including data preprocessing, augmentation, training, validation, and testing
- Port and deploy models across platforms (CPU, GPU, edge devices using ONNX, TensorRT, TFLite, etc.)
- Benchmark models and analyze trade-o s between accuracy, speed, and resource usage
• Debug model behavior and improve robustness in real-world scenarios to integrate models into production systems • Maintain clear documentation of experiments, model versions, and performance metrics
- Stay updated with advancements in computer vision and applied ML
Requirements
Bachelor’s or Master’s degree in Computer Science, AI, Electrical Engineering, or related field
4+ years of hands-on experience in AI/ML, with strong focus on computer vision
Solid understanding of CNNs and deep learning fundamentals
Strong experience with frameworks like PyTorch or TensorFlow
Good programming skills in Python
Experience with model training, evaluation, and hyperparameter tuning
Practical experience in model optimization techniques (quantization, pruning, distillation)
Knowledge of edge AI constraints and deployment challenges
Experience in model deployment and porting (ONNX, TensorRT, OpenVINO, TFLite, etc.)
Familiarity with Linux environments and basic system-level understanding
Experience with version control systems like Git
Good to Have
Exposure to NLP concepts or transformer-based models
Experience working with multimodal models (vision + text)
Familiarity with CUDA or hardware acceleration techniques
Experience with MLOps practices (experiment tracking, CI/CD for models)
What We’re Looking For
Strong problem-solving mindset with attention to real-world constraints
Ability to go beyond model accuracy and think in terms of system performance
Clear communication and ability to work across teams