OCR and Document Search Web Application
-
Updated
Nov 24, 2024 - Jupyter Notebook
OCR and Document Search Web Application
"Smart Vision Technology for Quality Control" uses computer vision to automate product inspections, extracting details like product name, quantity, expiry date, and freshness from images. Built for Flipkart Grid 6.0, it enhances accuracy and efficiency in quality control, minimizing manual checks.
基于多模态大模型的智能搜索助手,通过AI技术实现小红书平台的智能化信息检索和知识整合
This repo is for Amazon ML Challenge 2024. The challenge was to develop a Machine Learning model to extract product details directly from the product images.
Colaboratory上でQwenLM/Qwen2-VLをお試しするサンプル
practical projects using LLM, VLM and Diffusion models
This project demonstrates how to use the Qwen2-VL model from Hugging Face for Optical Character Recognition (OCR) and Visual Question Answering (VQA). The model combines vision and language capabilities, enabling users to analyze images and generate context-based responses.
Running Large Language Model easily.
A Challenging Multi-Modal Mathematical Reasoning Benchmark
An open-source server implementation for inference Qwen2-VL series model using fastapi.
Qwen2-VL在文旅领域的LLaMA-Factory微调案例 The case for fine-tuning Qwen2-VL in the field of historical literature and museums
A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.
An open-source implementaion for fine-tuning Qwen2-VL series by Alibaba Cloud.
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Add a description, image, and links to the qwen2-vl topic page so that developers can more easily learn about it.
To associate your repository with the qwen2-vl topic, visit your repo's landing page and select "manage topics."