Caffe: a fast open framework for deep learning.
-
Updated
Jul 31, 2024 - C++
Caffe: a fast open framework for deep learning.
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)【安全加固,暂停交互,请耐心等待】
Automate browser-based workflows with LLMs and Computer Vision
build ai agents that have the full context, open source, runs locally, developer friendly. 24/7 screen, mic, keyboard recording and control
📸 A powerful, high-performance React Native Camera library.
One UI is all done with chatgpt web, midjourney, gpts,suno,luma,runway,viggle,flux,ideogram,realtime,pika,udio; Simultaneous support Web / PWA / Linux / Win / MacOS platform
TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Multimodal Live API, OpenAI Realtime API, RTC, and more. It offers real-time capabilities to see, hear, and speak, along with advanced tools like weather checks, web search, and RAG.
Open source hardware and software platform to build a small scale self driving car.
The Open Source Framework for Machine Vision
⬆️ Media Capture in Swift
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Videos, notes and experiments to understand deep learning
[Deprecated] 🇨🇳中国二代身份证光学识别
🌝 MLKit是一个强大易用的工具包。通过ML Kit您可以很轻松的实现文字识别、条码识别、图像标记、人脸检测、对象检测等功能。
Add a description, image, and links to the vision topic page so that developers can more easily learn about it.
To associate your repository with the vision topic, visit your repo's landing page and select "manage topics."