shinalok/youlbot - youlbot - [SLCloud] Gitea: Git with a cup of tea

shinalok/youlbot

T

shinalok 68f741af72 Phase 17: Multimodal image understanding via analyze_image tool

Dual-model approach (C): Qwen3-8B handles conversation, Qwen2.5-VL-7B
analyzes images on demand via analyze_image LangChain tool.

- services/model/mlx_vision_model.py: MlxVisionModel (mlx-vlm wrapper, lazy load)
- services/agent/tools.py: make_vision_tool(vision_model, image_path)
- agent_service.py: stream_response(image_path=None), dynamic tool binding
  via config["image_path"] — thread-safe per-request rebinding
- container.py: vision_model Singleton provider
- config.py: vision_enabled, vision_model_id, vision_max_tokens
- api.py: image_base64 in ChatRequest, decode to temp file, cleanup after stream

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-06-02 13:52:10 +09:00

- **Bootstrap IoC-based architecture with modular services.**

2026-04-25 01:14:37 +09:00

- **Bootstrap IoC-based architecture with modular services.**

2026-04-25 01:14:37 +09:00

Phase 17: Multimodal image understanding via analyze_image tool

2026-06-02 13:52:10 +09:00

Fix RAGAS eval: increase timeout for local LLM, safe score extraction

2026-06-01 19:41:32 +09:00

Phase 17: Multimodal image understanding via analyze_image tool

2026-06-02 13:52:10 +09:00

.env.example

Implement Phase 22: REST API (FastAPI + SSE streaming)

2026-05-29 20:11:49 +09:00

.gitignore

- **Bootstrap IoC-based architecture with modular services.**

2026-04-25 01:14:37 +09:00

api.py

Phase 17: Multimodal image understanding via analyze_image tool

2026-06-02 13:52:10 +09:00

app.py

Implement Phase 22: REST API (FastAPI + SSE streaming)

2026-05-29 20:11:49 +09:00

chat.py

- **Bootstrap IoC-based architecture with modular services.**

2026-04-25 01:14:37 +09:00

CLAUDE.md

init project

2026-04-23 18:00:36 +09:00

config.py

Phase 17: Multimodal image understanding via analyze_image tool

2026-06-02 13:52:10 +09:00

container.py

Phase 17: Multimodal image understanding via analyze_image tool

2026-06-02 13:52:10 +09:00

ingest.py

Implement Phase 4~14: LangGraph Agent, RAG pipeline, Gradio Web UI, voice interface

2026-05-27 14:06:22 +09:00

main.py

Implement Phase 4~14: LangGraph Agent, RAG pipeline, Gradio Web UI, voice interface

2026-05-27 14:06:22 +09:00

requirements.txt

Implement Phase 22: REST API (FastAPI + SSE streaming)

2026-05-29 20:11:49 +09:00

youlbot.iml

init project

2026-04-23 18:00:36 +09:00

사고과정.png

Tag metadata tokens as {\"__meta\"} to separate TTS from progress messages

2026-05-31 23:08:14 +09:00