youlbot

Author	SHA1	Message	Date
shinalok	c061aef220	Separate thinking tokens from meta — use __thinking key for display stream_response() now yields {"__thinking": str} for thinking content instead of {"__meta": str}. Removed [사고 과정]/[/사고 과정] marker yields; consumers handle thinking display independently. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-01 10:33:06 +09:00
shinalok	67821250fd	Tag metadata tokens as {\"__meta\"} to separate TTS from progress messages stream_response() now yields plain str for actual answer tokens and {\"__meta\": str} dicts for progress/thinking/source metadata. Consumers (WebUI, Telegram) can filter __meta tokens for TTS/display. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 23:08:14 +09:00
shinalok	e4c56a9b6c	Implement Phase 19: Query Rewriting via LangGraph node - query_rewrite 노드 추가 (agent → query_rewrite → tools 순서) - route_after_agent: search_documents 호출 시에만 query_rewrite 라우팅, 그 외 직접 tools - tools_condition(prebuilt) 제거 → 커스텀 라우팅 함수로 대체 - query_rewrite_node: 구어체 쿼리를 키워드 중심 문장으로 변환 - 이전 대화 2턴 컨텍스트로 대명사·지시어 해소 - enable_thinking=False 바인딩으로 불필요한 사고 과정 제거 - __query_rewrite 커스텀 이벤트 emit → RAG_VERBOSE 시 변환 결과 출력 - QUERY_REWRITE_ENABLED=true 로 활성화 (기본값 false) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 17:55:13 +09:00
shinalok	145b0cc96f	Implement Phase 12 feedback, Phase 13 Semantic Chunker, Phase 13-B Reranker, Bug 5 thinking fix - Phase 12: FeedbackRepository + td_feedback 테이블, Gradio 👍/👎 이벤트, run_id 추적, LangSmith create_feedback() 연동 - Phase 13: 커스텀 _SemanticSplitter 제거 → langchain_experimental.SemanticChunker 교체, buffer_size/threshold_type 환경변수 적용 - Phase 13-B: RerankService (Cross-Encoder), RetrieverService.search()에 reranker 통합, tools.py as_retriever() → search() 전환 - Bug 5: mlx_chat_model enable_thinking 런타임 오버라이드, agent_service stream_mode=["messages","custom"] 이중 스트림, thinking 토큰 custom 이벤트로 emit - ROADMAP: LLM 모델명 8B 반영, RAG에 Reranker 추가, 추천 진행 순서 갱신 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 17:41:36 +09:00
shinalok	b4b628ab78	Fix age calculation: inject today's date, add Korean/international age - Prepend today's date to system prompt on every call so LLM uses correct year - Calculate both Korean age (현재연도-출생연도+1) and 만 나이 with exact birthday handling - Support full date (생년월일) and year-only (생년) profile values - Update remember_user_info to encourage storing full birth date - Strengthen get_current_date tool description for age-related queries Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-27 15:10:30 +09:00
shinalok	06bcdb03ac	Implement Phase 4~14: LangGraph Agent, RAG pipeline, Gradio Web UI, voice interface - Upgrade LLM to Qwen3-14B-4bit with Thinking mode (MlxChatModel as LangChain BaseChatModel) - Add LangGraph ReAct agent with tool calling loop (search_documents, web_search, get_current_date, remember/recall_user_info) - Add RAG pipeline: BAAI/bge-m3 embeddings + Qdrant vector store + semantic chunking (SemanticSplitter via cosine similarity) - Replace fixed-size RecursiveCharacterTextSplitter with meaning-based SemanticSplitter (numpy only, no extra deps) - Add Gradio Web UI (app.py): chat, document ingestion, document management tabs - Add multi-user support (user_id isolation in DB + per-user agent cache + dropdown selector) - Add conversation history restore from MySQL on agent init (Phase 11) - Add UserProfileRepository for persistent user profile (remember/recall tools) - Add thread-local DB connections to fix pymysql thread-safety with LangGraph ToolNode - Add Phase 14 voice interface: Whisper STT (microphone → text) + macOS TTS (say -v Yuna) - Enforce search_documents-first policy in system prompt and tool descriptions - Update ROADMAP2.md: Phase 14 완료, Phase 13 청킹 부분 완료 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-27 14:06:22 +09:00

6 Commits