feat: group chat session lifecycle, typing recovery, mention highlighting (#186)
* feat: restore group chat system with Socket.IO and SQLite persistence - GroupChatServer: Socket.IO server with room management, message history, typing indicators - SQLite storage for rooms, messages, and agent configuration - AgentClients: manages AI agent connections via socket.io-client, forwards @mentions to Hermes gateway - REST API: room CRUD, agent management, invite codes - Agent auto-restoration on server restart - Tests for all REST endpoints Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: add context-engine design document for group chat compression Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: handle special-character session search * fix: keep unicode dotted session search on quoted FTS path * feat: add context engine and group chat frontend UI - Context engine: three-zone compression (head/tail/summary) with LLM summarization, incremental updates, TTL cache, and graceful degradation - Frontend: group chat page with Socket.IO client, room sidebar, message list, agent/member display, create/join-by-code modals - Integration: wire context engine into agent-clients before /v1/runs - Refactor ChatStorage to use global DB (getDb/ensureTable) with gc_ prefix - Add i18n keys for group chat to all 8 locales - Add sidebar nav entry and router for group chat page Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: remove leftover main branch code from merge conflict resolution The `isNumericQuery`, `hasUnsafeChars`, and `runLikeContentSearch` functions no longer exist — they were replaced by HEAD's `shouldUseLiteralContentSearch` and `runLiteralContentSearch`. This dead code block caused a TypeScript compile error after the merge. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: install missing socket.io dep and type ack params Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: enable WebSocket proxy and fix socket.io transport for group chat - Add ws: true to Vite proxy config so WebSocket upgrade requests are forwarded to the backend - Allow both polling and websocket transports on server and client (polling as fallback when WebSocket upgrade fails through proxy) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: separate socket.io path from REST routes for group chat socket.io was mounted at /api/hermes/group-chat which intercepted all REST requests to /api/hermes/group-chat/rooms etc, returning "Transport unknown". Changed socket.io path to /api/hermes/group-chat/ws to avoid conflicts. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: improve group chat UI, agent management, and socket.io reliability - Redesign GroupChatPanel with Naive UI, stacked agent avatars, and popover management - Match GroupChatInput style with single chat input, add IME composition handling - Add agent add/remove per room with profile selection and duplicate prevention - Use @multiavatar for SVG avatar generation with caching - Decouple joinRoom from socket.io, use REST API for data loading - Switch socket.io to default path with /group-chat namespace to avoid proxy conflicts - Restore agent connections after server is listening - Add getRoomDetail REST endpoint and duplicate agent prevention (409) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: server-side @mention routing with context compression status and queue - Move @mention detection from agent socket listeners to server-side processMentions() - Add per-room processing lock to block mention dispatch during compression - Queue mentions during processing, drain only the latest when ready - Emit context_status events (compressing/replying/ready) to room via Socket.IO - Frontend displays compression status indicator above input - Token-based compression trigger (100k threshold) with CJK-aware estimation - Fix compressor type errors (countTokens parameter type) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: improve group chat profile handling and session sync Refine group chat room/session behavior with per-room compression controls, sidebar updates, and better stale session cleanup so multi-profile group chat state stays consistent. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: group chat improvements — session lifecycle, typing recovery, mention highlighting - Fix cross-profile session deletion with deferred delete queue - Move saveSessionProfile to after gateway response confirmation - Replace all console.log with logger in group-chat modules - Add server-side typing/context_status state tracking for room rejoin - Fix @ mention popup position to follow cursor - Add @ mention highlighting (blue) in chat message content - Fix mention regex to match all occurrences after HTML tags - Enable esbuild minify and treeShaking - Move @multiavatar/multiavatar to devDependencies - Add i18n keys for group chat features - Update tests for new functionality Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: bump version to 0.4.5 and move @multiavatar to devDependencies Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Zhicheng Han <zhicheng.han@mathematik.uni-goettingen.de>
This commit is contained in:
@@ -3,6 +3,7 @@ import { mkdir, writeFile } from 'fs/promises'
|
||||
import { basename, join } from 'path'
|
||||
import { tmpdir } from 'os'
|
||||
import * as hermesCli from '../../services/hermes/hermes-cli'
|
||||
import { drainPendingSessionDeletes } from '../../services/hermes/group-chat'
|
||||
import { getGatewayManagerInstance } from '../../services/gateway-bootstrap'
|
||||
import { logger } from '../../services/logger'
|
||||
|
||||
@@ -118,7 +119,17 @@ export async function switchProfile(ctx: any) {
|
||||
} catch (err: any) {
|
||||
logger.error(err, 'Ensure config failed')
|
||||
}
|
||||
ctx.body = { success: true, message: output.trim() }
|
||||
const drainResult = await drainPendingSessionDeletes(name)
|
||||
logger.info('[switchProfile] drain result for profile "%s": %d deleted, %d failed', name, drainResult.deleted.length, drainResult.failed.length)
|
||||
if (drainResult.failed.length > 0) {
|
||||
logger.warn({ profile: name, failed: drainResult.failed }, 'Failed to drain some pending session deletes after profile switch')
|
||||
}
|
||||
ctx.body = {
|
||||
success: true,
|
||||
message: output.trim(),
|
||||
drained_session_deletes: drainResult.deleted.length,
|
||||
failed_session_deletes: drainResult.failed.length,
|
||||
}
|
||||
} catch (err: any) {
|
||||
ctx.status = 500
|
||||
ctx.body = { error: err.message }
|
||||
|
||||
@@ -7,6 +7,9 @@ import {
|
||||
import { listSessionSummaries, searchSessionSummaries } from '../../db/hermes/sessions-db'
|
||||
import { deleteUsage, getUsage, getUsageBatch } from '../../db/hermes/usage-store'
|
||||
import { getModelContextLength } from '../../services/hermes/model-context'
|
||||
import type { ConversationDetail, ConversationSummary } from '../../services/hermes/conversations'
|
||||
import { getActiveProfileName } from '../../services/hermes/hermes-profile'
|
||||
import { getGroupChatServer } from '../../routes/hermes/group-chat'
|
||||
import { logger } from '../../services/logger'
|
||||
|
||||
function parseHumanOnly(value: unknown): boolean {
|
||||
@@ -20,6 +23,35 @@ function parseLimit(value: unknown): number | undefined {
|
||||
return Number.isFinite(parsed) && parsed > 0 ? parsed : undefined
|
||||
}
|
||||
|
||||
function getPendingDeletedSessionIds(): Set<string> {
|
||||
return getGroupChatServer()?.getStorage().getPendingDeletedSessionIds() || new Set<string>()
|
||||
}
|
||||
|
||||
function isPendingDeletedSession(sessionId: string): boolean {
|
||||
return getPendingDeletedSessionIds().has(sessionId)
|
||||
}
|
||||
|
||||
function filterPendingDeletedSessions<T extends { id: string }>(items: T[]): T[] {
|
||||
const pendingIds = getPendingDeletedSessionIds()
|
||||
if (pendingIds.size === 0) return items
|
||||
return items.filter(item => !pendingIds.has(item.id))
|
||||
}
|
||||
|
||||
function filterPendingDeletedConversationSummaries(items: ConversationSummary[]): ConversationSummary[] {
|
||||
return filterPendingDeletedSessions(items)
|
||||
}
|
||||
|
||||
function hasPendingDeletedConversation(detail: ConversationDetail): boolean {
|
||||
const pendingIds = getPendingDeletedSessionIds()
|
||||
if (pendingIds.size === 0) return false
|
||||
if (pendingIds.has(detail.session_id)) return true
|
||||
return detail.messages.some(message => pendingIds.has(message.session_id))
|
||||
}
|
||||
|
||||
function getGroupChatStorage() {
|
||||
return getGroupChatServer()?.getStorage() || null
|
||||
}
|
||||
|
||||
export async function listConversations(ctx: any) {
|
||||
const source = (ctx.query.source as string) || undefined
|
||||
const humanOnly = parseHumanOnly(ctx.query.humanOnly)
|
||||
@@ -27,14 +59,14 @@ export async function listConversations(ctx: any) {
|
||||
|
||||
try {
|
||||
const sessions = await listConversationSummariesFromDb({ source, humanOnly, limit })
|
||||
ctx.body = { sessions }
|
||||
ctx.body = { sessions: filterPendingDeletedConversationSummaries(sessions) }
|
||||
return
|
||||
} catch (err) {
|
||||
logger.warn(err, 'Hermes Conversation DB: summary query failed, falling back to CLI export')
|
||||
}
|
||||
|
||||
const sessions = await listConversationSummaries({ source, humanOnly, limit })
|
||||
ctx.body = { sessions }
|
||||
ctx.body = { sessions: filterPendingDeletedConversationSummaries(sessions) }
|
||||
}
|
||||
|
||||
export async function getConversationMessages(ctx: any) {
|
||||
@@ -43,7 +75,7 @@ export async function getConversationMessages(ctx: any) {
|
||||
|
||||
try {
|
||||
const detail = await getConversationDetailFromDb(ctx.params.id, { source, humanOnly })
|
||||
if (!detail) {
|
||||
if (!detail || hasPendingDeletedConversation(detail)) {
|
||||
ctx.status = 404
|
||||
ctx.body = { error: 'Conversation not found' }
|
||||
return
|
||||
@@ -55,7 +87,7 @@ export async function getConversationMessages(ctx: any) {
|
||||
}
|
||||
|
||||
const detail = await getConversationDetail(ctx.params.id, { source, humanOnly })
|
||||
if (!detail) {
|
||||
if (!detail || hasPendingDeletedConversation(detail)) {
|
||||
ctx.status = 404
|
||||
ctx.body = { error: 'Conversation not found' }
|
||||
return
|
||||
@@ -69,14 +101,14 @@ export async function list(ctx: any) {
|
||||
|
||||
try {
|
||||
const sessions = await listSessionSummaries(source, limit && limit > 0 ? limit : 2000)
|
||||
ctx.body = { sessions }
|
||||
ctx.body = { sessions: filterPendingDeletedSessions(sessions) }
|
||||
return
|
||||
} catch (err) {
|
||||
logger.warn(err, 'Hermes Session DB: summary query failed, falling back to CLI')
|
||||
}
|
||||
|
||||
const sessions = await hermesCli.listSessions(source, limit)
|
||||
ctx.body = { sessions }
|
||||
ctx.body = { sessions: filterPendingDeletedSessions(sessions) }
|
||||
}
|
||||
|
||||
export async function search(ctx: any) {
|
||||
@@ -88,7 +120,7 @@ export async function search(ctx: any) {
|
||||
|
||||
try {
|
||||
const results = await searchSessionSummaries(q, source, limit && limit > 0 ? limit : 20)
|
||||
ctx.body = { results }
|
||||
ctx.body = { results: filterPendingDeletedSessions(results) }
|
||||
} catch (err) {
|
||||
logger.error(err, 'Hermes Session DB: search failed')
|
||||
ctx.status = 500
|
||||
@@ -97,6 +129,12 @@ export async function search(ctx: any) {
|
||||
}
|
||||
|
||||
export async function get(ctx: any) {
|
||||
if (isPendingDeletedSession(ctx.params.id)) {
|
||||
ctx.status = 404
|
||||
ctx.body = { error: 'Session not found' }
|
||||
return
|
||||
}
|
||||
|
||||
const session = await hermesCli.getSession(ctx.params.id)
|
||||
if (!session) {
|
||||
ctx.status = 404
|
||||
@@ -107,14 +145,44 @@ export async function get(ctx: any) {
|
||||
}
|
||||
|
||||
export async function remove(ctx: any) {
|
||||
const ok = await hermesCli.deleteSession(ctx.params.id)
|
||||
if (!ok) {
|
||||
ctx.status = 500
|
||||
ctx.body = { error: 'Failed to delete session' }
|
||||
const sessionId = ctx.params.id
|
||||
const storage = getGroupChatStorage()
|
||||
const currentProfile = getActiveProfileName()
|
||||
const mapped = storage?.getSessionProfile(sessionId) || null
|
||||
|
||||
logger.info('[remove] sessionId=%s, currentProfile=%s, mapped=%j', sessionId, currentProfile, mapped)
|
||||
|
||||
if (!mapped) {
|
||||
logger.info('[remove] no mapping found, deleting directly')
|
||||
const ok = await hermesCli.deleteSession(sessionId)
|
||||
if (!ok) {
|
||||
ctx.status = 500
|
||||
ctx.body = { error: 'Failed to delete session' }
|
||||
return
|
||||
}
|
||||
deleteUsage(sessionId)
|
||||
ctx.body = { ok: true }
|
||||
return
|
||||
}
|
||||
deleteUsage(ctx.params.id)
|
||||
ctx.body = { ok: true }
|
||||
|
||||
if (mapped.profile_name === currentProfile) {
|
||||
logger.info('[remove] same profile, deleting directly')
|
||||
const ok = await hermesCli.deleteSession(sessionId)
|
||||
if (!ok) {
|
||||
ctx.status = 500
|
||||
ctx.body = { error: 'Failed to delete session' }
|
||||
return
|
||||
}
|
||||
storage?.deleteSessionProfile(sessionId)
|
||||
deleteUsage(sessionId)
|
||||
ctx.body = { ok: true }
|
||||
return
|
||||
}
|
||||
|
||||
logger.info('[remove] cross-profile detected, enqueued deferred delete for profile=%s', mapped.profile_name)
|
||||
storage?.enqueuePendingSessionDelete(sessionId, mapped.profile_name)
|
||||
deleteUsage(sessionId)
|
||||
ctx.body = { ok: true, deferred: true }
|
||||
}
|
||||
|
||||
export async function usageBatch(ctx: any) {
|
||||
|
||||
@@ -159,20 +159,73 @@ function containsCjk(text: string): boolean {
|
||||
return false
|
||||
}
|
||||
|
||||
function isNumericQuery(text: string): boolean {
|
||||
return /^\d+(?:\s+\d+)*$/.test(text.trim())
|
||||
function escapeLikePattern(value: string): string {
|
||||
return value.replace(/[\\%_]/g, (match) => `\\${match}`)
|
||||
}
|
||||
|
||||
function hasUnsafeChars(text: string): boolean {
|
||||
return /[^\w\s\u4e00-\u9fff\u3400-\u4dbf\u3000-\u303f\u3040-\u309f\u30a0-\u30ff\uac00-\ud7af]/.test(text)
|
||||
function buildLikePattern(value: string): string {
|
||||
return `%${escapeLikePattern(value)}%`
|
||||
}
|
||||
|
||||
function runLikeContentSearch(
|
||||
function normalizeTitleLikeQuery(query: string): string {
|
||||
const tokens = query.match(/"[^"]*"\*?|\S+/g)
|
||||
if (!tokens) return query
|
||||
|
||||
const normalizedTokens = tokens
|
||||
.map((token) => {
|
||||
let value = token.endsWith('*') ? token.slice(0, -1) : token
|
||||
if (value.startsWith('"') && value.endsWith('"')) {
|
||||
value = value.slice(1, -1)
|
||||
}
|
||||
return value
|
||||
})
|
||||
.filter(Boolean)
|
||||
|
||||
return normalizedTokens.join(' ').trim() || query
|
||||
}
|
||||
|
||||
function shouldUseLiteralContentSearch(query: string): boolean {
|
||||
const trimmed = query.trim()
|
||||
if (!trimmed) return false
|
||||
if (/[^\p{L}\p{N}\s"*.-]/u.test(trimmed)) return true
|
||||
|
||||
const tokens = trimmed.match(/"[^"]*"\*?|\S+/g)
|
||||
if (!tokens) return true
|
||||
|
||||
for (const token of tokens) {
|
||||
if (/^(AND|OR|NOT)$/i.test(token)) continue
|
||||
|
||||
const raw = token.endsWith('*') ? token.slice(0, -1) : token
|
||||
if (!raw) return true
|
||||
|
||||
if (raw.startsWith('"') && raw.endsWith('"')) {
|
||||
const inner = raw.slice(1, -1)
|
||||
if (!inner.trim()) return true
|
||||
if (!/^[\p{L}\p{N}\s.-]+$/u.test(inner)) return true
|
||||
if ((inner.includes('.') || inner.includes('-')) && !/^[\p{L}\p{N}]+(?:[.-][\p{L}\p{N}]+)*(?:\s+[\p{L}\p{N}]+(?:[.-][\p{L}\p{N}]+)*)*$/u.test(inner)) return true
|
||||
continue
|
||||
}
|
||||
|
||||
if (raw.includes('.') || raw.includes('-')) {
|
||||
if (!/^[\p{L}\p{N}]+(?:[.-][\p{L}\p{N}]+)*$/u.test(raw)) return true
|
||||
continue
|
||||
}
|
||||
|
||||
if (!/^[\p{L}\p{N}]+$/u.test(raw)) return true
|
||||
}
|
||||
|
||||
return false
|
||||
}
|
||||
|
||||
function runLiteralContentSearch(
|
||||
db: { prepare: (sql: string) => { all: (...params: any[]) => Record<string, unknown>[] } },
|
||||
source: string | undefined,
|
||||
query: string,
|
||||
limit: number,
|
||||
): Record<string, unknown>[] {
|
||||
const likeBase = buildBaseSessionSql(source)
|
||||
const loweredQuery = query.toLowerCase()
|
||||
const likePattern = buildLikePattern(loweredQuery)
|
||||
const likeSql = `
|
||||
WITH base AS (
|
||||
${likeBase.sql}
|
||||
@@ -182,17 +235,17 @@ function runLikeContentSearch(
|
||||
m.id AS matched_message_id,
|
||||
substr(
|
||||
m.content,
|
||||
max(1, instr(m.content, ?) - 40),
|
||||
max(1, instr(LOWER(m.content), ?) - 40),
|
||||
120
|
||||
) AS snippet,
|
||||
0 AS rank
|
||||
FROM base
|
||||
JOIN messages m ON m.session_id = base.id
|
||||
WHERE m.content LIKE ?
|
||||
WHERE LOWER(m.content) LIKE ? ESCAPE '\\'
|
||||
ORDER BY base.last_active DESC, m.timestamp DESC
|
||||
LIMIT ?
|
||||
`
|
||||
const likeStatement = db.prepare(likeSql)
|
||||
return likeStatement.all(...likeBase.params, query, `%${query}%`) as Record<string, unknown>[]
|
||||
return db.prepare(likeSql).all(...likeBase.params, loweredQuery, likePattern, limit * 4) as Record<string, unknown>[]
|
||||
}
|
||||
|
||||
function sanitizeFtsQuery(query: string): string {
|
||||
@@ -208,7 +261,7 @@ function sanitizeFtsQuery(query: string): string {
|
||||
sanitized = sanitized.replace(/(^|\s)\*/g, '$1')
|
||||
sanitized = sanitized.trim().replace(/^(AND|OR|NOT)\b\s*/i, '')
|
||||
sanitized = sanitized.trim().replace(/\s+(AND|OR|NOT)\s*$/i, '')
|
||||
sanitized = sanitized.replace(/\b(\w+(?:[.-]\w+)+)\b/g, '"$1"')
|
||||
sanitized = sanitized.replace(/\b([\p{L}\p{N}]+(?:[.-][\p{L}\p{N}]+)+)\b/gu, '"$1"')
|
||||
|
||||
for (let i = 0; i < quotedParts.length; i += 1) {
|
||||
sanitized = sanitized.replace(`\u0000Q${i}\u0000`, quotedParts[i])
|
||||
@@ -218,7 +271,7 @@ function sanitizeFtsQuery(query: string): string {
|
||||
}
|
||||
|
||||
function toPrefixQuery(query: string): string {
|
||||
const tokens = query.match(/"[^"]*"|\S+/g)
|
||||
const tokens = query.match(/"[^"]*"\*?|\S+/g)
|
||||
if (!tokens) return ''
|
||||
return tokens
|
||||
.map((token) => {
|
||||
@@ -282,6 +335,8 @@ export async function searchSessionSummaries(
|
||||
const db = new DatabaseSync(sessionDbPath(), { open: true, readOnly: true })
|
||||
const normalized = sanitizeFtsQuery(trimmed)
|
||||
const prefixQuery = toPrefixQuery(normalized)
|
||||
const titlePattern = buildLikePattern(normalizeTitleLikeQuery(trimmed).toLowerCase())
|
||||
const useLiteralContentSearch = containsCjk(trimmed) || shouldUseLiteralContentSearch(trimmed)
|
||||
let titleRows: Record<string, unknown>[] = []
|
||||
|
||||
try {
|
||||
@@ -301,13 +356,13 @@ export async function searchSessionSummaries(
|
||||
END AS snippet,
|
||||
0 AS rank
|
||||
FROM base
|
||||
WHERE LOWER(COALESCE(base.title, '')) LIKE ?
|
||||
WHERE LOWER(COALESCE(base.title, '')) LIKE ? ESCAPE '\\'
|
||||
ORDER BY base.last_active DESC
|
||||
LIMIT ?
|
||||
`
|
||||
|
||||
const titleStatement = db.prepare(titleSql)
|
||||
titleRows = titleStatement.all(...titleBase.params, `%${trimmed.toLowerCase()}%`, limit) as Record<string, unknown>[]
|
||||
titleRows = titleStatement.all(...titleBase.params, titlePattern, limit) as Record<string, unknown>[]
|
||||
|
||||
const contentSql = `
|
||||
WITH base AS (
|
||||
@@ -326,9 +381,11 @@ export async function searchSessionSummaries(
|
||||
LIMIT ?
|
||||
`
|
||||
|
||||
const contentRows = prefixQuery
|
||||
? (db.prepare(contentSql).all(...contentBase.params, prefixQuery, limit * 4) as Record<string, unknown>[])
|
||||
: []
|
||||
const contentRows = useLiteralContentSearch
|
||||
? runLiteralContentSearch(db, source, trimmed, limit)
|
||||
: prefixQuery
|
||||
? (db.prepare(contentSql).all(...contentBase.params, prefixQuery, limit * 4) as Record<string, unknown>[])
|
||||
: []
|
||||
|
||||
const merged = new Map<string, HermesSessionSearchRow>()
|
||||
for (const row of titleRows) {
|
||||
@@ -351,19 +408,7 @@ export async function searchSessionSummaries(
|
||||
} catch (err) {
|
||||
const message = err instanceof Error ? err.message : String(err)
|
||||
if (containsCjk(normalized)) {
|
||||
const likeRows = runLikeContentSearch(db, source, trimmed)
|
||||
const merged = new Map<string, HermesSessionSearchRow>()
|
||||
for (const row of likeRows) {
|
||||
const mapped = mapSearchRow(row)
|
||||
if (!merged.has(mapped.id)) {
|
||||
merged.set(mapped.id, mapped)
|
||||
}
|
||||
}
|
||||
return [...merged.values()].slice(0, limit)
|
||||
}
|
||||
|
||||
if (isNumericQuery(trimmed) || hasUnsafeChars(trimmed)) {
|
||||
const likeRows = runLikeContentSearch(db, source, trimmed)
|
||||
const likeRows = runLiteralContentSearch(db, source, trimmed, limit)
|
||||
const merged = new Map<string, HermesSessionSearchRow>()
|
||||
for (const row of titleRows) {
|
||||
const mapped = mapSearchRow(row)
|
||||
@@ -375,7 +420,12 @@ export async function searchSessionSummaries(
|
||||
merged.set(mapped.id, mapped)
|
||||
}
|
||||
}
|
||||
return [...merged.values()].slice(0, limit)
|
||||
const items = [...merged.values()]
|
||||
items.sort((a, b) => {
|
||||
if (a.rank !== b.rank) return a.rank - b.rank
|
||||
return b.last_active - a.last_active
|
||||
})
|
||||
return items.slice(0, limit)
|
||||
}
|
||||
|
||||
throw new Error(`Failed to search sessions: ${message}`)
|
||||
|
||||
@@ -9,11 +9,13 @@ import { mkdir } from 'fs/promises'
|
||||
import { readFileSync } from 'fs'
|
||||
import { config } from './config'
|
||||
import { getToken, requireAuth } from './services/auth'
|
||||
import { initGatewayManager } from './services/gateway-bootstrap'
|
||||
import { initGatewayManager, getGatewayManagerInstance } from './services/gateway-bootstrap'
|
||||
import { bindShutdown } from './services/shutdown'
|
||||
import { setupTerminalWebSocket } from './routes/hermes/terminal'
|
||||
import { startVersionCheck } from './routes/health'
|
||||
import { registerRoutes } from './routes'
|
||||
import { setGroupChatServer } from './routes/hermes/group-chat'
|
||||
import { GroupChatServer } from './services/hermes/group-chat'
|
||||
import { logger } from './services/logger'
|
||||
|
||||
// Injected by esbuild at build time; fallback to reading package.json in dev mode
|
||||
@@ -85,6 +87,19 @@ export async function bootstrap() {
|
||||
setupTerminalWebSocket(server)
|
||||
console.log('[bootstrap] terminal websocket setup')
|
||||
|
||||
// Group chat Socket.IO (must be after server is created)
|
||||
const groupChatServer = new GroupChatServer(server)
|
||||
setGroupChatServer(groupChatServer)
|
||||
groupChatServer.setGatewayManager(getGatewayManagerInstance())
|
||||
|
||||
// Catch-all: destroy upgrade requests not handled by terminal or Socket.IO
|
||||
server.on('upgrade', (req: any, socket: any) => {
|
||||
const url = new URL(req.url || '', `http://${req.headers.host}`)
|
||||
if (url.pathname !== '/api/hermes/terminal' && !url.pathname.startsWith('/socket.io/')) {
|
||||
socket.destroy()
|
||||
}
|
||||
})
|
||||
|
||||
server.on('listening', () => {
|
||||
const interfaces = os.networkInterfaces()
|
||||
const localIp = Object.values(interfaces).flat().find(i => i?.family === 'IPv4' && !i?.internal)?.address || 'localhost'
|
||||
@@ -93,6 +108,9 @@ export async function bootstrap() {
|
||||
console.log(`Log: ~/.hermes-web-ui/logs/server.log`)
|
||||
logger.info('Server: http://localhost:%d (LAN: http://%s:%d)', config.port, localIp, config.port)
|
||||
logger.info('Upstream: %s', config.upstream)
|
||||
|
||||
// Restore group chat agents after server is ready
|
||||
groupChatServer.restoreWhenReady()
|
||||
})
|
||||
|
||||
server.on('error', (err: any) => {
|
||||
@@ -100,7 +118,7 @@ export async function bootstrap() {
|
||||
logger.error({ err }, 'Server error')
|
||||
})
|
||||
|
||||
bindShutdown(server)
|
||||
bindShutdown(server, groupChatServer)
|
||||
startVersionCheck()
|
||||
}
|
||||
|
||||
|
||||
@@ -0,0 +1,270 @@
|
||||
import Router from '@koa/router'
|
||||
import type { GroupChatServer } from '../../services/hermes/group-chat'
|
||||
|
||||
export const groupChatRoutes = new Router()
|
||||
|
||||
let chatServer: GroupChatServer | null = null
|
||||
|
||||
export function setGroupChatServer(server: GroupChatServer) {
|
||||
chatServer = server
|
||||
}
|
||||
|
||||
export function getGroupChatServer(): GroupChatServer | null {
|
||||
return chatServer
|
||||
}
|
||||
|
||||
function generateId(): string {
|
||||
return Date.now().toString(36) + Math.random().toString(36).slice(2, 8)
|
||||
}
|
||||
|
||||
// Create room
|
||||
groupChatRoutes.post('/api/hermes/group-chat/rooms', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const { name, inviteCode, agents, compression } = ctx.request.body as {
|
||||
name?: string
|
||||
inviteCode?: string
|
||||
agents?: { profile: string; name?: string; description?: string; invited?: boolean }[]
|
||||
compression?: { triggerTokens?: number; maxHistoryTokens?: number; tailMessageCount?: number }
|
||||
}
|
||||
if (!name || !inviteCode) {
|
||||
ctx.status = 400
|
||||
ctx.body = { error: 'name and inviteCode are required' }
|
||||
return
|
||||
}
|
||||
|
||||
const roomId = generateId()
|
||||
const storage = chatServer.getStorage()
|
||||
storage.saveRoom(roomId, name, inviteCode, compression)
|
||||
|
||||
// Save agents to DB and auto-connect via Socket.IO
|
||||
const addedAgents = []
|
||||
for (const a of agents || []) {
|
||||
const agentId = generateId()
|
||||
const agent = storage.addRoomAgent(roomId, agentId, a.profile, a.name || a.profile, a.description || '', a.invited ? 1 : 0)
|
||||
addedAgents.push(agent)
|
||||
|
||||
try {
|
||||
const client = await chatServer.agentClients.createAgent({
|
||||
profile: agent.profile,
|
||||
name: agent.name,
|
||||
description: agent.description,
|
||||
invited: agent.invited,
|
||||
})
|
||||
await chatServer.agentClients.addAgentToRoom(roomId, client)
|
||||
} catch (err: any) {
|
||||
console.error(`[GroupChat] Failed to connect agent ${a.profile} to room ${roomId}: ${err.message}`)
|
||||
}
|
||||
}
|
||||
|
||||
const room = storage.getRoom(roomId)
|
||||
ctx.body = { room, agents: addedAgents }
|
||||
})
|
||||
|
||||
// Get room detail and messages
|
||||
groupChatRoutes.get('/api/hermes/group-chat/rooms/:roomId', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const room = chatServer.getStorage().getRoom(ctx.params.roomId)
|
||||
if (!room) {
|
||||
ctx.status = 404
|
||||
ctx.body = { error: 'Room not found' }
|
||||
return
|
||||
}
|
||||
|
||||
const messages = chatServer.getStorage().getMessages(ctx.params.roomId)
|
||||
const agents = chatServer.getStorage().getRoomAgents(ctx.params.roomId)
|
||||
const members = chatServer.getStorage().getRoomMembers(ctx.params.roomId)
|
||||
ctx.body = { room, messages, agents, members }
|
||||
})
|
||||
|
||||
// List rooms
|
||||
groupChatRoutes.get('/api/hermes/group-chat/rooms', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const rooms = chatServer.getStorage().getAllRooms()
|
||||
ctx.body = { rooms }
|
||||
})
|
||||
|
||||
// Get room by invite code
|
||||
groupChatRoutes.get('/api/hermes/group-chat/rooms/join/:code', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const room = chatServer.getStorage().getRoomByInviteCode(ctx.params.code)
|
||||
if (!room) {
|
||||
ctx.status = 404
|
||||
ctx.body = { error: 'Room not found' }
|
||||
return
|
||||
}
|
||||
|
||||
ctx.body = { room }
|
||||
})
|
||||
|
||||
// Update room invite code
|
||||
groupChatRoutes.put('/api/hermes/group-chat/rooms/:roomId/invite-code', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const { inviteCode } = ctx.request.body as { inviteCode?: string }
|
||||
if (!inviteCode) {
|
||||
ctx.status = 400
|
||||
ctx.body = { error: 'inviteCode is required' }
|
||||
return
|
||||
}
|
||||
|
||||
chatServer.getStorage().updateRoomInviteCode(ctx.params.roomId, inviteCode)
|
||||
ctx.body = { success: true }
|
||||
})
|
||||
|
||||
// Add agent to room
|
||||
groupChatRoutes.post('/api/hermes/group-chat/rooms/:roomId/agents', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const { profile, name, description, invited } = ctx.request.body as { profile?: string; name?: string; description?: string; invited?: boolean }
|
||||
if (!profile) {
|
||||
ctx.status = 400
|
||||
ctx.body = { error: 'profile is required' }
|
||||
return
|
||||
}
|
||||
|
||||
// Prevent duplicate agent in same room
|
||||
const existing = chatServer.getStorage().getRoomAgents(ctx.params.roomId)
|
||||
if (existing.find(a => a.profile === profile)) {
|
||||
ctx.status = 409
|
||||
ctx.body = { error: 'Agent already in room' }
|
||||
return
|
||||
}
|
||||
|
||||
const agentId = generateId()
|
||||
const agent = chatServer.getStorage().addRoomAgent(ctx.params.roomId, agentId, profile, name || profile, description || '', invited ? 1 : 0)
|
||||
|
||||
// Auto-connect agent via Socket.IO
|
||||
try {
|
||||
const client = await chatServer.agentClients.createAgent({
|
||||
profile: agent.profile,
|
||||
name: agent.name,
|
||||
description: agent.description,
|
||||
invited: agent.invited,
|
||||
})
|
||||
await chatServer.agentClients.addAgentToRoom(ctx.params.roomId, client)
|
||||
} catch (err: any) {
|
||||
console.error(`[GroupChat] Failed to connect agent ${profile} to room ${ctx.params.roomId}: ${err.message}`)
|
||||
}
|
||||
|
||||
ctx.body = { agent }
|
||||
})
|
||||
|
||||
// List agents in room
|
||||
groupChatRoutes.get('/api/hermes/group-chat/rooms/:roomId/agents', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const agents = chatServer.getStorage().getRoomAgents(ctx.params.roomId)
|
||||
ctx.body = { agents }
|
||||
})
|
||||
|
||||
// Remove agent from room
|
||||
groupChatRoutes.delete('/api/hermes/group-chat/rooms/:roomId/agents/:agentId', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
chatServer.getStorage().removeRoomAgent(ctx.params.agentId)
|
||||
chatServer.agentClients.removeAgentFromRoom(ctx.params.roomId, ctx.params.agentId)
|
||||
ctx.body = { success: true }
|
||||
})
|
||||
|
||||
// Delete room
|
||||
groupChatRoutes.delete('/api/hermes/group-chat/rooms/:roomId', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const roomId = ctx.params.roomId
|
||||
// Disconnect all agents in room
|
||||
chatServer.agentClients.disconnectRoom(roomId)
|
||||
// Delete all data
|
||||
chatServer.getStorage().deleteRoom(roomId)
|
||||
ctx.body = { success: true }
|
||||
})
|
||||
|
||||
// Update room compression config
|
||||
groupChatRoutes.put('/api/hermes/group-chat/rooms/:roomId/config', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const roomId = ctx.params.roomId
|
||||
const { triggerTokens, maxHistoryTokens, tailMessageCount } = ctx.request.body as {
|
||||
triggerTokens?: number
|
||||
maxHistoryTokens?: number
|
||||
tailMessageCount?: number
|
||||
}
|
||||
|
||||
chatServer.getStorage().updateRoomConfig(roomId, { triggerTokens, maxHistoryTokens, tailMessageCount })
|
||||
const room = chatServer.getStorage().getRoom(roomId)
|
||||
ctx.body = { room }
|
||||
})
|
||||
|
||||
// Force compress a room's context
|
||||
groupChatRoutes.post('/api/hermes/group-chat/rooms/:roomId/compress', async (ctx) => {
|
||||
if (!chatServer) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Group chat not initialized' }
|
||||
return
|
||||
}
|
||||
|
||||
const roomId = ctx.params.roomId
|
||||
if (!chatServer.getStorage().getRoom(roomId)) {
|
||||
ctx.status = 404
|
||||
ctx.body = { error: 'Room not found' }
|
||||
return
|
||||
}
|
||||
|
||||
const engine = chatServer.getContextEngine()
|
||||
if (!engine) {
|
||||
ctx.status = 503
|
||||
ctx.body = { error: 'Context engine not available' }
|
||||
return
|
||||
}
|
||||
|
||||
try {
|
||||
const result = await engine.forceCompress(roomId)
|
||||
ctx.body = { success: true, summary: result }
|
||||
} catch (err: any) {
|
||||
ctx.status = 500
|
||||
ctx.body = { error: err.message }
|
||||
}
|
||||
})
|
||||
@@ -122,7 +122,6 @@ export function setupTerminalWebSocket(httpServer: HttpServer) {
|
||||
httpServer.on('upgrade', async (req, socket, head) => {
|
||||
const url = new URL(req.url || '', `http://${req.headers.host}`)
|
||||
if (url.pathname !== '/api/hermes/terminal') {
|
||||
socket.destroy()
|
||||
return
|
||||
}
|
||||
|
||||
|
||||
@@ -24,6 +24,7 @@ import { fileRoutes } from './hermes/files'
|
||||
import { downloadRoutes } from './hermes/download'
|
||||
import { jobRoutes } from './hermes/jobs'
|
||||
import { proxyRoutes, proxyMiddleware } from './hermes/proxy'
|
||||
import { groupChatRoutes, setGroupChatServer } from './hermes/group-chat'
|
||||
|
||||
/**
|
||||
* Register all routes on the Koa app.
|
||||
@@ -55,6 +56,7 @@ export function registerRoutes(app: any, requireAuth: (ctx: Context, next: Next)
|
||||
app.use(nousAuthRoutes.routes())
|
||||
app.use(gatewayRoutes.routes())
|
||||
app.use(weixinRoutes.routes())
|
||||
app.use(groupChatRoutes.routes()) // Must be before proxy
|
||||
app.use(fileRoutes.routes()) // Must be before proxy (proxy catch-all matches everything)
|
||||
app.use(downloadRoutes.routes()) // Must be before proxy
|
||||
app.use(jobRoutes.routes()) // Must be before proxy
|
||||
|
||||
@@ -0,0 +1,357 @@
|
||||
import type {
|
||||
StoredMessage,
|
||||
CompressionConfig,
|
||||
CompressedContext,
|
||||
BuildContextInput,
|
||||
MessageFetcher,
|
||||
GatewayCaller,
|
||||
SessionCleaner,
|
||||
} from './types'
|
||||
import { DEFAULT_COMPRESSION_CONFIG } from './types'
|
||||
import { GatewaySummarizer } from './gateway-client'
|
||||
import { buildAgentInstructions, buildSummarizationSystemPrompt } from './prompt'
|
||||
|
||||
export class ContextEngine {
|
||||
private config: CompressionConfig
|
||||
private messageFetcher: MessageFetcher
|
||||
private gatewayCaller: GatewayCaller
|
||||
/** Per-room compression lock to prevent concurrent snapshot overwrites */
|
||||
private _compressLocks = new Map<string, Promise<void>>()
|
||||
private _upstream = ''
|
||||
private _apiKey: string | null = null
|
||||
|
||||
constructor(opts: {
|
||||
config?: Partial<CompressionConfig>
|
||||
messageFetcher: MessageFetcher
|
||||
gatewayCaller?: GatewayCaller
|
||||
sessionCleaner?: SessionCleaner
|
||||
}) {
|
||||
this.config = { ...DEFAULT_COMPRESSION_CONFIG, ...opts.config }
|
||||
this.messageFetcher = opts.messageFetcher
|
||||
this.gatewayCaller = opts.gatewayCaller || new GatewaySummarizer(this.config.summarizationTimeoutMs)
|
||||
this.sessionCleaner = opts.sessionCleaner
|
||||
}
|
||||
|
||||
private sessionCleaner?: SessionCleaner
|
||||
|
||||
setUpstream(upstream: string, apiKey: string | null): void {
|
||||
this._upstream = upstream
|
||||
this._apiKey = apiKey
|
||||
}
|
||||
|
||||
/**
|
||||
* Build context for an agent reply.
|
||||
*
|
||||
* Flow:
|
||||
* 1. Read persisted snapshot (summary + lastMessageId) from SQLite
|
||||
* 2. If snapshot exists:
|
||||
* a. Collect new messages after lastMessageId
|
||||
* b. Estimate tokens = summary + new messages
|
||||
* c. Under threshold → return as-is
|
||||
* d. Over threshold → incremental compress, update snapshot, return
|
||||
* 3. If no snapshot:
|
||||
* a. Estimate tokens for all messages
|
||||
* b. Under threshold → return all verbatim
|
||||
* c. Over threshold → full compress, save snapshot, return
|
||||
*/
|
||||
async buildContext(input: BuildContextInput): Promise<CompressedContext> {
|
||||
// Serialize compression per room to prevent concurrent snapshot overwrites
|
||||
const existing = this._compressLocks.get(input.roomId)
|
||||
if (existing) {
|
||||
await existing
|
||||
}
|
||||
let resolveLock!: () => void
|
||||
const lock = new Promise<void>(r => { resolveLock = r })
|
||||
this._compressLocks.set(input.roomId, lock)
|
||||
try {
|
||||
return await this._buildContextImpl(input)
|
||||
} finally {
|
||||
resolveLock()
|
||||
this._compressLocks.delete(input.roomId)
|
||||
}
|
||||
}
|
||||
|
||||
private async _buildContextImpl(input: BuildContextInput): Promise<CompressedContext> {
|
||||
const config = { ...this.config, ...input.compression }
|
||||
const allMessages = this.messageFetcher.getMessages(input.roomId)
|
||||
// Filter out messages newer than the current one
|
||||
const messages = allMessages.filter(m => m.timestamp <= input.currentMessage.timestamp)
|
||||
const total = messages.length
|
||||
|
||||
console.log(`[ContextEngine] buildContext START — room=${input.roomId}, agent=${input.agentName}, totalMessagesInDb=${allMessages.length}, afterFilter=${total}`)
|
||||
|
||||
const instructions = buildAgentInstructions({
|
||||
agentName: input.agentName,
|
||||
roomName: input.roomName,
|
||||
agentDescription: input.agentDescription,
|
||||
memberNames: input.memberNames,
|
||||
members: input.members,
|
||||
})
|
||||
|
||||
const meta: CompressedContext['meta'] = {
|
||||
totalMessages: total,
|
||||
verbatimCount: 0,
|
||||
hadSnapshot: false,
|
||||
compressed: false,
|
||||
summaryTokenEstimate: 0,
|
||||
}
|
||||
|
||||
const snapshot = this.messageFetcher.getContextSnapshot(input.roomId)
|
||||
console.log(`[ContextEngine] snapshot=${snapshot ? `EXISTS (lastMsgId=${snapshot.lastMessageId}, summaryLen=${snapshot.summary.length})` : 'NONE'}`)
|
||||
|
||||
// ── Path A: Snapshot exists — incremental ────────────
|
||||
if (snapshot) {
|
||||
meta.hadSnapshot = true
|
||||
|
||||
// Find the position of lastMessageId in messages
|
||||
const snapshotIdx = messages.findIndex(m => m.id === snapshot.lastMessageId)
|
||||
// Collect messages after the snapshot position
|
||||
const newMessages = snapshotIdx >= 0
|
||||
? messages.slice(snapshotIdx + 1)
|
||||
: messages.filter(m => m.timestamp > snapshot.lastMessageTimestamp)
|
||||
|
||||
const summaryTokens = this.countTokens(snapshot.summary)
|
||||
const newTokens = this.estimateTokensFromMessages(newMessages)
|
||||
const totalTokens = summaryTokens + newTokens
|
||||
|
||||
meta.verbatimCount = newMessages.length
|
||||
meta.summaryTokenEstimate = summaryTokens
|
||||
|
||||
console.log(`[ContextEngine] [Path A] snapshotIdx=${snapshotIdx}, newMessages=${newMessages.length}, summaryTokens=~${summaryTokens}, newTokens=~${newTokens}, totalTokens=~${totalTokens}, threshold=${config.triggerTokens}`)
|
||||
console.log(`[ContextEngine] [Path A] EXISTING SUMMARY (${snapshot.summary.length} chars):`, snapshot.summary.slice(0, 300))
|
||||
if (newMessages.length > 0) {
|
||||
console.log(`[ContextEngine] [Path A] NEW MESSAGES (${newMessages.length}):`, newMessages.map(m => `[${m.senderName}]: ${m.content.slice(0, 80)}`).join(' | '))
|
||||
}
|
||||
|
||||
// Under threshold — return summary + new messages directly
|
||||
if (totalTokens <= config.triggerTokens) {
|
||||
console.log(`[ContextEngine] [Path A] UNDER threshold — return summary + ${newMessages.length} verbatim msgs directly`)
|
||||
const history = this.buildHistory(snapshot.summary, newMessages, input.agentSocketId)
|
||||
this.logHistory('Path A (no compress)', history)
|
||||
return { conversationHistory: history, instructions, meta }
|
||||
}
|
||||
|
||||
// Over threshold — incremental compress
|
||||
console.log(`[ContextEngine] [Path A] OVER threshold — starting INCREMENTAL compression of ${newMessages.length} msgs...`)
|
||||
console.log(`[ContextEngine] [Path A] CONTEXT BEFORE COMPRESSION: summary(${snapshot.summary.length} chars) + ${newMessages.length} new msgs`)
|
||||
meta.compressed = true
|
||||
|
||||
const t0 = Date.now()
|
||||
const result = await this.summarize(
|
||||
input.roomId,
|
||||
newMessages,
|
||||
input.upstream,
|
||||
input.apiKey,
|
||||
snapshot.summary,
|
||||
)
|
||||
const elapsed = Date.now() - t0
|
||||
|
||||
if (result.summary) {
|
||||
const lastMsg = newMessages[newMessages.length - 1]
|
||||
this.messageFetcher.saveContextSnapshot(input.roomId, result.summary, lastMsg.id, lastMsg.timestamp)
|
||||
|
||||
meta.summaryTokenEstimate = this.countTokens(result.summary)
|
||||
console.log(`[ContextEngine] [Path A] incremental compression DONE in ${elapsed}ms, newSummaryLen=${result.summary.length}, newLastMsgId=${lastMsg.id}`)
|
||||
console.log(`[ContextEngine] [Path A] NEW SUMMARY (${result.summary.length} chars):`, result.summary.slice(0, 300))
|
||||
const history = this.buildHistory(result.summary, newMessages, input.agentSocketId)
|
||||
this.logHistory('Path A (after incremental compress)', history)
|
||||
if (result.sessionId) this.sessionCleaner?.(result.sessionId)
|
||||
return { conversationHistory: history, instructions, meta }
|
||||
}
|
||||
|
||||
// Compression failed — degrade
|
||||
console.warn(`[ContextEngine] [Path A] incremental compression FAILED (${elapsed}ms) — degrading to summary + trimmed verbatim`)
|
||||
const history = this.buildHistory(snapshot.summary, newMessages, input.agentSocketId)
|
||||
this.trimToBudget(history, summaryTokens, config.maxHistoryTokens)
|
||||
return { conversationHistory: history, instructions, meta }
|
||||
}
|
||||
|
||||
// ── Path B: No snapshot — full context ───────────────
|
||||
const totalTokens = this.estimateTokensFromMessages(messages)
|
||||
meta.verbatimCount = total
|
||||
|
||||
console.log(`[ContextEngine] [Path B] no snapshot, totalMessages=${total}, totalTokens=~${totalTokens}, threshold=${config.triggerTokens}`)
|
||||
|
||||
// Under threshold — pass all messages verbatim
|
||||
if (totalTokens <= config.triggerTokens) {
|
||||
console.log(`[ContextEngine] [Path B] UNDER threshold — return all ${total} msgs verbatim`)
|
||||
const history = messages.map(m => this.mapToHistory(m, input.agentSocketId))
|
||||
this.logHistory('Path B (no compress)', history)
|
||||
return { conversationHistory: history, instructions, meta }
|
||||
}
|
||||
|
||||
// Over threshold — full compress
|
||||
console.log(`[ContextEngine] [Path B] OVER threshold — starting FULL compression of ${total} msgs...`)
|
||||
console.log(`[ContextEngine] [Path B] CONTEXT BEFORE COMPRESSION: ${total} msgs, ~${totalTokens} tokens`)
|
||||
meta.compressed = true
|
||||
|
||||
const t0 = Date.now()
|
||||
const result = await this.summarize(
|
||||
input.roomId,
|
||||
messages,
|
||||
input.upstream,
|
||||
input.apiKey,
|
||||
)
|
||||
const elapsed = Date.now() - t0
|
||||
|
||||
if (result.summary) {
|
||||
// Keep recent tail messages verbatim, compress the rest
|
||||
const { tailMessageCount } = config
|
||||
const toCompress = messages.length > tailMessageCount ? messages.slice(0, -tailMessageCount) : messages
|
||||
const tail = messages.length > tailMessageCount ? messages.slice(-tailMessageCount) : []
|
||||
const lastCompressedMsg = toCompress[toCompress.length - 1]
|
||||
|
||||
this.messageFetcher.saveContextSnapshot(input.roomId, result.summary, lastCompressedMsg.id, lastCompressedMsg.timestamp)
|
||||
|
||||
meta.summaryTokenEstimate = this.countTokens(result.summary)
|
||||
console.log(`[ContextEngine] [Path B] full compression DONE in ${elapsed}ms, summaryLen=${result.summary.length}, compressed=${toCompress.length} msgs, keptTail=${tail.length} msgs, savedLastMsgId=${lastCompressedMsg.id}`)
|
||||
console.log(`[ContextEngine] [Path B] COMPRESSED SUMMARY (${result.summary.length} chars):`, result.summary.slice(0, 300))
|
||||
const history = this.buildHistory(result.summary, tail, input.agentSocketId)
|
||||
this.logHistory('Path B (after full compress)', history)
|
||||
if (result.sessionId) this.sessionCleaner?.(result.sessionId)
|
||||
return { conversationHistory: history, instructions, meta }
|
||||
}
|
||||
|
||||
// Compression failed — degrade
|
||||
console.warn(`[ContextEngine] [Path B] full compression FAILED (${elapsed}ms) — degrading to trimmed verbatim`)
|
||||
const history = messages.map(m => this.mapToHistory(m, input.agentSocketId))
|
||||
this.trimToBudget(history, 0, config.maxHistoryTokens)
|
||||
meta.verbatimCount = history.length
|
||||
return { conversationHistory: history, instructions, meta }
|
||||
}
|
||||
|
||||
invalidateRoom(roomId: string): void {
|
||||
this.messageFetcher.deleteContextSnapshot(roomId)
|
||||
}
|
||||
|
||||
/**
|
||||
* Force compress all messages in a room (full compression).
|
||||
* Used when user manually triggers compression.
|
||||
*/
|
||||
async forceCompress(roomId: string): Promise<string> {
|
||||
const allMessages = this.messageFetcher.getMessages(roomId)
|
||||
if (allMessages.length === 0) return ''
|
||||
|
||||
const config = { ...this.config }
|
||||
console.log(`[ContextEngine] forceCompress room=${roomId}, messages=${allMessages.length}`)
|
||||
|
||||
const t0 = Date.now()
|
||||
const result = await this.summarize(roomId, allMessages, this._upstream, this._apiKey)
|
||||
const elapsed = Date.now() - t0
|
||||
|
||||
if (result.summary) {
|
||||
const { tailMessageCount } = config
|
||||
const toCompress = allMessages.length > tailMessageCount ? allMessages.slice(0, -tailMessageCount) : allMessages
|
||||
const tail = allMessages.length > tailMessageCount ? allMessages.slice(-tailMessageCount) : []
|
||||
const lastCompressedMsg = toCompress[toCompress.length - 1]
|
||||
|
||||
this.messageFetcher.saveContextSnapshot(roomId, result.summary, lastCompressedMsg.id, lastCompressedMsg.timestamp)
|
||||
console.log(`[ContextEngine] forceCompress DONE in ${elapsed}ms`)
|
||||
if (result.sessionId) this.sessionCleaner?.(result.sessionId)
|
||||
return result.summary
|
||||
}
|
||||
|
||||
throw new Error('Compression failed')
|
||||
}
|
||||
|
||||
// ─── Private ─────────────────────────────────────────────
|
||||
|
||||
/**
|
||||
* Build history array: optional summary prefix + verbatim messages.
|
||||
*/
|
||||
private buildHistory(
|
||||
summary: string,
|
||||
messages: StoredMessage[],
|
||||
agentSocketId: string,
|
||||
): Array<{ role: 'user' | 'assistant'; content: string }> {
|
||||
const history: Array<{ role: 'user' | 'assistant'; content: string }> = []
|
||||
|
||||
if (summary) {
|
||||
history.push(
|
||||
{ role: 'user', content: '[Previous conversation summary]\n' + summary },
|
||||
{ role: 'assistant', content: 'I have reviewed the conversation history and understand the context.' },
|
||||
)
|
||||
}
|
||||
|
||||
history.push(...messages.map(m => this.mapToHistory(m, agentSocketId)))
|
||||
return history
|
||||
}
|
||||
|
||||
/**
|
||||
* Summarize messages. If previousSummary is provided, do incremental update.
|
||||
*/
|
||||
private async summarize(
|
||||
roomId: string,
|
||||
messages: StoredMessage[],
|
||||
upstream: string,
|
||||
apiKey: string | null,
|
||||
previousSummary?: string,
|
||||
): Promise<{ summary: string | null; sessionId: string | null }> {
|
||||
if (messages.length === 0 && !previousSummary) return { summary: null, sessionId: null }
|
||||
|
||||
try {
|
||||
const result = await this.gatewayCaller.summarize(
|
||||
upstream,
|
||||
apiKey,
|
||||
buildSummarizationSystemPrompt(),
|
||||
messages,
|
||||
previousSummary,
|
||||
)
|
||||
return { summary: result.summary, sessionId: result.sessionId }
|
||||
} catch (err: any) {
|
||||
console.warn(`[ContextEngine] Summarization failed for room ${roomId}: ${err.message}`)
|
||||
return { summary: null, sessionId: null }
|
||||
} finally {
|
||||
// Session cleanup handled here if sessionCleaner is provided
|
||||
}
|
||||
}
|
||||
|
||||
private mapToHistory(
|
||||
msg: StoredMessage,
|
||||
agentSocketId: string,
|
||||
): { role: 'user' | 'assistant'; content: string } {
|
||||
if (msg.senderId === agentSocketId) {
|
||||
return { role: 'assistant', content: msg.content }
|
||||
}
|
||||
return { role: 'user', content: `[${msg.senderName}]: ${msg.content}` }
|
||||
}
|
||||
|
||||
private trimToBudget(
|
||||
history: Array<{ role: 'user' | 'assistant'; content: string }>,
|
||||
summaryTokens: number,
|
||||
maxTokens: number,
|
||||
): void {
|
||||
let totalTokens = summaryTokens + this.estimateTokens(history)
|
||||
while (totalTokens > maxTokens && history.length > 0) {
|
||||
history.pop()
|
||||
totalTokens = summaryTokens + this.estimateTokens(history)
|
||||
}
|
||||
}
|
||||
|
||||
private estimateTokens(history: Array<{ role: string; content: string }>): number {
|
||||
const text = history.map(m => m.content).join('')
|
||||
return this.countTokens(text)
|
||||
}
|
||||
|
||||
private estimateTokensFromMessages(messages: StoredMessage[]): number {
|
||||
const text = messages.map(m => m.content + m.senderName).join('')
|
||||
return this.countTokens(text)
|
||||
}
|
||||
|
||||
/** Estimate tokens distinguishing CJK (~1.5 tok/char) from Latin (~0.25 tok/char) */
|
||||
private countTokens(text: string): number {
|
||||
const cjk = (text.match(/[\u2e80-\u9fff\uac00-\ud7af\u3000-\u303f\uff00-\uffef]/g) || []).length
|
||||
const other = text.length - cjk
|
||||
return Math.ceil(cjk * 1.5 + other / 4)
|
||||
}
|
||||
|
||||
/** Log assembled history for debugging */
|
||||
private logHistory(label: string, history: Array<{ role: string; content: string }>): void {
|
||||
const totalTokens = this.estimateTokens(history)
|
||||
console.log(`[ContextEngine] ASSEMBLED HISTORY (${label}): ${history.length} entries, ~${totalTokens} tokens`)
|
||||
for (const entry of history) {
|
||||
const preview = entry.content.length > 150 ? entry.content.slice(0, 150) + '...' : entry.content
|
||||
console.log(` [${entry.role}] ${preview}`)
|
||||
}
|
||||
}
|
||||
}
|
||||
@@ -0,0 +1,117 @@
|
||||
import { EventSource } from 'eventsource'
|
||||
import type { StoredMessage, GatewayCaller } from './types'
|
||||
import {
|
||||
buildSummarizationSystemPrompt,
|
||||
buildFullSummaryPrompt,
|
||||
buildIncrementalUpdatePrompt,
|
||||
} from './prompt'
|
||||
|
||||
/**
|
||||
* Calls Hermes /v1/runs to produce LLM-generated summaries.
|
||||
* Uses non-streaming EventSource to wait for run.completed.
|
||||
*/
|
||||
export class GatewaySummarizer implements GatewayCaller {
|
||||
private timeoutMs: number
|
||||
|
||||
constructor(timeoutMs = 30_000) {
|
||||
this.timeoutMs = timeoutMs
|
||||
}
|
||||
|
||||
async summarize(
|
||||
upstream: string,
|
||||
apiKey: string | null,
|
||||
systemPrompt: string,
|
||||
messages: StoredMessage[],
|
||||
previousSummary?: string,
|
||||
): Promise<{ summary: string; sessionId: string }> {
|
||||
// Build conversation_history from messages
|
||||
const history: Array<{ role: string; content: string }> = messages.map(m => ({
|
||||
role: 'user',
|
||||
content: `[${m.senderName}]: ${m.content}`,
|
||||
}))
|
||||
|
||||
// Inject previous summary for incremental update
|
||||
if (previousSummary) {
|
||||
history.unshift(
|
||||
{ role: 'user', content: `[Previous summary]\n${previousSummary}` },
|
||||
{ role: 'assistant', content: 'Understood, I will update the summary.' },
|
||||
)
|
||||
}
|
||||
|
||||
const userPrompt = previousSummary
|
||||
? buildIncrementalUpdatePrompt()
|
||||
: buildFullSummaryPrompt()
|
||||
|
||||
const sessionId = Date.now().toString(36) + Math.random().toString(36).slice(2, 8)
|
||||
|
||||
// POST /v1/runs
|
||||
const res = await fetch(`${upstream}/v1/runs`, {
|
||||
method: 'POST',
|
||||
headers: {
|
||||
'Content-Type': 'application/json',
|
||||
...(apiKey ? { Authorization: `Bearer ${apiKey}` } : {}),
|
||||
},
|
||||
body: JSON.stringify({
|
||||
input: userPrompt,
|
||||
instructions: systemPrompt || buildSummarizationSystemPrompt(),
|
||||
conversation_history: history,
|
||||
session_id: sessionId,
|
||||
}),
|
||||
signal: AbortSignal.timeout(this.timeoutMs),
|
||||
})
|
||||
|
||||
if (!res.ok) {
|
||||
throw new Error(`Summarization run failed: ${res.status}`)
|
||||
}
|
||||
|
||||
const { run_id } = await res.json() as { run_id: string }
|
||||
|
||||
try {
|
||||
const output = await this.pollForResult(upstream, apiKey, run_id)
|
||||
return { summary: output, sessionId }
|
||||
} finally {
|
||||
// Note: session cleanup is handled by the caller (compressor.ts)
|
||||
}
|
||||
}
|
||||
|
||||
private pollForResult(upstream: string, apiKey: string | null, runId: string): Promise<string> {
|
||||
return new Promise<string>((resolve, reject) => {
|
||||
const timer = setTimeout(() => {
|
||||
source.close()
|
||||
reject(new Error('Summarization timed out'))
|
||||
}, this.timeoutMs)
|
||||
|
||||
const eventsUrl = new URL(`${upstream}/v1/runs/${runId}/events`)
|
||||
if (apiKey) eventsUrl.searchParams.set('token', apiKey)
|
||||
|
||||
const source = new EventSource(eventsUrl.toString())
|
||||
|
||||
source.onmessage = (event: MessageEvent) => {
|
||||
try {
|
||||
const parsed = JSON.parse(event.data)
|
||||
if (parsed.event === 'run.completed') {
|
||||
clearTimeout(timer)
|
||||
source.close()
|
||||
const output = parsed.output
|
||||
if (!output || typeof output !== 'string' || output.trim() === '') {
|
||||
reject(new Error('Empty summarization response'))
|
||||
return
|
||||
}
|
||||
resolve(output.trim())
|
||||
} else if (parsed.event === 'run.failed') {
|
||||
clearTimeout(timer)
|
||||
source.close()
|
||||
reject(new Error(parsed.error || 'Summarization run failed'))
|
||||
}
|
||||
} catch { /* ignore parse errors for non-JSON events */ }
|
||||
}
|
||||
|
||||
source.onerror = () => {
|
||||
clearTimeout(timer)
|
||||
source.close()
|
||||
reject(new Error('Summarization SSE connection error'))
|
||||
}
|
||||
})
|
||||
}
|
||||
|
||||
}
|
||||
@@ -0,0 +1,13 @@
|
||||
export { ContextEngine } from './compressor'
|
||||
export { GatewaySummarizer } from './gateway-client'
|
||||
export { buildAgentInstructions, buildSummarizationSystemPrompt, buildFullSummaryPrompt, buildIncrementalUpdatePrompt } from './prompt'
|
||||
export { DEFAULT_COMPRESSION_CONFIG } from './types'
|
||||
export type {
|
||||
StoredMessage,
|
||||
CompressionConfig,
|
||||
CompressedContext,
|
||||
ContextSnapshot,
|
||||
MessageFetcher,
|
||||
GatewayCaller,
|
||||
BuildContextInput,
|
||||
} from './types'
|
||||
@@ -0,0 +1,82 @@
|
||||
// ─── Agent Identity Instructions ────────────────────────────
|
||||
|
||||
import type { MemberInfo } from './types'
|
||||
|
||||
interface AgentInstructionsParams {
|
||||
agentName: string
|
||||
roomName: string
|
||||
agentDescription: string
|
||||
memberNames: string[]
|
||||
members: MemberInfo[]
|
||||
}
|
||||
|
||||
export function buildAgentInstructions(params: AgentInstructionsParams): string {
|
||||
let memberSection: string
|
||||
if (params.members.length > 0) {
|
||||
memberSection = params.members
|
||||
.map(m => m.description ? `- ${m.name}: ${m.description}` : `- ${m.name}`)
|
||||
.join('\n')
|
||||
} else if (params.memberNames.length > 0) {
|
||||
memberSection = params.memberNames.map(n => `- ${n}`).join('\n')
|
||||
} else {
|
||||
memberSection = '- 未知'
|
||||
}
|
||||
|
||||
return `你是"${params.agentName}",群聊房间"${params.roomName}"中的 AI 助手。
|
||||
|
||||
你的角色:${params.agentDescription}
|
||||
|
||||
当前房间成员:
|
||||
${memberSection}
|
||||
|
||||
规则:
|
||||
- 有人用 @${params.agentName} 提及你时才需要回复,重点回应提及你的人。
|
||||
- 回答简洁、对群聊有帮助。
|
||||
- 不要假装是人类,需要时明确表明自己是 AI。
|
||||
- 对话历史中包含多个人的消息,每条消息前标有发送者名字。
|
||||
- 对话开头可能包含之前的对话摘要,用于提供更早的上下文。
|
||||
- 回复最新一条提及你的消息。
|
||||
- 如果需要其他 agent 协作或明确回复某个人,使用 @名字 来提及对方。
|
||||
- 自行判断对话是否已经结束——如果问题已解决、达成共识、或对方只是陈述不需要回复,则不要再 @任何人,直接结束回复,避免产生无意义的循环对话。`
|
||||
}
|
||||
|
||||
// ─── Summarization Prompts ─────────────────────────────────
|
||||
|
||||
export function buildSummarizationSystemPrompt(): string {
|
||||
return `你是一个群聊对话的摘要助手。请创建一份结构化摘要,帮助 AI 助手快速理解完整的对话上下文并智能回复。
|
||||
|
||||
使用以下格式:
|
||||
|
||||
当前话题:
|
||||
- 现在在聊什么,目标是什么
|
||||
|
||||
已知结论:
|
||||
- 已达成哪些共识,哪些问题已经回答过
|
||||
|
||||
待回复消息:
|
||||
- 还剩谁的问题没回,下一步要做什么
|
||||
|
||||
关键人物:
|
||||
- 人名、角色、引用关系
|
||||
|
||||
重要上下文:
|
||||
- 不要丢时间线和立场变化
|
||||
- 少写废话,多保留"可行动信息"
|
||||
- 重点保留:谁说了什么、结论是什么、下一步是什么
|
||||
- 关键的 URL、代码片段、错误信息、约束条件
|
||||
|
||||
规则:
|
||||
- 基于事实,不要编造信息。
|
||||
- 保持简洁(500 字以内)。
|
||||
- 聚焦于帮助 AI 回复下一条消息的可行动信息。
|
||||
- 使用与对话相同的语言。
|
||||
- 不要回复对话内容,只输出摘要。`
|
||||
}
|
||||
|
||||
export function buildFullSummaryPrompt(): string {
|
||||
return '请对上方对话创建一份简洁的摘要。只输出摘要内容。'
|
||||
}
|
||||
|
||||
export function buildIncrementalUpdatePrompt(): string {
|
||||
return '对话自上次摘要后有了新的内容。请更新摘要,整合新消息。保持相同格式,更新所有部分。只输出更新后的摘要。'
|
||||
}
|
||||
@@ -0,0 +1,49 @@
|
||||
import type { SummaryCacheEntry } from './types'
|
||||
|
||||
const MAX_ENTRIES = 200
|
||||
|
||||
export class SummaryCache {
|
||||
private cache = new Map<string, SummaryCacheEntry>()
|
||||
private ttlMs: number
|
||||
|
||||
constructor(ttlMs = 120_000) {
|
||||
this.ttlMs = ttlMs
|
||||
}
|
||||
|
||||
get(roomId: string): SummaryCacheEntry | undefined {
|
||||
const entry = this.cache.get(roomId)
|
||||
if (!entry) return undefined
|
||||
if (Date.now() - entry.createdAt >= this.ttlMs) {
|
||||
this.cache.delete(roomId)
|
||||
return undefined
|
||||
}
|
||||
return entry
|
||||
}
|
||||
|
||||
set(roomId: string, entry: SummaryCacheEntry): void {
|
||||
if (this.cache.size >= MAX_ENTRIES) {
|
||||
let oldestKey = ''
|
||||
let oldestTime = Infinity
|
||||
for (const [k, v] of this.cache) {
|
||||
if (v.createdAt < oldestTime) {
|
||||
oldestTime = v.createdAt
|
||||
oldestKey = k
|
||||
}
|
||||
}
|
||||
if (oldestKey) this.cache.delete(oldestKey)
|
||||
}
|
||||
this.cache.set(roomId, entry)
|
||||
}
|
||||
|
||||
invalidate(roomId: string): void {
|
||||
this.cache.delete(roomId)
|
||||
}
|
||||
|
||||
clear(): void {
|
||||
this.cache.clear()
|
||||
}
|
||||
|
||||
get size(): number {
|
||||
return this.cache.size
|
||||
}
|
||||
}
|
||||
@@ -0,0 +1,111 @@
|
||||
// ─── Message Types ──────────────────────────────────────────
|
||||
|
||||
/** Raw message from SQLite messages table */
|
||||
export interface StoredMessage {
|
||||
id: string
|
||||
roomId: string
|
||||
senderId: string
|
||||
senderName: string
|
||||
content: string
|
||||
timestamp: number
|
||||
}
|
||||
|
||||
// ─── Compression Config ────────────────────────────────────
|
||||
|
||||
export interface CompressionConfig {
|
||||
/** Token threshold to trigger compression (estimate all messages) */
|
||||
triggerTokens: number
|
||||
/** Max tokens for the final compressed context sent to LLM */
|
||||
maxHistoryTokens: number
|
||||
/** Number of recent messages to keep verbatim after compression */
|
||||
tailMessageCount: number
|
||||
/** Characters per token for estimation */
|
||||
charsPerToken: number
|
||||
/** Timeout for summarization LLM call in ms */
|
||||
summarizationTimeoutMs: number
|
||||
}
|
||||
|
||||
export const DEFAULT_COMPRESSION_CONFIG: CompressionConfig = {
|
||||
triggerTokens: 100_000,
|
||||
maxHistoryTokens: 32_000,
|
||||
tailMessageCount: 20,
|
||||
charsPerToken: 4,
|
||||
summarizationTimeoutMs: 30_000,
|
||||
}
|
||||
|
||||
// ─── Compression Output ────────────────────────────────────
|
||||
|
||||
export interface CompressedContext {
|
||||
conversationHistory: Array<{ role: 'user' | 'assistant'; content: string }>
|
||||
instructions: string
|
||||
meta: {
|
||||
totalMessages: number
|
||||
verbatimCount: number
|
||||
hadSnapshot: boolean
|
||||
compressed: boolean
|
||||
summaryTokenEstimate: number
|
||||
}
|
||||
}
|
||||
|
||||
// ─── Context Snapshot (persisted in SQLite) ────────────────
|
||||
|
||||
export interface ContextSnapshot {
|
||||
roomId: string
|
||||
summary: string
|
||||
lastMessageId: string
|
||||
lastMessageTimestamp: number
|
||||
updatedAt: number
|
||||
}
|
||||
|
||||
// ─── Summary Cache ──────────────────────────────────────────
|
||||
|
||||
export interface SummaryCacheEntry {
|
||||
summary: string
|
||||
lastMessageId: string
|
||||
lastMessageTimestamp: number
|
||||
createdAt: number
|
||||
}
|
||||
|
||||
// ─── Dependency Injection ──────────────────────────────────
|
||||
|
||||
export interface MessageFetcher {
|
||||
getMessages(roomId: string, limit?: number): StoredMessage[]
|
||||
getContextSnapshot(roomId: string): ContextSnapshot | null
|
||||
saveContextSnapshot(roomId: string, summary: string, lastMessageId: string, lastMessageTimestamp: number): void
|
||||
deleteContextSnapshot(roomId: string): void
|
||||
}
|
||||
|
||||
export interface GatewayCaller {
|
||||
summarize(
|
||||
upstream: string,
|
||||
apiKey: string | null,
|
||||
systemPrompt: string,
|
||||
messages: StoredMessage[],
|
||||
previousSummary?: string,
|
||||
): Promise<{ summary: string; sessionId: string }>
|
||||
}
|
||||
|
||||
export type SessionCleaner = (sessionId: string) => void
|
||||
|
||||
// ─── Build Context Input ───────────────────────────────────
|
||||
|
||||
export interface MemberInfo {
|
||||
userId: string
|
||||
name: string
|
||||
description: string
|
||||
}
|
||||
|
||||
export interface BuildContextInput {
|
||||
roomId: string
|
||||
agentId: string
|
||||
agentName: string
|
||||
agentDescription: string
|
||||
agentSocketId: string
|
||||
roomName: string
|
||||
memberNames: string[]
|
||||
members: MemberInfo[]
|
||||
upstream: string
|
||||
apiKey: string | null
|
||||
currentMessage: StoredMessage
|
||||
compression?: Partial<CompressionConfig>
|
||||
}
|
||||
@@ -0,0 +1,669 @@
|
||||
import { io, Socket } from 'socket.io-client'
|
||||
import { EventSource } from 'eventsource'
|
||||
import { getToken } from '../../../services/auth'
|
||||
import type { GatewayManager } from '../gateway-manager'
|
||||
import { deleteSession as hermesDeleteSession } from '../hermes-cli'
|
||||
import { getActiveProfileName } from '../hermes-profile'
|
||||
import { logger } from '../../../services/logger'
|
||||
|
||||
// ─── Types ────────────────────────────────────────────────────
|
||||
|
||||
interface AgentConfig {
|
||||
profile: string
|
||||
name: string
|
||||
description: string
|
||||
invited: number
|
||||
}
|
||||
|
||||
interface MessageData {
|
||||
id: string
|
||||
roomId: string
|
||||
senderId: string
|
||||
senderName: string
|
||||
content: string
|
||||
timestamp: number
|
||||
}
|
||||
|
||||
interface MemberData {
|
||||
id: string
|
||||
name: string
|
||||
joinedAt: number
|
||||
}
|
||||
|
||||
interface JoinResult {
|
||||
roomId: string
|
||||
roomName: string
|
||||
members: MemberData[]
|
||||
messages: MessageData[]
|
||||
rooms: string[]
|
||||
}
|
||||
|
||||
export interface AgentEventHandler {
|
||||
onMessage?: (data: { roomId: string; msg: MessageData }) => void
|
||||
onTyping?: (data: { roomId: string; userId: string; userName: string }) => void
|
||||
onStopTyping?: (data: { roomId: string; userId: string; userName: string }) => void
|
||||
onMemberJoined?: (data: { roomId: string; memberId: string; memberName: string; members: MemberData[] }) => void
|
||||
onMemberLeft?: (data: { roomId: string; memberId: string; memberName: string; members: MemberData[] }) => void
|
||||
}
|
||||
|
||||
// ─── Agent Client (single connection) ─────────────────────────
|
||||
|
||||
class AgentClient {
|
||||
readonly agentId: string
|
||||
readonly profile: string
|
||||
readonly name: string
|
||||
readonly description: string
|
||||
private socket: Socket | null = null
|
||||
private joinedRooms = new Set<string>()
|
||||
private handlers: AgentEventHandler
|
||||
private _reconnecting = false
|
||||
private gatewayManager: GatewayManager | null = null
|
||||
private contextEngine: any = null
|
||||
private storage: any = null
|
||||
|
||||
constructor(config: AgentConfig, handlers: AgentEventHandler = {}) {
|
||||
this.agentId = Date.now().toString(36) + Math.random().toString(36).slice(2, 8)
|
||||
this.profile = config.profile
|
||||
this.name = config.name
|
||||
this.description = config.description
|
||||
this.handlers = handlers
|
||||
}
|
||||
|
||||
get connected(): boolean {
|
||||
return this.socket?.connected ?? false
|
||||
}
|
||||
|
||||
get id(): string | undefined {
|
||||
return this.socket?.id
|
||||
}
|
||||
|
||||
setGatewayManager(manager: GatewayManager): void {
|
||||
this.gatewayManager = manager
|
||||
}
|
||||
|
||||
setContextEngine(engine: any): void {
|
||||
this.contextEngine = engine
|
||||
}
|
||||
|
||||
setStorage(storage: any): void {
|
||||
this.storage = storage
|
||||
}
|
||||
|
||||
async connect(port = 8648): Promise<void> {
|
||||
const token = await getToken()
|
||||
|
||||
this.socket = io(`http://127.0.0.1:${port}/group-chat`, {
|
||||
auth: {
|
||||
token: token || undefined,
|
||||
name: this.name,
|
||||
},
|
||||
transports: ['websocket'],
|
||||
reconnection: true,
|
||||
reconnectionAttempts: Infinity,
|
||||
reconnectionDelay: 1000,
|
||||
reconnectionDelayMax: 30000,
|
||||
})
|
||||
|
||||
this.bindEvents()
|
||||
|
||||
return new Promise((resolve, reject) => {
|
||||
const timeout = setTimeout(() => reject(new Error('Connection timeout')), 10000)
|
||||
|
||||
this.socket!.on('connect', () => {
|
||||
clearTimeout(timeout)
|
||||
logger.debug(`[AgentClient] ${this.name} connected, socket id: ${this.socket!.id}`)
|
||||
resolve()
|
||||
})
|
||||
|
||||
this.socket!.on('connect_error', (err) => {
|
||||
clearTimeout(timeout)
|
||||
logger.error(err, `[AgentClient] ${this.name} connect_error`)
|
||||
reject(err)
|
||||
})
|
||||
})
|
||||
}
|
||||
|
||||
disconnect(): void {
|
||||
if (this.socket) {
|
||||
this.socket.disconnect()
|
||||
this.socket = null
|
||||
this.joinedRooms.clear()
|
||||
}
|
||||
}
|
||||
|
||||
async joinRoom(roomId: string): Promise<JoinResult> {
|
||||
this.ensureConnected()
|
||||
return new Promise((resolve, reject) => {
|
||||
this.socket!.emit('join', { roomId }, (res: JoinResult | { error: string }) => {
|
||||
if ('error' in res) {
|
||||
reject(new Error(res.error))
|
||||
} else {
|
||||
this.joinedRooms.add(roomId)
|
||||
resolve(res)
|
||||
}
|
||||
})
|
||||
})
|
||||
}
|
||||
|
||||
sendMessage(roomId: string, content: string): Promise<string> {
|
||||
this.ensureConnected()
|
||||
return new Promise((resolve, reject) => {
|
||||
this.socket!.emit('message', { roomId, content }, (res: { id?: string; error?: string }) => {
|
||||
if (res.error) {
|
||||
reject(new Error(res.error))
|
||||
} else {
|
||||
resolve(res.id!)
|
||||
}
|
||||
})
|
||||
})
|
||||
}
|
||||
|
||||
startTyping(roomId: string): void {
|
||||
this.ensureConnected()
|
||||
this.socket!.emit('typing', { roomId })
|
||||
}
|
||||
|
||||
stopTyping(roomId: string): void {
|
||||
this.ensureConnected()
|
||||
this.socket!.emit('stop_typing', { roomId })
|
||||
}
|
||||
|
||||
emitContextStatus(roomId: string, status: 'compressing' | 'replying' | 'ready'): void {
|
||||
this.ensureConnected()
|
||||
this.socket!.emit('context_status', { roomId, agentName: this.name, status })
|
||||
}
|
||||
|
||||
getJoinedRooms(): string[] {
|
||||
return Array.from(this.joinedRooms)
|
||||
}
|
||||
|
||||
private ensureConnected(): void {
|
||||
if (!this.socket?.connected) {
|
||||
throw new Error(`Agent "${this.name}" is not connected`)
|
||||
}
|
||||
}
|
||||
|
||||
private async deleteSession(sessionId: string): Promise<void> {
|
||||
try {
|
||||
const sessionProfile = this.storage?.getSessionProfile?.(sessionId)
|
||||
const currentProfile = getActiveProfileName()
|
||||
|
||||
if (sessionProfile && sessionProfile.profile_name !== currentProfile) {
|
||||
// Cross-profile: enqueue deferred delete, don't switch profile
|
||||
this.storage?.enqueuePendingSessionDelete?.(sessionId, sessionProfile.profile_name)
|
||||
logger.info(`[AgentClients] ${this.name}: cross-profile deferred delete session ${sessionId} (session=${sessionProfile.profile_name}, active=${currentProfile})`)
|
||||
return
|
||||
}
|
||||
|
||||
// Same profile or no mapping: delete directly
|
||||
const ok = await hermesDeleteSession(sessionId)
|
||||
if (ok) {
|
||||
this.storage?.deleteSessionProfile?.(sessionId)
|
||||
}
|
||||
logger.debug(`[AgentClients] ${this.name}: delete session ${sessionId} (profile=${this.profile}) → ${ok ? 'ok' : 'failed'}`)
|
||||
} catch (err: any) {
|
||||
logger.warn(`[AgentClients] ${this.name}: failed to delete session ${sessionId}: ${err.message}`)
|
||||
}
|
||||
}
|
||||
|
||||
// ─── Hermes Gateway Integration ────────────────────────────
|
||||
|
||||
/**
|
||||
* Handle an @mention from the server side.
|
||||
* Called by AgentClients.processMentions() — no socket round-trip needed.
|
||||
* onStatus is called to report context compression progress.
|
||||
*/
|
||||
async replyToMention(
|
||||
roomId: string,
|
||||
msg: { content: string; senderName: string; senderId: string; timestamp: number },
|
||||
onStatus?: (status: 'compressing' | 'replying' | 'ready') => void,
|
||||
): Promise<void> {
|
||||
logger.debug(`[AgentClients] ${this.name} mentioned by ${msg.senderName}: "${msg.content.slice(0, 50)}"`)
|
||||
if (!this.gatewayManager) {
|
||||
logger.debug(`[AgentClients] ${this.name}: gatewayManager is null, skipping`)
|
||||
return
|
||||
}
|
||||
|
||||
const upstream = this.gatewayManager.getUpstream(this.profile)
|
||||
const apiKey = this.gatewayManager.getApiKey(this.profile)
|
||||
logger.debug(`[AgentClients] ${this.name}: upstream=${upstream}, profile=${this.profile}`)
|
||||
if (!upstream) {
|
||||
logger.error(`[AgentClients] ${this.name}: no gateway upstream for profile "${this.profile}"`)
|
||||
return
|
||||
}
|
||||
|
||||
const sessionId = Date.now().toString(36) + Math.random().toString(36).slice(2, 8)
|
||||
|
||||
try {
|
||||
// Notify room that agent is typing
|
||||
this.startTyping(roomId)
|
||||
|
||||
// Build compressed context if context engine is available
|
||||
let conversationHistory: Array<{ role: string; content: string }> = []
|
||||
let instructions: string | undefined
|
||||
|
||||
if (this.contextEngine && this.storage) {
|
||||
try {
|
||||
logger.debug(`[AgentClients] ${this.name}: building context...`)
|
||||
onStatus?.('compressing')
|
||||
// Get room members with descriptions for context
|
||||
const roomMembers: Array<{ userId: string; name: string; description: string }> = this.storage.getRoomMembers(roomId) || []
|
||||
const memberNames = roomMembers.map((m: any) => m.name)
|
||||
const members = roomMembers.map((m: any) => ({ userId: m.userId, name: m.name, description: m.description }))
|
||||
|
||||
// Get room compression config
|
||||
const roomInfo = this.storage.getRoom(roomId)
|
||||
const compression = roomInfo ? {
|
||||
triggerTokens: roomInfo.triggerTokens,
|
||||
maxHistoryTokens: roomInfo.maxHistoryTokens,
|
||||
tailMessageCount: roomInfo.tailMessageCount,
|
||||
} : undefined
|
||||
|
||||
const ctx = await this.contextEngine.buildContext({
|
||||
roomId,
|
||||
agentId: this.agentId,
|
||||
agentName: this.name,
|
||||
agentDescription: this.description,
|
||||
agentSocketId: this.socket?.id || '',
|
||||
roomName: roomId,
|
||||
memberNames,
|
||||
members,
|
||||
upstream,
|
||||
apiKey,
|
||||
currentMessage: msg,
|
||||
compression,
|
||||
})
|
||||
conversationHistory = ctx.conversationHistory
|
||||
instructions = ctx.instructions
|
||||
logger.debug(`[AgentClients] ${this.name}: context built — historyLen=${conversationHistory.length}, meta=%j`, ctx.meta)
|
||||
onStatus?.('replying')
|
||||
} catch (err: any) {
|
||||
logger.warn(`[AgentClients] ${this.name}: context engine failed: ${err.message}`)
|
||||
onStatus?.('replying')
|
||||
// Degrade: continue without context
|
||||
}
|
||||
}
|
||||
|
||||
// Strip @mention from input — agent already knows it was mentioned
|
||||
const input = msg.content.replace(new RegExp(`@${this.name.replace(/[.*+?^${}()|[\]\\]/g, '\\$&')}\\s*`, 'gi'), '').trim() || msg.content
|
||||
|
||||
// Start a run on Hermes gateway
|
||||
const runRes = await fetch(`${upstream}/v1/runs`, {
|
||||
method: 'POST',
|
||||
headers: {
|
||||
'Content-Type': 'application/json',
|
||||
...(apiKey ? { Authorization: `Bearer ${apiKey}` } : {}),
|
||||
},
|
||||
body: JSON.stringify({
|
||||
input,
|
||||
session_id: sessionId,
|
||||
...(conversationHistory.length > 0 ? { conversation_history: conversationHistory } : {}),
|
||||
...(instructions ? { instructions } : {}),
|
||||
}),
|
||||
signal: AbortSignal.timeout(120000),
|
||||
})
|
||||
|
||||
if (!runRes.ok) {
|
||||
const text = await runRes.text().catch(() => '')
|
||||
logger.error(`[AgentClients] ${this.name}: gateway run failed (${runRes.status}): ${text}`)
|
||||
this.stopTyping(roomId)
|
||||
return
|
||||
}
|
||||
|
||||
const runData = await runRes.json() as any
|
||||
const run_id = runData.run_id
|
||||
logger.debug(`[AgentClients] ${this.name}: run started, response=%j`, runData)
|
||||
if (!run_id) {
|
||||
logger.error(`[AgentClients] ${this.name}: no run_id in response`)
|
||||
this.stopTyping(roomId)
|
||||
return
|
||||
}
|
||||
|
||||
// Save session-to-profile mapping after gateway confirms the run
|
||||
const actualSessionId = runData.session_id || sessionId
|
||||
if (!this.storage) {
|
||||
logger.warn(`[AgentClients] ${this.name}: storage is null, cannot save session profile for ${actualSessionId}`)
|
||||
} else {
|
||||
this.storage.saveSessionProfile(actualSessionId, roomId, this.agentId, this.profile)
|
||||
logger.debug(`[AgentClients] ${this.name}: saved session profile ${actualSessionId} → profile=${this.profile}`)
|
||||
}
|
||||
|
||||
// Stream events from Hermes
|
||||
const eventsUrl = new URL(`${upstream}/v1/runs/${run_id}/events`)
|
||||
if (apiKey) eventsUrl.searchParams.set('token', apiKey)
|
||||
logger.debug(`[AgentClients] ${this.name}: streaming events from ${eventsUrl}`)
|
||||
const source = new EventSource(eventsUrl.toString())
|
||||
|
||||
let fullContent = ''
|
||||
|
||||
source.onmessage = (e: any) => {
|
||||
try {
|
||||
const parsed = JSON.parse(e.data)
|
||||
logger.debug(`[AgentClients] ${this.name}: event=${parsed.event}`)
|
||||
|
||||
if (parsed.event === 'run.completed') {
|
||||
source.close()
|
||||
logger.debug(`[AgentClients] ${this.name}: run completed, content length=${fullContent.length}`)
|
||||
if (fullContent) {
|
||||
this.stopTyping(roomId)
|
||||
this.sendMessage(roomId, fullContent)
|
||||
}
|
||||
this.deleteSession(actualSessionId).catch(() => { })
|
||||
onStatus?.('ready')
|
||||
return
|
||||
}
|
||||
|
||||
if (parsed.event === 'run.failed') {
|
||||
source.close()
|
||||
logger.error(`[AgentClients] ${this.name}: run failed`)
|
||||
this.stopTyping(roomId)
|
||||
this.deleteSession(actualSessionId).catch(() => { })
|
||||
onStatus?.('ready')
|
||||
return
|
||||
}
|
||||
|
||||
// Accumulate message deltas
|
||||
if (parsed.event === 'message.delta' && parsed.delta) {
|
||||
fullContent += parsed.delta
|
||||
}
|
||||
} catch {
|
||||
// ignore parse errors
|
||||
}
|
||||
}
|
||||
|
||||
source.onerror = (err: any) => {
|
||||
logger.error(err, `[AgentClients] ${this.name}: EventSource error`)
|
||||
source.close()
|
||||
this.stopTyping(roomId)
|
||||
this.deleteSession(actualSessionId).catch(() => { })
|
||||
onStatus?.('ready')
|
||||
}
|
||||
} catch (err: any) {
|
||||
logger.error(`[AgentClients] ${this.name}: error handling message: ${err.message}`)
|
||||
this.stopTyping(roomId)
|
||||
this.deleteSession(sessionId).catch(() => { })
|
||||
onStatus?.('ready')
|
||||
}
|
||||
}
|
||||
|
||||
private bindEvents(): void {
|
||||
const s = this.socket!
|
||||
|
||||
s.on('typing', (data: any) => {
|
||||
this.handlers.onTyping?.(data)
|
||||
})
|
||||
|
||||
s.on('stop_typing', (data: any) => {
|
||||
this.handlers.onStopTyping?.(data)
|
||||
})
|
||||
|
||||
s.on('member_joined', (data: any) => {
|
||||
this.handlers.onMemberJoined?.(data)
|
||||
})
|
||||
|
||||
s.on('member_left', (data: any) => {
|
||||
this.handlers.onMemberLeft?.(data)
|
||||
})
|
||||
|
||||
// Auto rejoin rooms on reconnect
|
||||
s.io.on('reconnect', async () => {
|
||||
if (this._reconnecting) return
|
||||
this._reconnecting = true
|
||||
logger.info(`[AgentClients] ${this.name} reconnecting, rejoining ${this.joinedRooms.size} rooms...`)
|
||||
const rooms = Array.from(this.joinedRooms)
|
||||
for (const roomId of rooms) {
|
||||
try {
|
||||
await this.joinRoom(roomId)
|
||||
} catch (err: any) {
|
||||
logger.error(`[AgentClients] ${this.name} failed to rejoin room ${roomId}: ${err.message}`)
|
||||
}
|
||||
}
|
||||
this._reconnecting = false
|
||||
})
|
||||
}
|
||||
}
|
||||
|
||||
// ─── AgentClients (roomId -> agents) ──────────────────────────
|
||||
|
||||
export class AgentClients {
|
||||
private rooms = new Map<string, Map<string, AgentClient>>()
|
||||
private _gatewayManager: GatewayManager | null = null
|
||||
private _contextEngine: any = null
|
||||
private _storage: any = null
|
||||
|
||||
// Per-room processing lock + mention queue
|
||||
private _processingRooms = new Set<string>()
|
||||
private _mentionQueue = new Map<string, Array<{ agent: AgentClient; msg: { content: string; senderName: string; senderId: string; timestamp: number } }>>()
|
||||
|
||||
/**
|
||||
* Create an agent client and connect it to the server.
|
||||
* The agent will NOT auto-join any room — call addAgentToRoom separately.
|
||||
*/
|
||||
async createAgent(config: AgentConfig, handlers?: AgentEventHandler, port?: number): Promise<AgentClient> {
|
||||
const client = new AgentClient(config, handlers)
|
||||
await client.connect(port)
|
||||
|
||||
// Auto-apply stored references (fixes propagation for agents created after set*)
|
||||
if (this._gatewayManager) client.setGatewayManager(this._gatewayManager)
|
||||
if (this._contextEngine) client.setContextEngine(this._contextEngine)
|
||||
if (this._storage) client.setStorage(this._storage)
|
||||
|
||||
logger.info(`[AgentClients] Connected: ${client.name} (${client.agentId})`)
|
||||
return client
|
||||
}
|
||||
|
||||
/**
|
||||
* Connect an agent to a room.
|
||||
*/
|
||||
async addAgentToRoom(roomId: string, client: AgentClient): Promise<JoinResult> {
|
||||
let room = this.rooms.get(roomId)
|
||||
if (!room) {
|
||||
room = new Map()
|
||||
this.rooms.set(roomId, room)
|
||||
}
|
||||
|
||||
room.set(client.agentId, client)
|
||||
const result = await client.joinRoom(roomId)
|
||||
logger.info(`[AgentClients] ${client.name} joined room: ${roomId}`)
|
||||
return result
|
||||
}
|
||||
|
||||
/**
|
||||
* Remove an agent from a room and disconnect it.
|
||||
*/
|
||||
removeAgentFromRoom(roomId: string, agentId: string): void {
|
||||
const room = this.rooms.get(roomId)
|
||||
if (!room) return
|
||||
|
||||
const client = room.get(agentId)
|
||||
if (client) {
|
||||
client.disconnect()
|
||||
room.delete(agentId)
|
||||
logger.info(`[AgentClients] ${client.name} left room: ${roomId}`)
|
||||
|
||||
// Invalidate context engine cache for this agent
|
||||
if (this._contextEngine) {
|
||||
try { this._contextEngine.invalidateRoom(roomId) } catch { /* ignore */ }
|
||||
}
|
||||
}
|
||||
|
||||
if (room.size === 0) {
|
||||
this.rooms.delete(roomId)
|
||||
}
|
||||
}
|
||||
|
||||
/**
|
||||
* Get all agents in a room.
|
||||
*/
|
||||
getAgents(roomId: string): AgentClient[] {
|
||||
const room = this.rooms.get(roomId)
|
||||
return room ? Array.from(room.values()) : []
|
||||
}
|
||||
|
||||
/**
|
||||
* Get a specific agent in a room.
|
||||
*/
|
||||
getAgent(roomId: string, agentId: string): AgentClient | undefined {
|
||||
return this.rooms.get(roomId)?.get(agentId)
|
||||
}
|
||||
|
||||
/**
|
||||
* Get all room IDs that have agents.
|
||||
*/
|
||||
getRoomIds(): string[] {
|
||||
return Array.from(this.rooms.keys())
|
||||
}
|
||||
|
||||
/**
|
||||
* Send a message from a specific agent in a room.
|
||||
*/
|
||||
async sendMessage(roomId: string, agentId: string, content: string): Promise<string> {
|
||||
const client = this.getAgent(roomId, agentId)
|
||||
if (!client) {
|
||||
throw new Error(`Agent "${agentId}" not found in room "${roomId}"`)
|
||||
}
|
||||
return client.sendMessage(roomId, content)
|
||||
}
|
||||
|
||||
/**
|
||||
* Broadcast a message from all agents in a room.
|
||||
*/
|
||||
async broadcastFromRoom(roomId: string, content: string): Promise<string[]> {
|
||||
const agents = this.getAgents(roomId)
|
||||
return Promise.all(agents.map((agent) => agent.sendMessage(roomId, content)))
|
||||
}
|
||||
|
||||
/**
|
||||
* Disconnect all agents in a room.
|
||||
*/
|
||||
disconnectRoom(roomId: string): void {
|
||||
const room = this.rooms.get(roomId)
|
||||
if (!room) return
|
||||
|
||||
room.forEach((client) => client.disconnect())
|
||||
this.rooms.delete(roomId)
|
||||
logger.info(`[AgentClients] All agents disconnected from room: ${roomId}`)
|
||||
|
||||
// Invalidate context engine cache for this room
|
||||
if (this._contextEngine) {
|
||||
try { this._contextEngine.invalidateRoom(roomId) } catch { /* ignore */ }
|
||||
}
|
||||
}
|
||||
|
||||
/**
|
||||
* Disconnect all agents in all rooms.
|
||||
*/
|
||||
disconnectAll(): void {
|
||||
this.rooms.forEach((room) => {
|
||||
room.forEach((client) => client.disconnect())
|
||||
})
|
||||
this.rooms.clear()
|
||||
logger.info('[AgentClients] All agents disconnected')
|
||||
}
|
||||
|
||||
/**
|
||||
* Set gateway manager for all existing and future agents.
|
||||
*/
|
||||
setGatewayManager(manager: GatewayManager): void {
|
||||
this._gatewayManager = manager
|
||||
this.rooms.forEach((room) => {
|
||||
room.forEach((client) => client.setGatewayManager(manager))
|
||||
})
|
||||
}
|
||||
|
||||
/**
|
||||
* Set context engine for all existing and future agents.
|
||||
*/
|
||||
setContextEngine(engine: any): void {
|
||||
this._contextEngine = engine
|
||||
this.rooms.forEach((room) => {
|
||||
room.forEach((client) => client.setContextEngine(engine))
|
||||
})
|
||||
}
|
||||
|
||||
/**
|
||||
* Set message storage for all existing and future agents.
|
||||
*/
|
||||
setStorage(storage: any): void {
|
||||
this._storage = storage
|
||||
this.rooms.forEach((room) => {
|
||||
room.forEach((client) => client.setStorage(storage))
|
||||
})
|
||||
}
|
||||
|
||||
|
||||
/**
|
||||
* Server-side: parse @mentions and forward to matching agents directly.
|
||||
* If the room is already processing (compressing/replying), queue the mention.
|
||||
*/
|
||||
async processMentions(roomId: string, msg: { content: string; senderName: string; senderId: string; timestamp: number }): Promise<void> {
|
||||
if (!this._gatewayManager) return
|
||||
|
||||
const content = msg.content.toLowerCase()
|
||||
const agents = this.getAgents(roomId)
|
||||
|
||||
const mentioned = agents.filter(a => content.includes(`@${a.name.toLowerCase()}`))
|
||||
if (mentioned.length === 0) return
|
||||
|
||||
logger.debug(`[AgentClients] ${mentioned.map(a => a.name).join(', ')} mentioned by ${msg.senderName}`)
|
||||
|
||||
for (const agent of mentioned) {
|
||||
this._processAgentMention(roomId, agent, msg).catch((err) => {
|
||||
logger.error(`[AgentClients] error processing mention for ${agent.name}: ${err.message}`)
|
||||
})
|
||||
}
|
||||
}
|
||||
|
||||
/**
|
||||
* Process a single agent mention with status reporting and queue drain.
|
||||
*/
|
||||
private async _processAgentMention(
|
||||
roomId: string,
|
||||
agent: AgentClient,
|
||||
msg: { content: string; senderName: string; senderId: string; timestamp: number },
|
||||
): Promise<void> {
|
||||
const agentKey = `${roomId}:${agent.name}`
|
||||
if (this._processingRooms.has(agentKey)) {
|
||||
// Queue for this specific agent
|
||||
let queue = this._mentionQueue.get(agentKey)
|
||||
if (!queue) {
|
||||
queue = []
|
||||
this._mentionQueue.set(agentKey, queue)
|
||||
}
|
||||
queue.push({ agent, msg })
|
||||
logger.debug(`[AgentClients] agent ${agent.name} is processing, queued mention in room ${roomId}`)
|
||||
return
|
||||
}
|
||||
|
||||
this._processingRooms.add(agentKey)
|
||||
const onStatus = (status: 'compressing' | 'replying' | 'ready') => {
|
||||
agent.emitContextStatus(roomId, status)
|
||||
logger.debug(`[AgentClients] room ${roomId} agent ${agent.name} status: ${status}`)
|
||||
}
|
||||
|
||||
try {
|
||||
await agent.replyToMention(roomId, msg, onStatus)
|
||||
} finally {
|
||||
this._processingRooms.delete(agentKey)
|
||||
await this._drainQueue(agentKey, roomId)
|
||||
}
|
||||
}
|
||||
|
||||
/**
|
||||
* Drain queued mentions for a room after processing completes.
|
||||
*/
|
||||
private async _drainQueue(agentKey: string, roomId: string): Promise<void> {
|
||||
const queue = this._mentionQueue.get(agentKey)
|
||||
if (!queue || queue.length === 0) return
|
||||
|
||||
this._mentionQueue.delete(agentKey)
|
||||
logger.debug(`[AgentClients] draining ${queue.length} queued mention(s) for ${agentKey}`)
|
||||
|
||||
// Process the last queued mention only (most recent, discards stale intermediate ones)
|
||||
const last = queue[queue.length - 1]
|
||||
this._processingRooms.add(agentKey)
|
||||
this._processAgentMention(roomId, last.agent, last.msg).catch((err) => {
|
||||
logger.error(`[AgentClients] error processing queued mention: ${err.message}`)
|
||||
})
|
||||
}
|
||||
}
|
||||
@@ -0,0 +1,866 @@
|
||||
import { Server, Socket, Namespace } from 'socket.io'
|
||||
import type { Server as HttpServer } from 'http'
|
||||
import { getToken } from '../../../services/auth'
|
||||
import { logger } from '../../../services/logger'
|
||||
import { getDb, ensureTable } from '../../../db'
|
||||
import { AgentClients } from './agent-clients'
|
||||
import { deleteSession as hermesDeleteSession } from '../hermes-cli'
|
||||
import { ContextEngine } from '../context-engine/compressor'
|
||||
|
||||
// ─── Types ────────────────────────────────────────────────────
|
||||
|
||||
interface ChatMessage {
|
||||
id: string
|
||||
roomId: string
|
||||
senderId: string
|
||||
senderName: string
|
||||
content: string
|
||||
timestamp: number
|
||||
}
|
||||
|
||||
interface RoomAgent {
|
||||
id: string
|
||||
roomId: string
|
||||
agentId: string
|
||||
profile: string
|
||||
name: string
|
||||
description: string
|
||||
invited: number
|
||||
}
|
||||
|
||||
interface Member {
|
||||
id: string
|
||||
userId: string
|
||||
name: string
|
||||
description: string
|
||||
joinedAt: number
|
||||
online: boolean
|
||||
socketId: string
|
||||
}
|
||||
|
||||
// ─── SQLite Storage (global DB) ──────────────────────────────
|
||||
|
||||
const GC_PENDING_SESSION_DELETES_SCHEMA: Record<string, string> = {
|
||||
session_id: 'TEXT PRIMARY KEY',
|
||||
profile_name: 'TEXT NOT NULL',
|
||||
status: "TEXT NOT NULL DEFAULT 'pending'",
|
||||
attempt_count: 'INTEGER NOT NULL DEFAULT 0',
|
||||
last_error: 'TEXT',
|
||||
created_at: 'INTEGER NOT NULL',
|
||||
updated_at: 'INTEGER NOT NULL',
|
||||
next_attempt_at: 'INTEGER NOT NULL DEFAULT 0',
|
||||
}
|
||||
|
||||
const GC_SESSION_PROFILES_SCHEMA: Record<string, string> = {
|
||||
session_id: 'TEXT PRIMARY KEY',
|
||||
room_id: 'TEXT NOT NULL',
|
||||
agent_id: 'TEXT NOT NULL',
|
||||
profile_name: 'TEXT NOT NULL',
|
||||
created_at: 'INTEGER NOT NULL',
|
||||
}
|
||||
|
||||
const GC_ROOMS_SCHEMA: Record<string, string> = {
|
||||
id: 'TEXT PRIMARY KEY',
|
||||
name: 'TEXT NOT NULL',
|
||||
inviteCode: 'TEXT UNIQUE',
|
||||
triggerTokens: 'INTEGER NOT NULL DEFAULT 100000',
|
||||
maxHistoryTokens: 'INTEGER NOT NULL DEFAULT 32000',
|
||||
tailMessageCount: 'INTEGER NOT NULL DEFAULT 20',
|
||||
totalTokens: 'INTEGER NOT NULL DEFAULT 0',
|
||||
}
|
||||
|
||||
const GC_MESSAGES_SCHEMA: Record<string, string> = {
|
||||
id: 'TEXT PRIMARY KEY',
|
||||
roomId: 'TEXT NOT NULL',
|
||||
senderId: 'TEXT NOT NULL',
|
||||
senderName: 'TEXT NOT NULL',
|
||||
content: 'TEXT NOT NULL',
|
||||
timestamp: 'INTEGER NOT NULL',
|
||||
}
|
||||
|
||||
const GC_ROOM_AGENTS_SCHEMA: Record<string, string> = {
|
||||
id: 'TEXT PRIMARY KEY',
|
||||
roomId: 'TEXT NOT NULL',
|
||||
agentId: 'TEXT NOT NULL',
|
||||
profile: 'TEXT NOT NULL',
|
||||
name: 'TEXT NOT NULL',
|
||||
description: "TEXT NOT NULL DEFAULT ''",
|
||||
invited: 'INTEGER NOT NULL DEFAULT 0',
|
||||
}
|
||||
|
||||
const GC_CONTEXT_SNAPSHOTS_SCHEMA: Record<string, string> = {
|
||||
roomId: 'TEXT PRIMARY KEY',
|
||||
summary: 'TEXT NOT NULL DEFAULT \'\'',
|
||||
lastMessageId: 'TEXT NOT NULL',
|
||||
lastMessageTimestamp: 'INTEGER NOT NULL',
|
||||
updatedAt: 'INTEGER NOT NULL',
|
||||
}
|
||||
|
||||
const GC_ROOM_MEMBERS_SCHEMA: Record<string, string> = {
|
||||
id: 'TEXT PRIMARY KEY',
|
||||
roomId: 'TEXT NOT NULL',
|
||||
userId: 'TEXT NOT NULL',
|
||||
userName: 'TEXT NOT NULL',
|
||||
description: "TEXT NOT NULL DEFAULT ''",
|
||||
joinedAt: 'INTEGER NOT NULL',
|
||||
updatedAt: 'INTEGER NOT NULL',
|
||||
}
|
||||
|
||||
let _tablesEnsured = false
|
||||
|
||||
interface PendingSessionDelete {
|
||||
session_id: string
|
||||
profile_name: string
|
||||
status: string
|
||||
attempt_count: number
|
||||
last_error: string | null
|
||||
created_at: number
|
||||
updated_at: number
|
||||
next_attempt_at: number
|
||||
}
|
||||
|
||||
interface GroupChatSessionProfile {
|
||||
session_id: string
|
||||
room_id: string
|
||||
agent_id: string
|
||||
profile_name: string
|
||||
created_at: number
|
||||
}
|
||||
|
||||
export interface PendingSessionDeleteDrainResult {
|
||||
deleted: string[]
|
||||
failed: Array<{ sessionId: string; error: string }>
|
||||
}
|
||||
|
||||
class ChatStorage {
|
||||
private db() { return getDb() }
|
||||
|
||||
init(): void {
|
||||
if (_tablesEnsured) return
|
||||
const db = this.db()
|
||||
if (!db) return
|
||||
ensureTable('gc_rooms', GC_ROOMS_SCHEMA)
|
||||
ensureTable('gc_messages', GC_MESSAGES_SCHEMA)
|
||||
ensureTable('gc_room_agents', GC_ROOM_AGENTS_SCHEMA)
|
||||
ensureTable('gc_context_snapshots', GC_CONTEXT_SNAPSHOTS_SCHEMA)
|
||||
ensureTable('gc_room_members', GC_ROOM_MEMBERS_SCHEMA)
|
||||
ensureTable('gc_pending_session_deletes', GC_PENDING_SESSION_DELETES_SCHEMA)
|
||||
ensureTable('gc_session_profiles', GC_SESSION_PROFILES_SCHEMA)
|
||||
// Indexes (safe to run multiple times — CREATE INDEX IF NOT EXISTS)
|
||||
try { db.exec('CREATE INDEX IF NOT EXISTS idx_gc_messages_room ON gc_messages(roomId, timestamp)') } catch { /* ignore */ }
|
||||
try { db.exec('CREATE INDEX IF NOT EXISTS idx_gc_room_agents_room ON gc_room_agents(roomId)') } catch { /* ignore */ }
|
||||
try { db.exec('CREATE UNIQUE INDEX IF NOT EXISTS idx_gc_room_members_unique ON gc_room_members(roomId, userId)') } catch { /* ignore */ }
|
||||
try { db.exec('CREATE INDEX IF NOT EXISTS idx_gc_pending_session_deletes_profile ON gc_pending_session_deletes(profile_name, status, next_attempt_at, created_at)') } catch { /* ignore */ }
|
||||
try { db.exec('CREATE INDEX IF NOT EXISTS idx_gc_session_profiles_profile ON gc_session_profiles(profile_name, created_at)') } catch { /* ignore */ }
|
||||
_tablesEnsured = true
|
||||
}
|
||||
|
||||
saveSessionProfile(sessionId: string, roomId: string, agentId: string, profileName: string): void {
|
||||
this.db()?.prepare(
|
||||
'INSERT INTO gc_session_profiles (session_id, room_id, agent_id, profile_name, created_at) VALUES (?, ?, ?, ?, ?) ON CONFLICT(session_id) DO UPDATE SET room_id = excluded.room_id, agent_id = excluded.agent_id, profile_name = excluded.profile_name'
|
||||
).run(sessionId, roomId, agentId, profileName, Date.now())
|
||||
}
|
||||
|
||||
getSessionProfile(sessionId: string): GroupChatSessionProfile | null {
|
||||
return (this.db()?.prepare(
|
||||
'SELECT session_id, room_id, agent_id, profile_name, created_at FROM gc_session_profiles WHERE session_id = ?'
|
||||
).get(sessionId) as GroupChatSessionProfile | undefined) ?? null
|
||||
}
|
||||
|
||||
deleteSessionProfile(sessionId: string): void {
|
||||
this.db()?.prepare('DELETE FROM gc_session_profiles WHERE session_id = ?').run(sessionId)
|
||||
}
|
||||
|
||||
listPendingSessionDeletes(profileName: string, limit = 50): PendingSessionDelete[] {
|
||||
const rows = this.db()?.prepare(
|
||||
`SELECT session_id, profile_name, status, attempt_count, last_error, created_at, updated_at, next_attempt_at
|
||||
FROM gc_pending_session_deletes
|
||||
WHERE profile_name = ? AND status = 'pending' AND next_attempt_at <= ?
|
||||
ORDER BY created_at ASC
|
||||
LIMIT ?`
|
||||
).all(profileName, Date.now(), limit) || []
|
||||
return rows.map((row: any) => ({
|
||||
session_id: String(row.session_id || ''),
|
||||
profile_name: String(row.profile_name || ''),
|
||||
status: String(row.status || 'pending'),
|
||||
attempt_count: Number(row.attempt_count || 0),
|
||||
last_error: row.last_error == null ? null : String(row.last_error),
|
||||
created_at: Number(row.created_at || 0),
|
||||
updated_at: Number(row.updated_at || 0),
|
||||
next_attempt_at: Number(row.next_attempt_at || 0),
|
||||
}))
|
||||
}
|
||||
|
||||
enqueuePendingSessionDelete(sessionId: string, profileName: string): void {
|
||||
const now = Date.now()
|
||||
this.db()?.prepare(
|
||||
`INSERT INTO gc_pending_session_deletes (session_id, profile_name, status, attempt_count, last_error, created_at, updated_at, next_attempt_at)
|
||||
VALUES (?, ?, 'pending', 0, NULL, ?, ?, 0)
|
||||
ON CONFLICT(session_id) DO UPDATE SET
|
||||
profile_name = excluded.profile_name,
|
||||
status = 'pending',
|
||||
updated_at = excluded.updated_at,
|
||||
next_attempt_at = 0`
|
||||
).run(sessionId, profileName, now, now)
|
||||
}
|
||||
|
||||
claimPendingSessionDeletes(profileName: string, limit = 50): PendingSessionDelete[] {
|
||||
const rows = this.listPendingSessionDeletes(profileName, limit)
|
||||
if (rows.length === 0) return []
|
||||
const now = Date.now()
|
||||
const stmt = this.db()?.prepare(
|
||||
`UPDATE gc_pending_session_deletes
|
||||
SET status = 'processing', updated_at = ?
|
||||
WHERE session_id = ? AND status = 'pending'`
|
||||
)
|
||||
const claimed: PendingSessionDelete[] = []
|
||||
for (const row of rows) {
|
||||
const result = stmt?.run(now, row.session_id)
|
||||
if (result?.changes) {
|
||||
claimed.push({ ...row, status: 'processing', updated_at: now })
|
||||
}
|
||||
}
|
||||
return claimed
|
||||
}
|
||||
|
||||
markPendingSessionDeleteFailed(sessionId: string, error: string): void {
|
||||
const now = Date.now()
|
||||
this.db()?.prepare(
|
||||
`UPDATE gc_pending_session_deletes
|
||||
SET status = 'pending',
|
||||
attempt_count = attempt_count + 1,
|
||||
last_error = ?,
|
||||
updated_at = ?,
|
||||
next_attempt_at = ?
|
||||
WHERE session_id = ?`
|
||||
).run(error, now, now + 60_000, sessionId)
|
||||
}
|
||||
|
||||
removePendingSessionDelete(sessionId: string): void {
|
||||
this.db()?.prepare('DELETE FROM gc_pending_session_deletes WHERE session_id = ?').run(sessionId)
|
||||
}
|
||||
|
||||
getPendingDeletedSessionIds(): Set<string> {
|
||||
const rows = (this.db()?.prepare(
|
||||
`SELECT session_id FROM gc_pending_session_deletes WHERE status IN ('pending', 'processing')`
|
||||
).all() || []) as Array<{ session_id: string }>
|
||||
return new Set(rows.map(row => row.session_id))
|
||||
}
|
||||
|
||||
// ─── Rooms ────────────────────────────────────────────────
|
||||
|
||||
getRoom(roomId: string): { id: string; name: string; inviteCode: string | null; triggerTokens: number; maxHistoryTokens: number; tailMessageCount: number; totalTokens: number } | undefined {
|
||||
return this.db()?.prepare('SELECT id, name, inviteCode, triggerTokens, maxHistoryTokens, tailMessageCount, totalTokens FROM gc_rooms WHERE id = ?').get(roomId) as any
|
||||
}
|
||||
|
||||
getRoomByInviteCode(code: string): { id: string; name: string; inviteCode: string | null; triggerTokens: number; maxHistoryTokens: number; tailMessageCount: number; totalTokens: number } | undefined {
|
||||
return this.db()?.prepare('SELECT id, name, inviteCode, triggerTokens, maxHistoryTokens, tailMessageCount, totalTokens FROM gc_rooms WHERE inviteCode = ?').get(code) as any
|
||||
}
|
||||
|
||||
getAllRooms(): { id: string; name: string; inviteCode: string | null; triggerTokens: number; maxHistoryTokens: number; tailMessageCount: number; totalTokens: number }[] {
|
||||
return (this.db()?.prepare('SELECT id, name, inviteCode, triggerTokens, maxHistoryTokens, tailMessageCount, totalTokens FROM gc_rooms ORDER BY id').all() || []) as any[]
|
||||
}
|
||||
|
||||
saveRoom(id: string, name: string, inviteCode?: string, config?: { triggerTokens?: number; maxHistoryTokens?: number; tailMessageCount?: number }): void {
|
||||
this.db()?.prepare(
|
||||
'INSERT OR IGNORE INTO gc_rooms (id, name, inviteCode, triggerTokens, maxHistoryTokens, tailMessageCount) VALUES (?, ?, ?, ?, ?, ?)'
|
||||
).run(id, name, inviteCode || null, config?.triggerTokens ?? 100000, config?.maxHistoryTokens ?? 32000, config?.tailMessageCount ?? 20)
|
||||
}
|
||||
|
||||
updateRoomConfig(roomId: string, config: { triggerTokens?: number; maxHistoryTokens?: number; tailMessageCount?: number }): void {
|
||||
const sets: string[] = []
|
||||
const vals: any[] = []
|
||||
if (config.triggerTokens !== undefined) { sets.push('triggerTokens = ?'); vals.push(config.triggerTokens) }
|
||||
if (config.maxHistoryTokens !== undefined) { sets.push('maxHistoryTokens = ?'); vals.push(config.maxHistoryTokens) }
|
||||
if (config.tailMessageCount !== undefined) { sets.push('tailMessageCount = ?'); vals.push(config.tailMessageCount) }
|
||||
if (sets.length === 0) return
|
||||
vals.push(roomId)
|
||||
this.db()?.prepare(`UPDATE gc_rooms SET ${sets.join(', ')} WHERE id = ?`).run(...vals)
|
||||
}
|
||||
|
||||
updateRoomInviteCode(roomId: string, inviteCode: string): void {
|
||||
this.db()?.prepare('UPDATE gc_rooms SET inviteCode = ? WHERE id = ?').run(inviteCode, roomId)
|
||||
}
|
||||
|
||||
updateRoomTotalTokens(roomId: string, tokens: number): void {
|
||||
this.db()?.prepare('UPDATE gc_rooms SET totalTokens = ? WHERE id = ?').run(tokens, roomId)
|
||||
}
|
||||
|
||||
estimateTokens(text: string): number {
|
||||
const cjk = (text.match(/[\u2e80-\u9fff\uac00-\ud7af\u3000-\u303f\uff00-\uffef]/g) || []).length
|
||||
const other = text.length - cjk
|
||||
return Math.ceil(cjk * 1.5 + other / 4)
|
||||
}
|
||||
|
||||
// ─── Messages ─────────────────────────────────────────────
|
||||
|
||||
getMessages(roomId: string, limit = 500): ChatMessage[] {
|
||||
const rows = (this.db()?.prepare(
|
||||
'SELECT id, roomId, senderId, senderName, content, timestamp FROM gc_messages WHERE roomId = ? ORDER BY timestamp DESC LIMIT ?'
|
||||
).all(roomId, limit) || []) as any[]
|
||||
return rows.reverse()
|
||||
}
|
||||
|
||||
addMessage(msg: ChatMessage): void {
|
||||
this.db()?.prepare(
|
||||
'INSERT INTO gc_messages (id, roomId, senderId, senderName, content, timestamp) VALUES (?, ?, ?, ?, ?, ?)'
|
||||
).run(msg.id, msg.roomId, msg.senderId, msg.senderName, msg.content, msg.timestamp)
|
||||
}
|
||||
|
||||
pruneMessages(roomId: string, keep = 500): void {
|
||||
const db = this.db()
|
||||
if (!db) return
|
||||
const count = (db.prepare('SELECT COUNT(*) as c FROM gc_messages WHERE roomId = ?').get(roomId) as any)?.c
|
||||
if (count > keep) {
|
||||
const cutoff = db.prepare(
|
||||
'SELECT timestamp FROM gc_messages WHERE roomId = ? ORDER BY timestamp DESC LIMIT 1 OFFSET ?'
|
||||
).get(roomId, keep - 1) as any
|
||||
if (cutoff) {
|
||||
const result = db.prepare('DELETE FROM gc_messages WHERE roomId = ? AND timestamp < ?').run(roomId, cutoff.timestamp)
|
||||
logger.info(`[GroupChat] pruned ${result.changes} messages from room ${roomId} (had ${count}, keeping ${keep})`)
|
||||
}
|
||||
}
|
||||
}
|
||||
|
||||
// ─── Room Agents ──────────────────────────────────────────
|
||||
|
||||
getRoomAgents(roomId: string): RoomAgent[] {
|
||||
return (this.db()?.prepare(
|
||||
'SELECT id, roomId, agentId, profile, name, description, invited FROM gc_room_agents WHERE roomId = ?'
|
||||
).all(roomId) || []) as unknown as RoomAgent[]
|
||||
}
|
||||
|
||||
addRoomAgent(roomId: string, agentId: string, profile: string, name: string, description: string, invited: number): RoomAgent {
|
||||
const id = Date.now().toString(36) + Math.random().toString(36).slice(2, 8)
|
||||
this.db()?.prepare(
|
||||
'INSERT INTO gc_room_agents (id, roomId, agentId, profile, name, description, invited) VALUES (?, ?, ?, ?, ?, ?, ?)'
|
||||
).run(id, roomId, agentId, profile, name, description, invited)
|
||||
return { id, roomId, agentId, profile, name, description, invited }
|
||||
}
|
||||
|
||||
removeRoomAgent(agentId: string): void {
|
||||
this.db()?.prepare('DELETE FROM gc_room_agents WHERE id = ?').run(agentId)
|
||||
}
|
||||
|
||||
// ─── Context Snapshots ──────────────────────────────────
|
||||
|
||||
getContextSnapshot(roomId: string): { roomId: string; summary: string; lastMessageId: string; lastMessageTimestamp: number; updatedAt: number } | null {
|
||||
return (this.db()?.prepare(
|
||||
'SELECT roomId, summary, lastMessageId, lastMessageTimestamp, updatedAt FROM gc_context_snapshots WHERE roomId = ?'
|
||||
).get(roomId) as any) ?? null
|
||||
}
|
||||
|
||||
saveContextSnapshot(roomId: string, summary: string, lastMessageId: string, lastMessageTimestamp: number): void {
|
||||
this.db()?.prepare(
|
||||
'INSERT INTO gc_context_snapshots (roomId, summary, lastMessageId, lastMessageTimestamp, updatedAt) VALUES (?, ?, ?, ?, ?) ON CONFLICT(roomId) DO UPDATE SET summary = excluded.summary, lastMessageId = excluded.lastMessageId, lastMessageTimestamp = excluded.lastMessageTimestamp, updatedAt = excluded.updatedAt'
|
||||
).run(roomId, summary, lastMessageId, lastMessageTimestamp, Date.now())
|
||||
}
|
||||
|
||||
deleteContextSnapshot(roomId: string): void {
|
||||
this.db()?.prepare('DELETE FROM gc_context_snapshots WHERE roomId = ?').run(roomId)
|
||||
}
|
||||
|
||||
deleteRoom(roomId: string): void {
|
||||
const db = this.db()
|
||||
if (!db) return
|
||||
db.prepare('DELETE FROM gc_messages WHERE roomId = ?').run(roomId)
|
||||
db.prepare('DELETE FROM gc_room_agents WHERE roomId = ?').run(roomId)
|
||||
db.prepare('DELETE FROM gc_room_members WHERE roomId = ?').run(roomId)
|
||||
db.prepare('DELETE FROM gc_context_snapshots WHERE roomId = ?').run(roomId)
|
||||
db.prepare('DELETE FROM gc_rooms WHERE id = ?').run(roomId)
|
||||
}
|
||||
|
||||
// ─── Room Members ──────────────────────────────────────
|
||||
|
||||
getRoomMembers(roomId: string): { id: string; userId: string; name: string; description: string; joinedAt: number }[] {
|
||||
return (this.db()?.prepare(
|
||||
'SELECT id, userId, userName as name, description, joinedAt FROM gc_room_members WHERE roomId = ? ORDER BY joinedAt'
|
||||
).all(roomId) || []) as unknown as { id: string; userId: string; name: string; description: string; joinedAt: number }[]
|
||||
}
|
||||
|
||||
addRoomMember(roomId: string, userId: string, userName: string, description: string): void {
|
||||
const existing = this.getMemberByUserId(roomId, userId)
|
||||
if (existing) {
|
||||
// Update name/description on rejoin, refresh updatedAt
|
||||
this.db()?.prepare(
|
||||
'UPDATE gc_room_members SET userName = ?, description = ?, updatedAt = ? WHERE roomId = ? AND userId = ?'
|
||||
).run(userName, description, Date.now(), roomId, userId)
|
||||
return
|
||||
}
|
||||
const id = Date.now().toString(36) + Math.random().toString(36).slice(2, 8)
|
||||
const now = Date.now()
|
||||
this.db()?.prepare(
|
||||
'INSERT INTO gc_room_members (id, roomId, userId, userName, description, joinedAt, updatedAt) VALUES (?, ?, ?, ?, ?, ?, ?)'
|
||||
).run(id, roomId, userId, userName, description, now, now)
|
||||
}
|
||||
|
||||
getMemberByUserId(roomId: string, userId: string): Member | null {
|
||||
return (this.db()?.prepare(
|
||||
'SELECT id, userId, userName as name, description, joinedAt FROM gc_room_members WHERE roomId = ? AND userId = ?'
|
||||
).get(roomId, userId) as any) ?? null
|
||||
}
|
||||
|
||||
updateMemberActivity(roomId: string, userId: string): void {
|
||||
this.db()?.prepare(
|
||||
'UPDATE gc_room_members SET updatedAt = ? WHERE roomId = ? AND userId = ?'
|
||||
).run(Date.now(), roomId, userId)
|
||||
}
|
||||
}
|
||||
|
||||
export async function drainPendingSessionDeletes(profileName: string): Promise<PendingSessionDeleteDrainResult> {
|
||||
const storage = new ChatStorage()
|
||||
storage.init()
|
||||
const claimed = storage.claimPendingSessionDeletes(profileName)
|
||||
const result: PendingSessionDeleteDrainResult = { deleted: [], failed: [] }
|
||||
|
||||
for (const item of claimed) {
|
||||
try {
|
||||
const ok = await hermesDeleteSession(item.session_id)
|
||||
if (!ok) {
|
||||
throw new Error('Failed to delete session')
|
||||
}
|
||||
storage.removePendingSessionDelete(item.session_id)
|
||||
storage.deleteSessionProfile(item.session_id)
|
||||
result.deleted.push(item.session_id)
|
||||
} catch (err: any) {
|
||||
const message = err?.message || 'Failed to delete session'
|
||||
storage.markPendingSessionDeleteFailed(item.session_id, message)
|
||||
result.failed.push({ sessionId: item.session_id, error: message })
|
||||
}
|
||||
}
|
||||
|
||||
return result
|
||||
}
|
||||
|
||||
// ─── ChatRoom (in-memory, for online members) ─────────────────
|
||||
|
||||
class ChatRoom {
|
||||
readonly id: string
|
||||
name: string
|
||||
readonly members = new Map<string, Member>()
|
||||
|
||||
constructor(id: string, name?: string) {
|
||||
this.id = id
|
||||
this.name = name || id
|
||||
}
|
||||
|
||||
addOrUpdateMember(socketId: string, userId: string, name: string, description: string): Member {
|
||||
const existing = this.members.get(userId)
|
||||
if (existing) {
|
||||
existing.name = name
|
||||
existing.description = description
|
||||
existing.online = true
|
||||
existing.socketId = socketId
|
||||
return existing
|
||||
}
|
||||
const member: Member = { id: socketId, userId, name, description, joinedAt: Date.now(), online: true, socketId }
|
||||
this.members.set(userId, member)
|
||||
return member
|
||||
}
|
||||
|
||||
removeMember(socketId: string): void {
|
||||
for (const member of this.members.values()) {
|
||||
if (member.socketId === socketId) {
|
||||
member.online = false
|
||||
break
|
||||
}
|
||||
}
|
||||
}
|
||||
|
||||
getMembersList(): Member[] {
|
||||
return Array.from(this.members.values())
|
||||
}
|
||||
|
||||
getOnlineMemberBySocketId(socketId: string): Member | undefined {
|
||||
for (const member of this.members.values()) {
|
||||
if (member.socketId === socketId && member.online) return member
|
||||
}
|
||||
return undefined
|
||||
}
|
||||
|
||||
hasOnlineMember(socketId: string): boolean {
|
||||
return this.getOnlineMemberBySocketId(socketId) !== undefined
|
||||
}
|
||||
}
|
||||
|
||||
// ─── GroupChat Server ────────────────────────────────────────
|
||||
|
||||
export class GroupChatServer {
|
||||
private io: Server
|
||||
private nsp: Namespace
|
||||
private storage: ChatStorage
|
||||
private rooms = new Map<string, ChatRoom>()
|
||||
/** Map: socket.id → persistent userId */
|
||||
private socketUserMap = new Map<string, string>()
|
||||
/** Map: userId → { name, description } (from auth) */
|
||||
private userInfoMap = new Map<string, { name: string; description: string }>()
|
||||
readonly agentClients = new AgentClients()
|
||||
private _contextEngine: ContextEngine | null = null
|
||||
private _restoreScheduled = false
|
||||
/** roomId -> (userId -> { userName, timer }) */
|
||||
private typingState = new Map<string, Map<string, { userName: string; timer: ReturnType<typeof setTimeout> }>>()
|
||||
/** roomId -> (agentName -> { agentName, status }) */
|
||||
private contextStatusState = new Map<string, Map<string, { agentName: string; status: string }>>()
|
||||
|
||||
setGatewayManager(manager: any): void {
|
||||
this.agentClients.setGatewayManager(manager)
|
||||
if (this._contextEngine && manager) {
|
||||
this._contextEngine.setUpstream(manager.getUpstream(''), manager.getApiKey(''))
|
||||
}
|
||||
}
|
||||
|
||||
constructor(httpServer: HttpServer) {
|
||||
this.storage = new ChatStorage()
|
||||
this.storage.init()
|
||||
|
||||
this.io = new Server(httpServer, {
|
||||
cors: { origin: '*' }
|
||||
})
|
||||
this.nsp = this.io.of('/group-chat')
|
||||
this.nsp.use(this.authMiddleware.bind(this))
|
||||
this.nsp.on('connection', this.onConnection.bind(this))
|
||||
|
||||
// Restore persisted rooms into memory
|
||||
this.storage.getAllRooms().forEach((row) => {
|
||||
this.rooms.set(row.id, new ChatRoom(row.id, row.name))
|
||||
})
|
||||
|
||||
logger.info('[GroupChat] Socket.IO ready at /group-chat')
|
||||
|
||||
// Initialize context engine for group chat compression
|
||||
const contextEngine = new ContextEngine({
|
||||
messageFetcher: this.storage,
|
||||
sessionCleaner: async (sessionId: string) => {
|
||||
try {
|
||||
await hermesDeleteSession(sessionId)
|
||||
} catch (err: any) {
|
||||
logger.warn(`[GroupChat] failed to delete compression session ${sessionId}: ${err.message}`)
|
||||
}
|
||||
},
|
||||
})
|
||||
this.agentClients.setContextEngine(contextEngine)
|
||||
this.agentClients.setStorage(this.storage)
|
||||
this._contextEngine = contextEngine
|
||||
|
||||
// Restore agent connections — call restoreAgents() after server is listening
|
||||
this._restoreScheduled = false
|
||||
}
|
||||
|
||||
getIO(): Server {
|
||||
return this.io
|
||||
}
|
||||
|
||||
getStorage(): ChatStorage {
|
||||
return this.storage
|
||||
}
|
||||
|
||||
getContextEngine(): ContextEngine | null {
|
||||
return this._contextEngine || null
|
||||
}
|
||||
|
||||
getRoomIds(): string[] {
|
||||
return Array.from(this.rooms.keys())
|
||||
}
|
||||
|
||||
// ─── Restore Agents ─────────────────────────────────────────
|
||||
|
||||
/**
|
||||
* Restore persisted agent connections. Safe to call multiple times;
|
||||
* will only execute once.
|
||||
*/
|
||||
async restoreWhenReady(): Promise<void> {
|
||||
if (this._restoreScheduled) return
|
||||
this._restoreScheduled = true
|
||||
await this.restoreAgents()
|
||||
}
|
||||
|
||||
private async restoreAgents(): Promise<void> {
|
||||
const rooms = this.storage.getAllRooms()
|
||||
let total = 0
|
||||
|
||||
for (const room of rooms) {
|
||||
const agents = this.storage.getRoomAgents(room.id)
|
||||
for (const agent of agents) {
|
||||
try {
|
||||
const client = await this.agentClients.createAgent({
|
||||
profile: agent.profile,
|
||||
name: agent.name,
|
||||
description: agent.description,
|
||||
invited: agent.invited,
|
||||
})
|
||||
await this.agentClients.addAgentToRoom(room.id, client)
|
||||
total++
|
||||
} catch (err: any) {
|
||||
logger.error(`[GroupChat] Failed to restore agent ${agent.name} in room ${room.id}: ${err.message}`)
|
||||
}
|
||||
}
|
||||
}
|
||||
|
||||
if (total > 0) {
|
||||
logger.info(`[GroupChat] Restored ${total} agent(s) across ${rooms.length} room(s)`)
|
||||
}
|
||||
}
|
||||
|
||||
// ─── Auth ───────────────────────────────────────────────────
|
||||
|
||||
private async authMiddleware(socket: Socket, next: (err?: Error) => void): Promise<void> {
|
||||
const authToken = await getToken()
|
||||
const token = socket.handshake.auth.token || socket.handshake.query.token || ''
|
||||
if (authToken) {
|
||||
if (token !== authToken) {
|
||||
return next(new Error('Unauthorized'))
|
||||
}
|
||||
}
|
||||
next()
|
||||
}
|
||||
|
||||
// ─── Connection ─────────────────────────────────────────────
|
||||
|
||||
private onConnection(socket: Socket): void {
|
||||
const auth = socket.handshake.auth as { userId?: string; name?: string; description?: string }
|
||||
const userId = auth.userId || socket.id
|
||||
const userName = auth.name || `User-${userId.slice(0, 6)}`
|
||||
const description = auth.description || ''
|
||||
|
||||
this.socketUserMap.set(socket.id, userId)
|
||||
this.userInfoMap.set(userId, { name: userName, description })
|
||||
|
||||
logger.debug(`[GroupChat] Connected: ${userName} (socket=${socket.id}, user=${userId})`)
|
||||
|
||||
socket.on('join', (data: { roomId?: string; name?: string }, ack?: (response?: unknown) => void) => this.handleJoin(socket, data, ack))
|
||||
socket.on('message', (data: { roomId?: string; content: string }, ack?: (response?: unknown) => void) => this.handleMessage(socket, data, ack))
|
||||
socket.on('typing', (data: { roomId?: string }) => this.handleTyping(socket, data))
|
||||
socket.on('stop_typing', (data: { roomId?: string }) => this.handleStopTyping(socket, data))
|
||||
socket.on('context_status', (data: { roomId?: string; agentName?: string; status?: string }) => this.handleContextStatus(socket, data))
|
||||
socket.on('disconnect', () => this.handleDisconnect(socket))
|
||||
}
|
||||
|
||||
// ─── Handlers ───────────────────────────────────────────────
|
||||
|
||||
private handleJoin(socket: Socket, data: { roomId?: string; name?: string; description?: string }, ack?: (res: any) => void): void {
|
||||
const socketId = socket.id
|
||||
const userId = this.socketUserMap.get(socketId) || socketId
|
||||
const userInfo = this.userInfoMap.get(userId) || { name: `User-${userId.slice(0, 6)}`, description: '' }
|
||||
const userName = data.name || userInfo.name
|
||||
const description = data.description || userInfo.description
|
||||
|
||||
// Update stored user info
|
||||
this.userInfoMap.set(userId, { name: userName, description })
|
||||
|
||||
const roomId = data.roomId || 'general'
|
||||
let room = this.rooms.get(roomId)
|
||||
if (!room) {
|
||||
room = new ChatRoom(roomId)
|
||||
this.rooms.set(roomId, room)
|
||||
this.storage.saveRoom(roomId, roomId)
|
||||
}
|
||||
|
||||
// Persist member to SQLite
|
||||
this.storage.addRoomMember(roomId, userId, userName, description)
|
||||
|
||||
// Add to in-memory online members (keyed by userId)
|
||||
room.addOrUpdateMember(socketId, userId, userName, description)
|
||||
socket.join(roomId)
|
||||
|
||||
socket.to(roomId).emit('member_joined', {
|
||||
roomId,
|
||||
memberId: userId,
|
||||
memberName: userName,
|
||||
members: room.getMembersList(),
|
||||
})
|
||||
|
||||
// Load history from SQLite
|
||||
const messages = this.storage.getMessages(roomId)
|
||||
const agents = this.storage.getRoomAgents(roomId)
|
||||
|
||||
ack?.({
|
||||
roomId,
|
||||
roomName: room.name,
|
||||
members: room.getMembersList(),
|
||||
messages,
|
||||
agents,
|
||||
rooms: this.getRoomIds(),
|
||||
typingUsers: this.getTypingUsers(roomId),
|
||||
contextStatuses: this.getContextStatuses(roomId),
|
||||
})
|
||||
|
||||
logger.debug(`[GroupChat] ${userName} (user=${userId}) joined room: ${roomId}`)
|
||||
}
|
||||
|
||||
private handleMessage(socket: Socket, data: { roomId?: string; content: string }, ack?: (res: any) => void): void {
|
||||
const socketId = socket.id
|
||||
const roomId = data.roomId || 'general'
|
||||
const room = this.rooms.get(roomId)
|
||||
|
||||
if (!room || !room.hasOnlineMember(socketId)) {
|
||||
ack?.({ error: 'Not in room' })
|
||||
return
|
||||
}
|
||||
|
||||
const member = room.getOnlineMemberBySocketId(socketId)
|
||||
const userId = member?.userId || socketId
|
||||
const userName = member?.name || `User-${socketId.slice(0, 6)}`
|
||||
|
||||
const msg: ChatMessage = {
|
||||
id: this.generateId(),
|
||||
roomId,
|
||||
senderId: userId,
|
||||
senderName: userName,
|
||||
content: data.content,
|
||||
timestamp: Date.now(),
|
||||
}
|
||||
|
||||
this.storage.addMessage(msg)
|
||||
this.storage.pruneMessages(roomId)
|
||||
|
||||
// Recalculate total tokens for the room
|
||||
const messages = this.storage.getMessages(roomId)
|
||||
const totalTokens = this.storage.estimateTokens(messages.map(m => m.content + m.senderName).join(''))
|
||||
this.storage.updateRoomTotalTokens(roomId, totalTokens)
|
||||
|
||||
this.nsp.to(roomId).emit('message', msg)
|
||||
this.nsp.to(roomId).emit('room_updated', { roomId, totalTokens })
|
||||
ack?.({ id: msg.id })
|
||||
|
||||
// Server-side @mention routing — parse mentions and invoke agents directly
|
||||
this.agentClients.processMentions(roomId, {
|
||||
content: msg.content,
|
||||
senderName: msg.senderName,
|
||||
senderId: msg.senderId,
|
||||
timestamp: msg.timestamp,
|
||||
}).catch((err) => {
|
||||
logger.error(`[GroupChat] processMentions error: ${err.message}`)
|
||||
})
|
||||
}
|
||||
|
||||
private handleTyping(socket: Socket, data: { roomId?: string }): void {
|
||||
const roomId = data.roomId || 'general'
|
||||
const userId = this.socketUserMap.get(socket.id) || socket.id
|
||||
const userName = this.userInfoMap.get(userId)?.name || `User-${socket.id.slice(0, 6)}`
|
||||
|
||||
// Track typing state for rejoin recovery
|
||||
let roomTyping = this.typingState.get(roomId)
|
||||
if (!roomTyping) {
|
||||
roomTyping = new Map()
|
||||
this.typingState.set(roomId, roomTyping)
|
||||
}
|
||||
const existing = roomTyping.get(userId)
|
||||
if (existing) clearTimeout(existing.timer)
|
||||
roomTyping.set(userId, {
|
||||
userName,
|
||||
timer: setTimeout(() => {
|
||||
roomTyping!.delete(userId)
|
||||
if (roomTyping!.size === 0) this.typingState.delete(roomId)
|
||||
}, 30000),
|
||||
})
|
||||
|
||||
socket.to(roomId).emit('typing', {
|
||||
roomId,
|
||||
userId,
|
||||
userName,
|
||||
})
|
||||
}
|
||||
|
||||
private handleStopTyping(socket: Socket, data: { roomId?: string }): void {
|
||||
const roomId = data.roomId || 'general'
|
||||
const userId = this.socketUserMap.get(socket.id) || socket.id
|
||||
|
||||
// Remove from typing state
|
||||
const roomTyping = this.typingState.get(roomId)
|
||||
if (roomTyping) {
|
||||
const entry = roomTyping.get(userId)
|
||||
if (entry) clearTimeout(entry.timer)
|
||||
roomTyping.delete(userId)
|
||||
if (roomTyping.size === 0) this.typingState.delete(roomId)
|
||||
}
|
||||
|
||||
socket.to(roomId).emit('stop_typing', {
|
||||
roomId,
|
||||
userId,
|
||||
})
|
||||
}
|
||||
|
||||
private handleContextStatus(socket: Socket, data: { roomId?: string; agentName?: string; status?: string }): void {
|
||||
const roomId = data.roomId || 'general'
|
||||
const agentName = data.agentName || ''
|
||||
const status = data.status || ''
|
||||
|
||||
if (!agentName) return
|
||||
|
||||
let roomStatuses = this.contextStatusState.get(roomId)
|
||||
if (!roomStatuses) {
|
||||
roomStatuses = new Map()
|
||||
this.contextStatusState.set(roomId, roomStatuses)
|
||||
}
|
||||
|
||||
if (status === 'ready') {
|
||||
roomStatuses.delete(agentName)
|
||||
if (roomStatuses.size === 0) this.contextStatusState.delete(roomId)
|
||||
} else {
|
||||
roomStatuses.set(agentName, { agentName, status })
|
||||
}
|
||||
|
||||
// Relay to all other sockets in the room
|
||||
socket.to(roomId).emit('context_status', {
|
||||
roomId,
|
||||
agentName,
|
||||
status,
|
||||
})
|
||||
}
|
||||
|
||||
private handleDisconnect(socket: Socket): void {
|
||||
const socketId = socket.id
|
||||
const userId = this.socketUserMap.get(socketId)
|
||||
const userName = userId ? this.userInfoMap.get(userId)?.name : undefined
|
||||
|
||||
logger.debug(`[GroupChat] Disconnected: ${userName || socketId} (socket=${socketId}, user=${userId || socketId})`)
|
||||
|
||||
// Clean up typing state for this socket
|
||||
for (const [roomId, roomTyping] of this.typingState) {
|
||||
const entry = roomTyping.get(userId || socketId)
|
||||
if (entry) {
|
||||
clearTimeout(entry.timer)
|
||||
roomTyping.delete(userId || socketId)
|
||||
if (roomTyping.size === 0) this.typingState.delete(roomId)
|
||||
}
|
||||
}
|
||||
|
||||
this.leaveAllRooms(socket, socketId)
|
||||
this.socketUserMap.delete(socketId)
|
||||
// Don't delete userInfoMap — it persists across reconnects
|
||||
}
|
||||
|
||||
// ─── Helpers ────────────────────────────────────────────────
|
||||
|
||||
private getTypingUsers(roomId: string): Array<{ userId: string; userName: string }> {
|
||||
const roomTyping = this.typingState.get(roomId)
|
||||
if (!roomTyping) return []
|
||||
return Array.from(roomTyping.entries()).map(([userId, entry]) => ({ userId, userName: entry.userName }))
|
||||
}
|
||||
|
||||
private getContextStatuses(roomId: string): Array<{ agentName: string; status: string }> {
|
||||
const roomStatuses = this.contextStatusState.get(roomId)
|
||||
if (!roomStatuses) return []
|
||||
return Array.from(roomStatuses.values())
|
||||
}
|
||||
|
||||
private leaveAllRooms(socket: Socket, socketId: string): void {
|
||||
this.rooms.forEach((room, rid) => {
|
||||
if (room.hasOnlineMember(socketId)) {
|
||||
const member = room.getOnlineMemberBySocketId(socketId)
|
||||
room.removeMember(socketId)
|
||||
socket.leave(rid)
|
||||
this.nsp.to(rid).emit('member_left', {
|
||||
roomId: rid,
|
||||
memberId: member?.userId || socketId,
|
||||
memberName: member?.name || `User-${socketId.slice(0, 6)}`,
|
||||
members: room.getMembersList(),
|
||||
})
|
||||
}
|
||||
})
|
||||
}
|
||||
|
||||
private generateId(): string {
|
||||
return Date.now().toString(36) + Math.random().toString(36).slice(2, 8)
|
||||
}
|
||||
}
|
||||
@@ -23,7 +23,7 @@ function rotateIfNeeded() {
|
||||
truncateSync(logFile, 0)
|
||||
writeFileSync(logFile, buf)
|
||||
}
|
||||
} catch {}
|
||||
} catch { }
|
||||
}
|
||||
|
||||
// Rotate on startup
|
||||
@@ -36,5 +36,5 @@ export const logger = pino({
|
||||
level: process.env.LOG_LEVEL || 'info',
|
||||
}, pino.destination({
|
||||
dest: logFile,
|
||||
sync: false,
|
||||
sync: true,
|
||||
}))
|
||||
|
||||
@@ -1,6 +1,6 @@
|
||||
import { logger } from './logger'
|
||||
|
||||
export function bindShutdown(server: any): void {
|
||||
export function bindShutdown(server: any, groupChatServer?: any): void {
|
||||
let isShuttingDown = false
|
||||
|
||||
const shutdown = async (signal: string) => {
|
||||
@@ -10,6 +10,13 @@ export function bindShutdown(server: any): void {
|
||||
logger.info('Shutting down (%s)...', signal)
|
||||
|
||||
try {
|
||||
// Disconnect Socket.IO before HTTP server to prevent hanging
|
||||
if (groupChatServer) {
|
||||
groupChatServer.agentClients.disconnectAll()
|
||||
groupChatServer.getIO().close()
|
||||
logger.info('Socket.IO closed')
|
||||
}
|
||||
|
||||
if (server) {
|
||||
await new Promise<void>((resolve) => {
|
||||
server.close(() => {
|
||||
|
||||
Reference in New Issue
Block a user