Bundle Pluginsource linked

Agent Searchkitv0.3.3

Local-first SearXNG search + reranking + research pipelines for AI agents. Framework-agnostic.

agent-searchkit·runtime agent-searchkit·by @lemoncandy42
openclaw bundles install clawhub:agent-searchkit
Latest release: v0.3.3Download zip

Capabilities

Bundle format
generic
Runtime ID
agent-searchkit

Compatibility

Built With Open Claw Version
2026.5.7
Min Gateway Version
2026.3.24-beta.2
Plugin Api Range
>=2026.3.24-beta.2
Plugin Sdk Version
2026.5.7
Security Scan
VirusTotalVirusTotal
stale
View report →
OpenClawOpenClaw
Suspicious
high confidence
!
Purpose & Capability
The search, reranking, extraction, and research-run capabilities fit the stated purpose, but the README's local-only privacy claim conflicts with its disclosed Google/Bing/DuckDuckGo/Qwant aggregation.
Instruction Scope
The OpenClaw instructions can make this the default web_search provider, so agents may route searches through it automatically; this is disclosed and purpose-aligned, but users should understand where queries go.
Install Mechanism
There is no OpenClaw install spec, but the README provides user-run Docker and git setup commands. The Docker image is not pinned, and registry requirements under-declare Docker/Node prerequisites, but this appears setup-related rather than hidden execution.
!
Credentials
Network access is expected for a web search/extraction tool, but the artifacts under-disclose the privacy impact by claiming queries never leave the machine while using external search engines through SearXNG.
Persistence & Privilege
Research runs are intentionally saved under local runs directories, and static scan snippets show subprocess execution with inherited environment variables; both are sensitive but appear purpose-aligned and disclosed enough to be review notes rather than evidence of malicious behavior.
Scan Findings in Context
[suspicious.dangerous_exec] expected: The flagged `spawn(command, args, ...)` in `src/index.ts`/`dist/index.js` indicates local subprocess execution. That is plausible for a local service/CLI integration, and the provided snippets do not show hidden unrelated commands or automatic destructive execution.
[suspicious.env_credential_access] expected: The flagged `env: process.env` means spawned child processes inherit the full environment. This is a sensitive review point, but the artifacts do not show hardcoded credentials, logging of secrets, or exfiltration.
[suspicious.install_untrusted_source] expected: The flagged `http://127.0.0.1:8888` is a localhost SearXNG service URL in plugin configuration, not evidence of a remote untrusted install source.
What to consider before installing
Install only if you are comfortable with your search queries going to the configured SearXNG instance and its upstream search engines. Do not search secrets, run it with minimal environment credentials, pin the Docker image if possible, and periodically clean local research-run files.
dist/index.js:3688
Shell command execution detected (child_process).
src/index.ts:5155
Shell command execution detected (child_process).
dist/index.js:3690
Environment variable access combined with network send.
src/index.ts:5157
Environment variable access combined with network send.
!
openclaw.plugin.json:13
Install source points to URL shortener or raw IP.
Patterns worth reviewing
These patterns may indicate risky behavior. Check the VirusTotal and OpenClaw results above for context-aware analysis before installing.

Verification

Tier
source linked
Scope
artifact only
Summary
Validated package structure and linked the release to source metadata.
Commit
e2004f6bf722
Tag
main
Provenance
No
Scan status
suspicious

Tags

latest
0.3.3
<p align="center"> <img src="https://img.shields.io/badge/🔍-agent--searchkit-blueviolet?style=for-the-badge&labelColor=0d1117&color=58a6ff" alt="agent-searchkit"> </p> <h1 align="center">agent-searchkit</h1> <p align="center"> <strong>为 AI Agent 打造的本地搜索基础设施</strong><br> 多引擎聚合 · 7 级 reranking · 数据不出本机 </p> <p align="center"> <a href="https://github.com/LemonCANDY42/agent-searchkit/blob/main/LICENSE"><img src="https://img.shields.io/badge/license-MIT-00d56b?style=flat-square&labelColor=0d1117" alt="License"></a> <a href="https://github.com/LemonCANDY42/agent-searchkit/stargazers"><img src="https://img.shields.io/github/stars/LemonCANDY42/agent-searchkit?style=flat-square&labelColor=0d1117&color=ffd700" alt="Stars"></a> <a href="https://github.com/LemonCANDY42/agent-searchkit/issues"><img src="https://img.shields.io/github/issues/LemonCANDY42/agent-searchkit?style=flat-square&labelColor=0d1117&color=ff6b6b" alt="Issues"></a> <a href="https://github.com/LemonCANDY42/agent-searchkit/network/members"><img src="https://img.shields.io/github/forks/LemonCANDY42/agent-searchkit?style=flat-square&labelColor=0d1117&color=8b949e" alt="Forks"></a> </p> <p align="center"> <a href="#-quickstart">Quickstart</a> · <a href="#-features">Features</a> · <a href="#-comparison">Comparison</a> · <a href="#-integration">Integration</a> · <a href="#-architecture">Architecture</a> </p>

✨ Highlights

  • 📊 7 级 Reranking — 从原始排序到实体感知,渐进优化
  • 🌐 四引擎聚合 — Google + Bing + DuckDuckGo + Qwant 同时搜
  • 🔒 完全本地 — 查询永不出本机,零遥测,无需 API Key
  • 📎 引用注释 — 可选输出 [1] [2] ... 标准引用格式
  • 🔍 搜索 + 提取 + 研究 — search / extract / research / status 四个工具
  • 🔌 即插即用 — 原生支持 OpenClaw / MCP / LangChain / CrewAI

📋 Prerequisites

  • Docker(用于运行 SearXNG 搜索引擎)
  • Node.js 18+(用于 CLI 工具和插件)

⚡ Quickstart

1️⃣ 启动 SearXNG

docker run -d --name searxng -p 8888:8080 searxng/searxng

2️⃣ 安装

OpenClaw(推荐):

# 安装插件
openclaw plugins install clawhub:agent-searchkit
openclaw config set plugins.entries.agent-searchkit.enabled true
openclaw config set plugins.entries.agent-searchkit.config.searxngBaseUrl "http://127.0.0.1:8888"

# 设为默认 web search(可选)
openclaw config set tools.web.search.provider agent-searchkit

# 重启
openclaw gateway restart

设置 tools.web.search.provider 后,Agent 调用内置 web_search 会自动走 SearXNG + reranking。

其他框架 / 独立使用:

git clone https://github.com/LemonCANDY42/agent-searchkit.git
cd agent-searchkit/services && cp .env.example .env.local && ./manage.sh up

3️⃣ 搜索

./bin/searx-search "Python 3.14 new features"
./bin/searx-search -c news -n 5 "AI regulation 2026"
./bin/searx-search -l zh-CN "量子计算 最新突破"

🔍 Comparison

<table> <tr> <th></th> <th>agent-searchkit</th> <th>Brave API</th> <th>Google CSE</th> <th>DuckDuckGo</th> </tr> <tr> <td>💰 <b>费用</b></td> <td>✅ 免费,无限制</td> <td>2K/月后 $3/千次</td> <td>100次/天</td> <td>免费但限速</td> </tr> <tr> <td>🌐 <b>多引擎聚合</b></td> <td>✅ 4 引擎同时搜</td> <td>仅 Brave</td> <td>仅 Google</td> <td>仅 DDG</td> </tr> <tr> <td>🔒 <b>数据隐私</b></td> <td>✅ 数据不出本机</td> <td>发送到 Brave</td> <td>发送到 Google</td> <td>有限</td> </tr> <tr> <td>🔑 <b>API Key</b></td> <td>✅ 不需要</td> <td>❌ 必需</td> <td>❌ 必需</td> <td>✅ 不需要</td> </tr> <tr> <td>📊 <b>Reranking</b></td> <td>✅ 7 个版本渐进优化</td> <td>❌</td> <td>❌</td> <td>❌</td> </tr> <tr> <td>📝 <b>研究流水线</b></td> <td>✅ 结果自动保存</td> <td>❌</td> <td>❌</td> <td>❌</td> </tr> </table>

✨ Features

🔍 四大工具

Tool能力说明
web_searchkit_searchSearXNG 搜索 + 多版本 rerank核心搜索入口
web_searchkit_research搜索并保存结果到本地产出 search.json + report.md
web_searchkit_extract网页提取 (fetch + Playwright)支持 JS 渲染页面
web_searchkit_status健康检查栈状态检查

📊 7 级 Reranking 流水线

从原始排序到实体感知,渐进式提升搜索质量:

v1.0  原始 SearXNG 排序 ─────────────────────── 基线
v1.1  启发式混合 (词法 + 域名先验) ──────────── 快、无依赖
v1.2  + 片段嵌入相似度 ───────────────────────── 语义匹配
v1.3  自适应混合 (查询桶加权) ────────────────── 意图感知
v1.4  ★ 默认 ── 检索优先 + 自适应 rerank ──── 通用最优
v1.5  + 精确拟合优化 (结构化查询) ──────────── 文档/包/API
v2.0  实体感知 + 页面角色覆盖 ──────────────── 高级研究

🎯 搜索模式

mode: "auto" | "general" | "official-docs" | "github" | "models" | "packages"

Agent 只需传 query,模式自动检测。或者手动指定——查文档用 official-docs,找 repo 用 github,找模型用 models

📎 引用注释 (Citations)

搜索时传入 citations=true,每条结果附带引用信息:

{
  "citation": {
    "ref": "[1]",
    "formatted": "[1] Page Title. https://example.com/page (accessed 2026-05-15)",
    "inline": "(example.com, 2026)"
  }
}
  • ref — 编号引用,用于行内标注 [1]
  • formatted — 完整引用文本,适合参考文献列表
  • inline — 简短括号形式,适合行内注明 (来源, 年份)

默认关闭。传入 citations=true 开启。


🔌 Integration

OpenClaw(原生插件)

openclaw plugins install clawhub:agent-searchkit
openclaw config set tools.web.search.provider agent-searchkit
openclaw gateway restart
# Agent 调用 web_search 时自动走 agent-searchkit

MCP Server

{
  "mcpServers": {
    "agent-searchkit": {
      "command": "node",
      "args": ["path/to/agent-searchkit/src/index.ts"],
      "env": { "SEARXNG_BASE_URL": "http://127.0.0.1:8888" }
    }
  }
}

Python / LangChain

import subprocess, json
from langchain.tools import Tool

def local_search(query: str, limit: int = 8) -> list[dict]:
    result = subprocess.run(
        ["./bin/searx-search", "--json", "-n", str(limit), query],
        capture_output=True, text=True
    )
    return json.loads(result.stdout)

search_tool = Tool(name="local_search", func=local_search, description="Local web search")

CrewAI

from crewai.tools import BaseTool

class WebSearchTool(BaseTool):
    name = "web_search"
    description = "Search the web locally with reranking"

    def _run(self, query: str) -> str:
        result = subprocess.run(
            ["./bin/searx-search", "--json", "-n", "8", query],
            capture_output=True, text=True, timeout=30,
        )
        return result.stdout

🏗️ Architecture

  ┌──────────────────────────────────────────────────┐
  │             AI Agent (任意框架)                    │
  │   OpenClaw · MCP · CrewAI · LangChain · 自研     │
  └────────────────────┬─────────────────────────────┘
                       │
            ┌──────────▼──────────┐
            │   agent-searchkit    │
            │   ┌──────────────┐  │
            │   │ search       │  │   ← 7 级 rerank
            │   │ research     │  │   ← 结果保存到本地
            │   │ extract      │  │   ← Playwright fallback
            │   │ status       │  │   ← 健康检查
            │   └──────────────┘  │
            └──────────┬──────────┘
                       │
            ┌──────────▼──────────┐
            │      SearXNG        │     ┌──────────┐
            │   (meta-search)     │────▶│  Google   │
            │   localhost:8888    │────▶│  Bing     │
            └──────────┬──────────┘────▶│  DuckDG   │
                       │                │  Qwant    │
            ┌──────────▼──────────┐     └──────────┐
            │   Rerank Pipeline   │
            │   v1.0 ──▶ v2.0    │
            └──────────┬──────────┘
                       │
            ┌──────────▼──────────┐
            │   Research Runs     │
            │   runs/<timestamp>/ │
            │   ├─ search.json    │
            │   └─ report.md      │
            └─────────────────────┘

⚙️ Configuration

字段默认值说明
searxngBaseUrlhttp://127.0.0.1:8888SearXNG 地址
defaultLanguageen-US默认搜索语言
defaultLimit8默认返回条数
rerankEnabledtrue启用 reranking
defaultRerankVersionv1.4默认 rerank 版本
defaultModeauto默认搜索模式

🤝 Contributing

git clone https://github.com/LemonCANDY42/agent-searchkit.git
cd agent-searchkit
git checkout -b feat/your-feature
# 测试
node src/index.test.mjs
./services/manage.sh test
git push origin feat/your-feature
# 开 PR 🎉

📄 License

MIT


<p align="center"> <sub>Built with 🧠 by <a href="https://github.com/LemonCANDY42">Kenny</a> · Powered by <a href="https://docs.searxng.org/">SearXNG</a></sub> </p>