2026-03-12 22:58:50 网络安全文章来源：ZONE.CI 全球网 0 阅读模式

文章总结： 这篇文章详细介绍了在KaliLinux上完全本地化部署大语言模型的完整流程，涵盖NVIDIAGPU驱动配置、Ollama安装与模型测试、MCPKaliServer部署以及5ire客户端配置。通过这些步骤可实现离线AI辅助安全测试环境，让本地LLM调用Kali安全工具进行渗透测试，适合有Linux基础的安全从业者实操参考。 综合评分： 79 文章分类： AI安全,安全工具,实战经验,渗透测试,安全建设

cover_image

Kali 与 LLM：完全本地化，使用 Ollama 与 5ire

APT-101 APT-101

APT-101

2026年3月12日 09:20 陕西

注意：本地 LLM 对硬件要求很高. 成本因素在于购买硬件及运行开销。如果你能复用现有设备，那再好不过！

GPU（NVIDIA）

首先，我们来确认一下我们的硬件：

$&nbsp;lspci | grep -i vga07:00.0 VGA compatible controller: NVIDIA Corporation GP106 [GeForce GTX 1060 6GB] (rev a1)$

NVIDIA GeForce GTX 1060（6 GB）.

驱动程序

我们将通过检查“非自由”专有驱动是否已安装，来确认硬件准备就绪。非自由选项允许 CUDA 支持，而开源的 nouveau 驱动则不具备此功能。

同时，请确保内核与头文件为最新版本：

$&nbsp;sudo apt update[...]$$&nbsp;sudo apt install -y linux-image-$(dpkg --print-architecture) linux-headers-$(dpkg --print-architecture) nvidia-driver nvidia-smi[...]│&nbsp;Conflicting&nbsp;nouveau kernel&nbsp;module&nbsp;loaded &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;││&nbsp;The&nbsp;free nouveau kernel&nbsp;module&nbsp;is currently loaded&nbsp;and&nbsp;conflicts with the non-free nvidia kernel&nbsp;module. &nbsp;││&nbsp;The&nbsp;easiest way to fix this is to reboot the machine once the installation has finished. &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;|[...]$$ sudo reboot

使用 AMD 或 Intel 等其他厂商 GPU 的情况不在本文讨论范围内。

测试

重启后重新登录，我们可用 nvidia-smi 快速验证：

$&nbsp;lspci&nbsp;-s&nbsp;07:00.0&nbsp;-v&nbsp;| grep KernelKernel driver&nbsp;in&nbsp;use: nvidiaKernel modules: nvidia$$&nbsp;lsmod | grep&nbsp;'^nouveau'$$&nbsp;lsmod | grep&nbsp;'^nvidia'nvidia_drm &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;126976&nbsp;&nbsp;2nvidia_modeset &nbsp; &nbsp; &nbsp;&nbsp;16632&nbsp;&nbsp;3&nbsp;nvidia_drmnvidia &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;60710912&nbsp;&nbsp;29&nbsp;nvidia_drm,nvidia_modeset$$&nbsp;nvidia-smiTue Jan&nbsp;27&nbsp;14:33:31&nbsp;2026+-----------------------------------------------------------------------------------------+| NVIDIA-SMI&nbsp;550.163.01&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Driver Version:&nbsp;550.163.01&nbsp; &nbsp; &nbsp;CUDA Version:&nbsp;12.4&nbsp; &nbsp; &nbsp;||-----------------------------------------+------------------------+----------------------+| GPU &nbsp;Name &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Persistence-M&nbsp;| Bus-Id&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Disp.A | Volatile Uncorr. ECC || Fan &nbsp;Temp &nbsp; Perf &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Pwr:Usage/Cap | &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Memory-Usage&nbsp;| GPU-Util&nbsp; Compute M. || &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; | &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;| &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; MIG M. ||=========================================+========================+======================|| &nbsp;&nbsp;0&nbsp; NVIDIA GeForce GTX&nbsp;1060&nbsp;6GB &nbsp; &nbsp;Off | &nbsp;&nbsp;00000000:07:00.0&nbsp; On | &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;N/A || &nbsp;0% &nbsp;&nbsp;30C &nbsp; &nbsp;P8 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;6W / &nbsp;120W | &nbsp; &nbsp; &nbsp;25MiB / &nbsp;&nbsp;6144MiB | &nbsp; &nbsp; &nbsp;0% &nbsp; &nbsp; &nbsp;Default || &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; | &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;| &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;N/A |+-----------------------------------------+------------------------+----------------------++-----------------------------------------------------------------------------------------+| Processes: &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;|| &nbsp;GPU &nbsp;&nbsp;GI&nbsp; &nbsp;CI &nbsp; &nbsp; &nbsp; &nbsp;PID &nbsp;&nbsp;Type&nbsp; &nbsp;Process&nbsp;name &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;GPU Memory || &nbsp; &nbsp; &nbsp; &nbsp;ID &nbsp; ID &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Usage &nbsp; &nbsp; &nbsp;||=========================================================================================|| &nbsp; &nbsp;0&nbsp; &nbsp;N/A &nbsp;N/A &nbsp; &nbsp; &nbsp;&nbsp;969&nbsp; &nbsp; &nbsp; G &nbsp; /usr/lib/xorg/Xorg &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;21MiB |+-----------------------------------------------------------------------------------------+$

一切看起来都正常。

Ollama

接下来，我们需要安装 Ollama。Ollama 将允许我们加载本地 LLM。

Ollama 是 llama.cpp 的封装。5ire 支持 Ollama，但不支持 llama.cpp。

如果你不想执行 curl | bash，请参阅手动安装方法；或按以下方式操作（针对 v0.15.2，撰写时最新版，2026-01-27）

$&nbsp;sudo apt install&nbsp;-y&nbsp;curl[...]$$&nbsp;curl&nbsp;--fail&nbsp;--location&nbsp;https://ollama.com/download/ollama-linux-amd64.tar.zst > /tmp/ollama-linux-amd64.tar.zst[...]$$&nbsp;file /tmp/ollama-linux-amd64.tar.zst/tmp/ollama-linux-amd64.tar.zst: Zstandard compressed&nbsp;data&nbsp;(v0.8+), Dictionary ID: None$&nbsp;sha512sum /tmp/ollama-linux-amd64.tar.zst1c16259de4898a694ac23e7d4a3038dc3aebbbb8247cf30a05f5c84f2bde573294e8e612f3a9d5042201ebfe148f5b7fe64acc50f5478d3453f62f85d44593a1 &nbsp;/tmp/ollama-linux-amd64.tar.zst$$&nbsp;sudo tar x&nbsp;-v&nbsp;--zstd&nbsp;-C&nbsp;/usr&nbsp;-f&nbsp;/tmp/ollama-linux-amd64.tar.zst[...]$$&nbsp;sudo useradd&nbsp;-r&nbsp;-s&nbsp;/bin/false&nbsp;-U&nbsp;-m&nbsp;-d&nbsp;/usr/share/ollama ollama$$&nbsp;sudo usermod&nbsp;-a&nbsp;-G&nbsp;ollama&nbsp;$(whoami)$$&nbsp;cat&nbsp;<<EOF | sudo&nbsp;tee&nbsp;/etc/systemd/system/ollama.service >/dev/null[Unit]Description=Ollama ServiceAfter=network-online.target[Service]ExecStart=/usr/bin/ollama serveUser=ollamaGroup=ollamaRestart=alwaysRestartSec=3Environment="PATH=\$PATH"[Install]WantedBy=multi-user.targetEOF$$&nbsp;sudo systemctl daemon-reload$$&nbsp;sudo systemctl enable&nbsp;--now&nbsp;ollamaCreated symlink&nbsp;'/etc/systemd/system/multi-user.target.wants/ollama.service'&nbsp;→&nbsp;'/etc/systemd/system/ollama.service'.$$&nbsp;systemctl status ollama● ollama.service - Ollama ServiceLoaded: loaded (/etc/systemd/system/ollama.service; enabled; preset: disabled)Active: active (running) since Tue&nbsp;2026-01-27&nbsp;14:44:39&nbsp;GMT;&nbsp;18s ago[...]$$&nbsp;oama&nbsp;-vollama version is&nbsp;0.15.2$

服务报告为活跃且正在运行（日志中无错误）。

LLM

现在我们需要一个供 Ollama 运行的 LLM！有几个地方可获取预生成的 LLM：

Ollama.com
HuggingFace.co（简称 HF）

你想要哪种模型？是时候实验了！

我们需要一个支持 “Tools” 功能的模型。我们稍后会解释为什么这很重要。
你的硬件将决定你能运行多复杂的模型。我们使用的硬件有 6GB VRAM，因此我们需要小于该容量的模型。

我们选择了 3 个进行测试：

$ ollama listNAME &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ID &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;SIZE &nbsp; &nbsp; &nbsp;MODIFIEDllama3.1:8b &nbsp; &nbsp;46e0c10c039e &nbsp; &nbsp;4.9 GB &nbsp; &nbsp;8 minutes agollama3.2:3b &nbsp; &nbsp;a80c4f17acd5 &nbsp; &nbsp;2.0 GB &nbsp; &nbsp;29 minutes agoqwen3:4b &nbsp; &nbsp; &nbsp; 359d7dd4bcda &nbsp; &nbsp;2.5 GB &nbsp; &nbsp;39 minutes ago$

测试

让我们测试 Ollama 是否工作正常：

$&nbsp;ollama run qwen3:4b

首次运行时, 它需要将模型加载到内存中。这可能需要一段时间，具体取决于你的硬件。

当 LLM 加载完成后, 我们将看到提示符。让我们输入 “Hello world!”：

>>>&nbsp;Hello world!Thinking...Okay, the&nbsp;user&nbsp;said "Hello world!"&nbsp;and&nbsp;wants me&nbsp;to&nbsp;respond. Let me think about how&nbsp;to&nbsp;approach this.&nbsp;First, I should acknowledge their greeting. Since they used the classic "Hello World!" which&nbsp;is&nbsp;oftenthe&nbsp;first&nbsp;program&nbsp;in&nbsp;many programming languages, maybe I can relate that&nbsp;to&nbsp;my capabilities. I should make sure&nbsp;to&nbsp;keep the tone friendly&nbsp;and&nbsp;open&nbsp;for&nbsp;further conversation. Let me&nbsp;check&nbsp;if there'sanything specific they might need help with. Maybe they're just testing me&nbsp;or&nbsp;want&nbsp;to&nbsp;start&nbsp;a discussion. I'll keep the response simple and welcoming, inviting them to ask questions or share what theyneed help with. Also, I should avoid any markdown and keep it natural. Alright, time to put that together....done thinking.Hello! &nbsp;How can I assist you today? Whether you have questions, need help with something, or just want to chat, I'm here&nbsp;for&nbsp;you!&nbsp;What's on your mind?>>> /exit$

我们可通过以下命令检查 Ollama 状态：

$&nbsp;ollama&nbsp;psNAME &nbsp; &nbsp; &nbsp; &nbsp;ID &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;SIZE &nbsp; &nbsp; &nbsp;PROCESSOR &nbsp; &nbsp;CONTEXT &nbsp; &nbsp;UNTILqwen3:4b &nbsp; &nbsp;359d7dd4bcda &nbsp; &nbsp;3.5&nbsp;GB &nbsp; &nbsp;100% GPU &nbsp; &nbsp;&nbsp;4096&nbsp; &nbsp; &nbsp; &nbsp;4&nbsp;minutes from now$

很好，看起来一切正常！

MCP 服务器（MCP Kali Server）

我们现在需要安装并运行一个 MCP 服务器。

对于本指南, 我们使用的是 Kali 的最小安装版, 意味着没有预装工具。

继续使用 mcp-kali-server：

$ sudo apt install&nbsp;-y mcp-kali-server dirb gobuster nikto nmap enum4linux-ng hydra john metasploit-framework sqlmap wpscan wordlists[...]$$ sudo gunzip&nbsp;-v&nbsp;/usr/share/wordlists/rockyou.txt.gz/usr/share/wordlists/rockyou.txt.gz: &nbsp;&nbsp;61.9%&nbsp;--&nbsp;replaced with&nbsp;/usr/share/wordlists/rockyou.txt$$ kali-server-mcp2026-01-27&nbsp;15:54:01,339&nbsp;[INFO]&nbsp;Starting&nbsp;Kali&nbsp;Linux&nbsp;Tools&nbsp;API&nbsp;Server&nbsp;on&nbsp;127.0.0.1:5000*&nbsp;Serving&nbsp;Flask&nbsp;app 'kali_server'*&nbsp;Debug&nbsp;mode: off2026-01-27&nbsp;15:54:01,352INFO]&nbsp;WARNING:&nbsp;This&nbsp;is&nbsp;a development server.&nbsp;Do&nbsp;not use it&nbsp;in&nbsp;a production deployment.&nbsp;Use&nbsp;a production&nbsp;WSGI&nbsp;server instead.*&nbsp;Running&nbsp;on http://127.0.0.1:50002026-01-27&nbsp;15:54:01,352&nbsp;[INFO]&nbsp;Press&nbsp;CTRL+C&nbsp;to quit

长期来看, 有多种方式让 kali-server-mcp 在后台运行, 例如使用 tmux/screen 会话或创建 systemd.unit, 但这超出了本文范围。

测试

现在手动运行 mcp-server：

$ mcp-server2026-01-27 15:54:18,802 [INFO] Initialized Kali Tools Client connecting to http://localhost:50002026-01-27 15:54:18,811 [INFO] Successfully connected to Kali API server at http://localhost:50002026-01-27 15:54:18,811 [INFO] Server health status: healthy2026-01-27 15:54:18,826 [INFO] Starting Kali MCP server2026-01-27 15:54:18,804 [INFO] Executing&nbsp;command:&nbsp;which&nbsp;nmap2026-01-27 15:54:18,806 [INFO] Executing&nbsp;command:&nbsp;which&nbsp;gobuster2026-01-27 15:54:18,807 [INFO] Executing&nbsp;command:&nbsp;which&nbsp;dirb2026-01-27 15:54:18,808 [INFO] Executing&nbsp;command:&nbsp;which&nbsp;nikto2026-01-27 15:54:18,810 [INFO] 127.0.0.1 - - [27/Jan/2026 15:54:18]&nbsp;"GET /health HTTP/1.1"&nbsp;200 -

一切看起来都很好！无错误或警告。

我们还可以看到 kali-server-mcp 的日志中新增了额外行。很好。

5ire

因此，我们有一个本地 LLM 和一个 MCP 服务器。Ollama 不支持 MCP（目前？），所以我们需要一个能桥接两者的工具。这就是 5ire ——“一个简洁的 AI 助手与 MCP 客户端”。

接下来，下载 5ire 的 AppImage（撰写时为 5ire-0.15.3-x86_64.AppImage，2026-01-27）并创建菜单项：

$ curl --fail --location https://github.com/nanbingxyz/5ire/releases/download/v0.15.3/5ire-0.15.386_64.AppImage > 5ire-x86_64.AppImage[...]$$ file 5ire-x86_64.AppImage5ire-x86_64.AppImage: ELF 64-bit LSB executable, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2,&nbsp;for&nbsp;GNU/Linux 2.6.18, stripped$&nbsp;sha512sum&nbsp;5ire-x86_64.AppImagebdf665fc6636da240153d44629723cb311bba4068db21c607f05cc6e1e58bb2e45aa72363a979a2aa165cb08a12db7babb715ac58da448fc9cf0258b22a56707 &nbsp;5ire-x86_64.AppImage$$&nbsp;sudo&nbsp;mkdir&nbsp;-pv /opt/5ire/mkdir: created directory&nbsp;'/opt/5ire/'$$&nbsp;sudo&nbsp;mv&nbsp;-v 5ire-x86_64.AppImage /opt/5ire/5ire-x86_64.AppImagerenamed&nbsp;'5ire-x86_64.AppImage'&nbsp;->&nbsp;'/opt/5ire/5ire-x86_64.AppImage'$$&nbsp;chmod&nbsp;-v 0755 /opt/5ire/5ire-x86_64.AppImagemode of&nbsp;'/opt/5ire/5ire-x86_64.AppImage'&nbsp;changed from 0664 (rw-rw-r--) to 0755 (rwxr-xr-x)$$&nbsp;mkdir&nbsp;-pv ~/.local/share/applications/mkdir: created directory&nbsp;'/home/kali/.local/share/applications/'$$&nbsp;cat&nbsp;<<EOF | sudo tee ~/.local/share/applications/5ire.desktop >/dev/null[Desktop Entry]Name=5ireComment=5ire Desktop AI AssistantExec=/opt/5ire/5ire-x86_64.AppImageTerminal=falseType=ApplicationCategories=Utility;Development;StartupWMClass=5ireEOF$$&nbsp;sudo&nbsp;ln&nbsp;-sfv /opt/5ire/5ire-x86_64.AppImage /usr/local/bin/5ire'/usr/local/bin/5ire'&nbsp;->&nbsp;'/opt/5ire/5ire-x86_64.AppImage'$$&nbsp;sudo&nbsp;apt install -y libfuse2t64[...]$

我们现在既可以通过菜单，也可以从终端调用它。

现在我们需要配置 5ire 以使用 Ollama（LLM）和 mcp-kali-server（MCP 服务器）：

打开 5ire，然后：

5ire → Workspace → Providers → Ollama 启用默认设置，并为每个 Ollama 模型启用 “Tools” 和 “Enabled” → 保存。对每个模型重复此操作。

（如需，可选择一个模型作为默认）

测试

现在让我们测试 5ire！

新建聊天 → Ollama Hello world!

再次检查状态：

$&nbsp;ollama&nbsp;psNAME &nbsp; &nbsp; &nbsp; &nbsp;ID &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;SIZE &nbsp; &nbsp; &nbsp;PROCESSOR &nbsp; &nbsp;CONTEXT &nbsp; &nbsp;UNTILqwen3:4b &nbsp; &nbsp;359d7dd4bcda &nbsp; &nbsp;3.5&nbsp;GB &nbsp; &nbsp;100% GPU &nbsp; &nbsp;&nbsp;4096&nbsp; &nbsp; &nbsp; &nbsp;2&nbsp;minutes from now$

看起来运行良好！现在设置 MCP。

MCP 客户端（5ire）

我们可以使用 5ire 的 GUI：

5ire → Tools → Local

现在填写字段：

名称：mcp-kali-server
描述：MCP Kali Server
批准策略：……由你决定
命令：/usr/bin/mcp-server 保存别忘了启用它！

我们可以查看现在有什么可用的。... → 浏览

测试

新建聊天 → Ollama 你能帮我扫描一下 scanme.nmap.org 的端口吗？目标端口：TCP 80、443、21、22？

原文：https://www.kali.org/blog/kali-llm-ollama-5ire/?utm_source=dlvr.it&utm_medium=twitter

免责声明：

本文所载程序、技术方法仅面向合法合规的安全研究与教学场景，旨在提升网络安全防护能力，具有明确的技术研究属性。

任何单位或个人未经授权，将本文内容用于攻击、破坏等非法用途的，由此引发的全部法律责任、民事赔偿及连带责任，均由行为人独立承担，本站不承担任何连带责任。

本站内容均为技术交流与知识分享目的发布，若存在版权侵权或其他异议，请通过邮件联系处理，具体联系方式可点击页面上方的联系我。

本文转载自：APT-101 APT-101 APT-101《Kali 与 LLM：完全本地化，使用 Ollama 与 5ire》