Packet Capture Analysis in Practice: Locating Network Faults with Wireshark and tcpdump

马哥网络安全

March 1, 2026, 17:01, Henan

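I. Overview

1.1 Background

Packet capture is the last-resort tool for operations troubleshooting. When monitoring only tells you "connection timed out" and the logs only record "connection refused", you still need to know whether the client never sent a SYN, the server never replied with a SYN-ACK, or a firewall in between swallowed the packet. Only a packet capture gives a definitive answer.

At its core, packet capture analysis intercepts raw packets at different layers of the OSI model and reconstructs what actually happened on the wire. At L2 you look at ARP and VLAN tags, at L3 at IP routing and TTL, at L4 at TCP handshakes, retransmissions, and windows, and at L7 at HTTP requests and DNS queries. Faults at different layers call for different filters and analysis methods, so working out which layer the problem is in comes first.

Capture tools are built on libpcap (Linux) or WinPcap/Npcap (Windows). libpcap uses BPF (Berkeley Packet Filter) to filter packets in kernel space and copies only matching packets to user space, which avoids the performance hit of capturing everything. A BPF filter is compiled to bytecode and executed in an in-kernel virtual machine, which is why a tcpdump filter expression is far more efficient than capturing everything and running grep afterwards.

Roles of the three tools: tcpdump captures on the server, tshark (the command-line version of Wireshark) analyzes captures on hosts without a GUI, and Wireshark runs on a workstation for offline analysis of pcap files.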

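1.2 Technical Characteristics

Zero intrusion: capturing changes no network configuration and does not affect existing traffic; it can be started and stopped at any time
Full-stack visibility: every byte from the Ethernet frame header to the application payload can be inspected, so nothing is missed the way it can be in logs
Precise timestamps: microsecond-resolution timestamps allow accurate calculation of RTT, retransmission intervals, and DNS resolution time
Replayable: a pcap file can be analyzed repeatedly, and different engineers can verify conclusions independently, which suits post-incident review
In-kernel BPF filtering: filtering happens in kernel space, so the target traffic can be captured precisely even on busy links

1.3 Use Cases

TCP connection establishment failures (SYN timeout, RST, abnormal handshake)
DNS resolution problems (slow resolution, NXDOMAIN, DNS hijacking)
HTTP/HTTPS request debugging (unexpected status codes, TLS handshake failures, certificate problems)
Latency localization (separating client latency, network latency, and server processing latency)
Packet loss and retransmission analysis (finding where and why packets are lost)
Security forensics (anomalous traffic, port scans, data leak detection)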

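1.4 Environment Requirements

| Component | Version | Notes |
| --- | --- | --- |
| tcpdump | 4.99.x | Preinstalled on most Linux distributions; apt install tcpdump |
| Wireshark | 4.4.x | Installed on a workstation for offline analysis of pcap files |
| tshark | 4.4.x | Ships with Wireshark; apt install tshark |
| libpcap | 1.10.x | Library that tcpdump depends on; usually installed together with tcpdump |
| Operating system | Ubuntu 22.04+ / CentOS 8+ | Kernel 4.x+ supports the full BPF feature set |
| Privileges | root or CAP_NET_RAW | Capturing requires raw socket privileges |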

II. Detailed Steps

2.1 Core tcpdump Usage

2.1.1 Basic Syntax and Common Options

Basic tcpdump syntax:

tcpdump [options] [BPF filter expression]

Quick reference for common options:

# Core options
-i eth0           # Select the interface; -i any captures on all interfaces
-n                # Do not resolve host names (show IPs directly)
-nn               # Do not resolve host names or port names (80 instead of http)
-v / -vv / -vvv   # Increasing verbosity; -vv shows TTL, IP ID, etc.
-c 100            # Stop automatically after 100 packets
-s 0              # Capture full packets (0 selects the default snaplen of 262144 bytes, which is enough)
-w capture.pcap   # Write to a pcap file (for later analysis in Wireshark)
-r capture.pcap   # Read a pcap file
-A                # Print packet contents as ASCII (good for plaintext HTTP)
-X                # Print both hex and ASCII
-e                # Show the link-layer header (MAC addresses, VLAN tags)
-q                # Terse output, protocol summary only
-t                # Do not print timestamps
-tttt             # Print timestamps with the full date and time

2.1.2 BPF Filter Expressions

BPF filters run in kernel space, so a well-written filter expression is the key to efficient capturing. The syntax is built from primitives and logical operators:

# Filter by host
tcpdump -nn -i eth0 host 10.0.1.50                # Source or destination IP
tcpdump -nn -i eth0 src host 10.0.1.50            # Source IP only
tcpdump -nn -i eth0 dst host 10.0.1.50            # Destination IP only

# Filter by port
tcpdump -nn -i eth0 port 443                      # Source or destination port
tcpdump -nn -i eth0 src port 3306                 # Source port only
tcpdump -nn -i eth0 portrange 8000-8080           # Port range

# Filter by network
tcpdump -nn -i eth0 net 10.0.1.0/24               # Entire subnet

# Filter by protocol
tcpdump -nn -i eth0 tcp                           # TCP only
tcpdump -nn -i eth0 udp                           # UDP only
tcpdump -nn -i eth0 icmp                          # ICMP only

# Combined filters (and / or / not)
tcpdump -nn -i eth0 host 10.0.1.50 and port 80    # Port 80 of a given host
tcpdump -nn -i eth0 tcp and not port 22           # TCP traffic excluding SSH
tcpdump -nn -i eth0 'src 10.0.1.50 and (dst port 80 or dst port 443)'   # Parentheses must be quoted for the shell

# Filter by TCP flags (advanced)
tcpdump -nn -i eth0 'tcp[tcpflags] & (tcp-syn) != 0'            # Packets with SYN set
tcpdump -nn -i eth0 'tcp[tcpflags] & (tcp-rst) != 0'            # Packets with RST set
tcpdump -nn -i eth0 'tcp[tcpflags] == tcp-syn'                  # SYN only (excludes SYN-ACK)
tcpdump -nn -i eth0 'tcp[tcpflags] & (tcp-syn|tcp-fin) != 0'    # SYN or FIN

# Filter by packet size
tcpdump -nn -i eth0 'greater 1000'                # Packets of 1000 bytes or more
tcpdump -nn -i eth0 'less 64'                     # Packets of 64 bytes or less (possibly bare ACKs)

2.1.3 Reading the Output

Default tcpdump output format:

14:32:05.123456 IP 10.0.1.50.45678 > 10.0.1.100.80: Flags [S], seq 1234567890, win 65535, options [mss 1460,sackOK,TS val 123456 ecr 0,nop,wscale 7], length 0

Meaning of each field:

14:32:05.123456: timestamp (microsecond resolution)
10.0.1.50.45678: source IP.source port
10.0.1.100.80: destination IP.destination port
Flags [S]: TCP flags, S=SYN, .=ACK, P=PSH, F=FIN, R=RST
seq: sequence number
win: TCP window size
options: TCP options (MSS, SACK, timestamps, window scaling)
length: payload length

Common flag combinations: [S]=SYN, [S.]=SYN-ACK, [.]=ACK, [P.]=PSH-ACK, [F.]=FIN-ACK, [R.]=RST-ACK
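
The field layout above can be turned into a small parser. The following is a minimal sketch, assuming IPv4 TCP lines in the default `tcpdump -nn` format shown above; the function name `parse_tcpdump_line` is made up for illustration and is not part of tcpdump.

```shell
#!/usr/bin/env bash
# Sketch: split one line of default `tcpdump -nn` TCP output into
# "src_ip src_port dst_ip dst_port flags".
# Assumes the layout "TIME IP SRC.PORT > DST.PORT: Flags [X], ..." (IPv4 only).
parse_tcpdump_line() {
    awk '{
        src = $3; dst = $5
        sub(/:$/, "", dst)                    # drop the colon after the destination
        sport = src; sub(/.*\./, "", sport)   # text after the last dot is the port
        dport = dst; sub(/.*\./, "", dport)
        sub(/\.[0-9]+$/, "", src)             # strip the port, leaving the IP
        sub(/\.[0-9]+$/, "", dst)
        flags = $7; gsub(/\[|\]|,/, "", flags)  # "[S.]," becomes "S."
        printf "%s %s %s %s %s\n", src, sport, dst, dport, flags
    }'
}

# Example: the SYN line from the section above
echo '14:32:05.123456 IP 10.0.1.50.45678 > 10.0.1.100.80: Flags [S], seq 1234567890, win 65535, length 0' \
    | parse_tcpdump_line
```

Piping live output through such a function (with `tcpdump -l` for line buffering) gives one compact record per packet that is easy to feed into sort and uniq.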

2.1.4 Saving and Reading pcap Files

# Capture to a file (in production, limit it with -c or -G)
tcpdump -nn -i eth0 -w /tmp/capture.pcap -c 10000 host 10.0.1.50

# Rotate by time (a new file every 3600 seconds; combined with -G, -W 24 makes tcpdump exit after 24 files)
tcpdump -nn -i eth0 -w /tmp/capture_%Y%m%d_%H%M%S.pcap -G 3600 -W 24 port 80

# Rotate by size (a new file about every 100 MB; -W 10 keeps a ring of 10 files and overwrites the oldest)
tcpdump -nn -i eth0 -w /tmp/capture.pcap -C 100 -W 10 port 443

# Read a pcap file and filter it
tcpdump -nn -r /tmp/capture.pcap 'tcp[tcpflags] & (tcp-rst) != 0'
tcpdump -nn -r /tmp/capture.pcap -c 20    # Show only the first 20 packets

2.2 Wireshark/tshark Analysis Techniques

2.2.1 Command-Line Analysis with tshark

Production servers have no GUI. tshark is the command-line version of Wireshark and has the same full protocol dissection capability:

# Capture live and print (like tcpdump, but with more detailed dissection)
tshark -i eth0 -f "port 80" -c 50

# Read a pcap file with a display filter
tshark -r capture.pcap -Y "tcp.analysis.retransmission"

# Print only selected fields (-T fields + -e)
tshark -r capture.pcap -Y "http.request" \
  -T fields -e frame.time -e ip.src -e http.host -e http.request.uri

# Count the distribution of HTTP response codes
tshark -r capture.pcap -Y "http.response" \
  -T fields -e http.response.code | sort | uniq -c | sort -rn

# Traffic per IP
tshark -r capture.pcap -q -z endpoints,ip

# TCP conversation statistics
tshark -r capture.pcap -q -z conv,tcp

# Export as JSON (convenient for scripts)
tshark -r capture.pcap -Y "dns" -T json > dns_packets.json

2.2.2 Capture Filters vs Display Filters

The two use completely different syntax, and mixing them up is the most common beginner mistake. Capture filters use BPF syntax and are applied while capturing (tcpdump expressions, tshark -f), for example port 80. Display filters use Wireshark's own syntax and are applied to packets that have already been captured (tshark -Y), for example tcp.port == 80.

2.2.3 Common Display Filters

# TCP analysis (the core of troubleshooting)
tcp.analysis.retransmission          # TCP retransmission
tcp.analysis.fast_retransmission     # Fast retransmission
tcp.analysis.duplicate_ack           # Duplicate ACK
tcp.analysis.zero_window             # Zero window (receiver buffer full)
tcp.analysis.window_update           # Window update
tcp.flags.reset == 1                 # RST packets
tcp.analysis.lost_segment            # Lost segment (gap in sequence numbers)
tcp.analysis.out_of_order            # Out-of-order packet

# HTTP analysis
http.request.method == "POST"        # POST requests
http.response.code >= 400            # 4xx/5xx errors
http.response.code == 502            # 502 Bad Gateway
http.time > 1                        # Response time over 1 second

# DNS analysis
dns.flags.rcode != 0                 # Failed DNS queries
dns.qry.name contains "example.com"  # Queries for a specific domain
dns.time > 0.5                       # DNS responses slower than 500 ms

# TLS analysis
tls.handshake.type == 1              # Client Hello
tls.handshake.type == 2              # Server Hello
tls.alert_message                    # TLS alerts

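2.2.4 Statistics

Wireshark's statistics features are very useful when troubleshooting, and tshark exposes them through the -z option:

# Protocol hierarchy statistics (a quick view of the traffic mix)
tshark -r capture.pcap -q -z io,phs

# I/O graph data (packets and bytes per second)
tshark -r capture.pcap -q -z io,stat,1

# TCP conversation statistics (find the problematic connections)
tshark -r capture.pcap -q -z conv,tcp

# HTTP request statistics
tshark -r capture.pcap -q -z http,tree

# DNS statistics
tshark -r capture.pcap -q -z dns,tree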

2.3 Analyzing the TCP Three-Way Handshake and Four-Way Teardown

2.3.1 Normal Handshake and Teardown

What a normal three-way handshake looks like in a capture (relative sequence numbers):

# Three-way handshake
Client -> Server: [S]   seq=0           win=65535   # Step 1: client sends SYN
Server -> Client: [S.]  seq=0 ack=1     win=65535   # Step 2: server replies with SYN-ACK
Client -> Server: [.]   seq=1 ack=1     win=65535   # Step 3: client sends ACK

# Four-way teardown
Client -> Server: [F.]  seq=100 ack=200   # Client sends FIN
Server -> Client: [.]   seq=200 ack=101   # Server ACKs
Server -> Client: [F.]  seq=200 ack=101   # Server sends FIN
Client -> Server: [.]   seq=101 ack=201   # Client ACKs

Command to capture handshake packets:

# Capture SYN, SYN-ACK, and RST (to investigate connection establishment problems)
tcpdump -nn -i eth0 'tcp[tcpflags] & (tcp-syn|tcp-rst) != 0' and host 10.0.1.100

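2.3.2 SYN Timeout Analysis

When the client sends a SYN but never receives a SYN-ACK, the kernel retransmits the SYN with exponential backoff. Linux retransmits 6 times by default (net.ipv4.tcp_syn_retries=6), for a total of about 127 seconds before giving up. The first retransmissions look like this:

14:00:01.000 Client -> Server: [S] seq=100        # First SYN
14:00:02.000 Client -> Server: [S] seq=100        # Retransmitted after 1 second
14:00:04.000 Client -> Server: [S] seq=100        # Retransmitted after 2 seconds
14:00:08.000 Client -> Server: [S] seq=100        # Retransmitted after 4 seconds
14:00:16.000 Client -> Server: [S] seq=100        # Retransmitted after 8 seconds

This pattern means the SYN or the SYN-ACK is being dropped somewhere in the network. Common causes: a firewall DROP rule, a security group that does not allow the traffic, a port with no listener (although this usually produces an RST instead), or SYN flood protection kicking in.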
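
The 127-second figure follows from the backoff arithmetic: 1 + 2 + 4 + 8 + 16 + 32 + 64 = 127. The sketch below reproduces that schedule, assuming a 1-second initial retransmission timeout that doubles on every retry; the function names are illustrative only.

```shell
#!/usr/bin/env bash
# Sketch: print when each SYN would be sent and when connect() gives up,
# assuming an initial RTO of 1 s that doubles on each retry.
syn_schedule() {
    local retries="$1" rto=1 elapsed=0 i
    echo "t=0s SYN (initial)"
    for ((i = 1; i <= retries; i++)); do
        elapsed=$((elapsed + rto))
        echo "t=${elapsed}s SYN retransmission #${i}"
        rto=$((rto * 2))
    done
    # After the last retransmission the kernel waits one more RTO before failing.
    echo "t=$((elapsed + rto))s give up"
}

# Total seconds until the connection attempt fails for a given tcp_syn_retries value.
syn_total_timeout() {
    syn_schedule "$1" | tail -n 1 | sed 's/[^0-9]*\([0-9]*\)s.*/\1/'
}

syn_schedule 6
echo "total: $(syn_total_timeout 6)s"
```

Comparing the timestamps in a capture against this schedule is a quick way to confirm that repeated SYNs are kernel retransmissions of one attempt and not separate connection attempts from the application.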

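2.3.3 RST Analysis

RST is TCP's "forced termination" signal, and it looks different depending on the scenario:

# Capture all RST packets
tcpdump -nn -i eth0 'tcp[tcpflags] & (tcp-rst) != 0' -c 100

# Common RST scenarios:
# 1. No listener on the port: SYN -> RST-ACK (returned immediately, meaning no service is bound to the port)
# 2. Rejected by a firewall: SYN -> RST (may carry a distinctive TTL that differs from the server's own RSTs)
# 3. Connection cut by a middlebox: an RST arrives suddenly in the middle of data transfer
# 4. Application closes abruptly: with SO_LINGER enabled and a linger time of 0, close() sends RST instead of FIN

A trick for telling where an RST came from: compare the TTL of the RST with the TTL of normal packets. If they differ, the RST very likely came from a device in the path (a firewall or load balancer) and not from the real destination server.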
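
The TTL comparison can be made a little more systematic by estimating how many hops away each sender is. This is a heuristic sketch only: it assumes senders start from one of the common initial TTL values 64, 128, or 255, which is not guaranteed, and the function names are invented for this example.

```shell
#!/usr/bin/env bash
# Sketch: estimate hop distance from an observed TTL and compare an RST
# against normal traffic. Assumes initial TTLs of 64, 128, or 255.
guess_initial_ttl() {
    local ttl="$1"
    if   (( ttl <= 64 ));  then echo 64
    elif (( ttl <= 128 )); then echo 128
    else                        echo 255
    fi
}

hops_from_ttl() {
    local ttl="$1"
    echo $(( $(guess_initial_ttl "$ttl") - ttl ))
}

# Usage: rst_origin_hint <ttl of normal packets> <ttl of the RST>
rst_origin_hint() {
    local normal="$1" rst="$2"
    if (( normal == rst )); then
        echo "same TTL: RST probably comes from the real peer"
    else
        echo "TTL differs (normal ~$(hops_from_ttl "$normal") hops, RST ~$(hops_from_ttl "$rst") hops): suspect a middlebox"
    fi
}

rst_origin_hint 57 57
rst_origin_hint 57 250
```

The observed TTL values themselves can be read from `tcpdump -nn -v` output or from the `ip.ttl` field in tshark.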

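2.3.4 TCP Retransmission and Out-of-Order Analysis

# Count retransmitted packets with tshark
tshark -r capture.pcap -Y "tcp.analysis.retransmission" | wc -l

# Compute the retransmission rate
total=$(tshark -r capture.pcap -Y "tcp" | wc -l)
retrans=$(tshark -r capture.pcap -Y "tcp.analysis.retransmission" | wc -l)
# Multiply before dividing so bc's fixed scale does not truncate small ratios
echo "Retransmission rate: $(echo "scale=4; $retrans*100/$total" | bc)%"

# Reference values for the retransmission rate:
# < 0.1%   Normal
# 0.1%-1%  Light packet loss, may affect performance
# > 1%     Serious packet loss, investigate the network path
# > 5%     Network is basically unusable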

2.4 HTTP/HTTPS Traffic Analysis

2.4.1 Capturing Plaintext HTTP

# Capture HTTP requests and responses (-A prints ASCII; -l line-buffers the output so grep sees it immediately)
tcpdump -nn -l -i eth0 -A -s 0 'tcp port 80 and host 10.0.1.100' | grep -E "^(GET|POST|PUT|DELETE|HTTP/)"

# Extract full HTTP details with tshark
tshark -r capture.pcap -Y "http.request" \
  -T fields -e frame.time -e ip.src -e ip.dst \
  -e http.request.method -e http.host -e http.request.uri -e http.content_length

2.4.2 Analyzing Encrypted HTTPS Traffic

Once encrypted, HTTPS content cannot be read directly, but the start of the TLS handshake is sent in plaintext:

# Capture the TLS handshake (the Client Hello carries the SNI, which reveals the domain being accessed)
tshark -r capture.pcap -Y "tls.handshake.type == 1" \
  -T fields -e ip.src -e tls.handshake.extensions_server_name

# Check the TLS version
tshark -r capture.pcap -Y "tls.handshake.type == 2" \
  -T fields -e ip.src -e tls.handshake.version

# Check certificate information (visible without decryption only up to TLS 1.2; TLS 1.3 encrypts the Certificate message)
tshark -r capture.pcap -Y "tls.handshake.type == 11" \
  -T fields -e tls.handshake.certificate

Decrypting HTTPS in a debugging environment (via SSLKEYLOGFILE):

# Set the environment variable so the client exports its TLS session keys
export SSLKEYLOGFILE=/tmp/tls_keys.log

# Send a request with curl (keys are written automatically if the curl build supports key logging)
curl https://api.example.com/health

# Decrypt in tshark with the key log file
tshark -r capture.pcap -o "tls.keylog_file:/tmp/tls_keys.log" -Y "http2"
# In Wireshark: Edit -> Preferences -> Protocols -> TLS -> (Pre)-Master-Secret log filename

2.5 DNS Traffic Analysis

2.5.1 Capturing DNS Queries and Responses

# Capture all DNS traffic
tcpdump -nn -i eth0 port 53

# Dissect DNS details with tshark
tshark -r capture.pcap -Y "dns" \
  -T fields -e frame.time -e ip.src -e ip.dst \
  -e dns.qry.name -e dns.qry.type -e dns.a -e dns.flags.rcode

# DNS rcode meanings:
# 0 = NoError (success)
# 2 = ServFail (server failure)
# 3 = NXDomain (domain does not exist)
# 5 = Refused (query refused)

2.5.2 Locating DNS Resolution Latency

# List the slowest DNS responses
tshark -r capture.pcap -Y "dns.flags.response == 1" \
  -T fields -e dns.qry.name -e dns.time | sort -t$'\t' -k2 -rn | head -20

# A normal DNS response should arrive within 10 ms (DNS server in the same datacenter)
# Above 100 ms, investigate: DNS server load, network latency, the recursive resolution chain

2.5.3 Detecting DNS Hijacking

# Compare the answers from different DNS servers
# Start the capture first, then query the internal and the public DNS server
dig @10.0.1.2 example.com A
dig @8.8.8.8 example.com A

# In the pcap, check for DNS responses from unexpected sources
tshark -r capture.pcap -Y "dns.flags.response == 1" \
  -T fields -e ip.src -e dns.qry.name -e dns.a | sort -u
# If a response comes from an IP that is not one of the configured DNS servers, DNS hijacking is possible

III. Example Code and Configuration

3.1 Automated Capture and Analysis Scripts

3.1.1 Scheduled Capture with Automatic Rotation

#!/bin/bash
# File: auto_capture.sh
# Purpose: scheduled capture with automatic rotation and disk space protection
set -euo pipefail

# ========== Configuration ==========
IFACE="${1:-eth0}"                          # Interface, default eth0
FILTER="${2:-tcp port 80 or tcp port 443}"  # BPF filter
CAPTURE_DIR="/data/pcap"                    # Directory for capture files
ROTATE_SEC=3600                             # Rotate to a new file every hour
MAX_FILES=48                                # Keep at most 48 files (2 days)
MAX_DISK_PERCENT=80                         # Clean up old files when disk usage exceeds 80%
SNAP_LEN=0                                  # Capture full packets; set to 96 for headers only
# ===================================

mkdir -p "${CAPTURE_DIR}"

# Disk space check
check_disk() {
    local usage
    # -P keeps each filesystem on one line even when the device name is long
    usage=$(df -P "${CAPTURE_DIR}" | awk 'NR==2 {gsub(/%/,""); print $5}')
    if [[ ${usage} -ge ${MAX_DISK_PERCENT} ]]; then
        echo "[$(date '+%F %T')] Disk usage ${usage}% exceeds the threshold, removing old files"
        # Single pass: keep roughly the newest half of MAX_FILES, delete the rest.
        # "|| true" stops pipefail from aborting the script when no files match.
        ls -1t "${CAPTURE_DIR}"/capture_*.pcap 2>/dev/null | tail -n +$((MAX_FILES/2)) | xargs -r rm -f || true
    fi
}

# Remove files beyond the retention count
cleanup_old_files() {
    local count
    count=$(ls -1 "${CAPTURE_DIR}"/capture_*.pcap 2>/dev/null | wc -l || true)
    if [[ ${count} -gt ${MAX_FILES} ]]; then
        ls -1t "${CAPTURE_DIR}"/capture_*.pcap | tail -n +$((MAX_FILES+1)) | xargs -r rm -f
        echo "[$(date '+%F %T')] Removed old files, keeping ${MAX_FILES}"
    fi
}

# Pre-start check
check_disk

echo "[$(date '+%F %T')] Starting capture: iface=${IFACE}, filter='${FILTER}'"
echo "[$(date '+%F %T')] Rotation interval=${ROTATE_SEC}s, max files=${MAX_FILES}"

# -G rotates by seconds; with -G, -W makes tcpdump exit after MAX_FILES files;
# -Z root keeps tcpdump running as root so it can write into CAPTURE_DIR.
# FILTER is deliberately unquoted so it is split into separate arguments.
tcpdump -nn -i "${IFACE}" \
    -s "${SNAP_LEN}" \
    -w "${CAPTURE_DIR}/capture_%Y%m%d_%H%M%S.pcap" \
    -G "${ROTATE_SEC}" \
    -W "${MAX_FILES}" \
    -Z root \
    ${FILTER} &

TCPDUMP_PID=$!
echo "[$(date '+%F %T')] tcpdump started: PID=${TCPDUMP_PID}"

# Periodic cleanup while tcpdump is running
while kill -0 ${TCPDUMP_PID} 2>/dev/null; do
    sleep 300
    check_disk
    cleanup_old_files
done

echo "[$(date '+%F %T')] tcpdump has exited"

3.2 TCP Connection Quality Analysis Script

3.2.1 Extracting Key Metrics from a pcap

#!/bin/bash
# File: tcp_quality_report.sh
# Purpose: analyze a pcap file and print a TCP connection quality report
set -euo pipefail

PCAP_FILE="${1:?Usage: $0 <path to pcap file>}"

if [[ ! -f "${PCAP_FILE}" ]]; then
    echo "Error: file not found - ${PCAP_FILE}"
    exit 1
fi

echo "=============================="
echo " TCP Connection Quality Report"
echo " File: ${PCAP_FILE}"
echo " Time: $(date '+%F %T')"
echo "=============================="
echo ""

# Total packets
total_packets=$(tshark -r "${PCAP_FILE}" -Y "tcp" 2>/dev/null | wc -l)
echo "[Basic statistics]"
echo "  Total TCP packets: ${total_packets}"

# Retransmission statistics
retrans=$(tshark -r "${PCAP_FILE}" -Y "tcp.analysis.retransmission" 2>/dev/null | wc -l)
fast_retrans=$(tshark -r "${PCAP_FILE}" -Y "tcp.analysis.fast_retransmission" 2>/dev/null | wc -l)
if [[ ${total_packets} -gt 0 ]]; then
    # Multiply before dividing so bc's fixed scale does not truncate small ratios
    retrans_rate=$(echo "scale=4; ${retrans}*100/${total_packets}" | bc)
else
    retrans_rate="0"
fi

echo ""
echo "[Retransmission analysis]"
echo "  Retransmissions:      ${retrans}"
echo "  Fast retransmissions: ${fast_retrans}"
echo "  Retransmission rate:  ${retrans_rate}%"

# Zero window
zero_window=$(tshark -r "${PCAP_FILE}" -Y "tcp.analysis.zero_window" 2>/dev/null | wc -l)
echo ""
echo "[Window analysis]"
echo "  Zero-window events: ${zero_window}"

# RST statistics
rst_count=$(tshark -r "${PCAP_FILE}" -Y "tcp.flags.reset == 1" 2>/dev/null | wc -l)
echo ""
echo "[Connection anomalies]"
echo "  RST packets: ${rst_count}"

# Out-of-order packets
ooo=$(tshark -r "${PCAP_FILE}" -Y "tcp.analysis.out_of_order" 2>/dev/null | wc -l)
dup_ack=$(tshark -r "${PCAP_FILE}" -Y "tcp.analysis.duplicate_ack" 2>/dev/null | wc -l)
echo "  Out-of-order packets: ${ooo}"
echo "  Duplicate ACKs: ${dup_ack}"

# Top 10 connections by retransmissions
echo ""
echo "[Top 10 connections by retransmissions]"
tshark -r "${PCAP_FILE}" -Y "tcp.analysis.retransmission" \
    -T fields -e ip.src -e tcp.srcport -e ip.dst -e tcp.dstport 2>/dev/null \
    | sort | uniq -c | sort -rn | head -10 \
    | awk '{printf "  %5d times  %s:%s -> %s:%s\n", $1, $2, $3, $4, $5}'

# Quality assessment
echo ""
echo "[Quality assessment]"
if (( $(echo "${retrans_rate} < 0.1" | bc -l) )); then
    echo "  Status: normal - retransmission rate below 0.1%"
elif (( $(echo "${retrans_rate} < 1.0" | bc -l) )); then
    echo "  Status: warning - retransmission rate ${retrans_rate}%, light packet loss"
elif (( $(echo "${retrans_rate} < 5.0" | bc -l) )); then
    echo "  Status: severe - retransmission rate ${retrans_rate}%, poor network quality"
else
    echo "  Status: critical - retransmission rate ${retrans_rate}%, network basically unusable"
fi

3.3 Capture Command Cheat Sheet for Common Fault Scenarios

3.3.1 Scenario-to-Command Reference

#!/bin/bash
# File: capture_cheatsheet.sh
# Purpose: capture commands for common fault scenarios, run as needed
set -euo pipefail

IFACE="${1:-eth0}"
OUTPUT_DIR="/tmp/captures"
mkdir -p "${OUTPUT_DIR}"

case "${2:-help}" in
    # Scenario 1: TCP connection establishment failure
    conn-fail)
        echo "Capturing SYN/SYN-ACK/RST packets to investigate connection setup"
        tcpdump -nn -i "${IFACE}" -w "${OUTPUT_DIR}/conn_fail.pcap" \
            'tcp[tcpflags] & (tcp-syn|tcp-rst) != 0' -c 1000
        ;;

    # Scenario 2: DNS resolution problems
    dns)
        echo "Capturing all DNS traffic"
        tcpdump -nn -i "${IFACE}" -w "${OUTPUT_DIR}/dns.pcap" \
            'port 53' -c 500
        ;;

    # Scenario 3: HTTP 5xx errors
    http-err)
        TARGET="${3:?target IP required}"
        echo "Capturing HTTP traffic with ${TARGET}"
        tcpdump -nn -i "${IFACE}" -w "${OUTPUT_DIR}/http_err.pcap" \
            "host ${TARGET} and tcp port 80" -c 2000
        ;;

    # Scenario 4: TLS handshake failure
    tls-fail)
        TARGET="${3:?target IP required}"
        echo "Capturing TLS handshake packets with ${TARGET}"
        tcpdump -nn -i "${IFACE}" -w "${OUTPUT_DIR}/tls_fail.pcap" \
            "host ${TARGET} and tcp port 443" -c 1000
        ;;

    # Scenario 5: latency analysis
    latency)
        TARGET="${3:?target IP required}"
        echo "Capturing all TCP traffic with ${TARGET} (timestamps are stored in the pcap)"
        # -tttt only affects printed output, so it is not needed together with -w
        tcpdump -nn -i "${IFACE}" -w "${OUTPUT_DIR}/latency.pcap" \
            "host ${TARGET} and tcp" -c 5000
        ;;

    # Scenario 6: packet loss and retransmission
    retrans)
        echo "Capturing all TCP traffic for 60 s, then analyzing retransmissions with tshark"
        # timeout exits with 124 when it stops tcpdump; "|| true" keeps set -e from aborting here
        timeout 60 tcpdump -nn -i "${IFACE}" -w "${OUTPUT_DIR}/retrans.pcap" tcp || true
        echo "Retransmission analysis:"
        tshark -r "${OUTPUT_DIR}/retrans.pcap" -Y "tcp.analysis.retransmission" \
            -T fields -e frame.time -e ip.src -e ip.dst -e tcp.srcport -e tcp.dstport \
            | head -50
        ;;

    # Scenario 7: full capture on a specific port
    port)
        PORT="${3:?port number required}"
        echo "Capturing all traffic on port ${PORT}"
        tcpdump -nn -i "${IFACE}" -w "${OUTPUT_DIR}/port_${PORT}.pcap" \
            "port ${PORT}" -c 5000
        ;;

    # Scenario 8: ICMP problems (ping fails)
    icmp)
        echo "Capturing ICMP traffic (ping/traceroute)"
        tcpdump -nn -i "${IFACE}" -w "${OUTPUT_DIR}/icmp.pcap" \
            'icmp or icmp6' -c 200
        ;;

    *)
        echo "Usage: $0 <interface> <scenario> [argument]"
        echo ""
        echo "Available scenarios:"
        echo "  conn-fail          TCP connection establishment failure"
        echo "  dns                DNS resolution problems"
        echo "  http-err <IP>      HTTP error investigation"
        echo "  tls-fail <IP>      TLS handshake failure"
        echo "  latency <IP>       Latency analysis"
        echo "  retrans            Packet loss and retransmission analysis"
        echo "  port <port>        Full capture on a specific port"
        echo "  icmp               ICMP troubleshooting"
        ;;
esac

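IV. Best Practices and Caveats

4.1 Best Practices

4.1.1 Designing a Capture Strategy

The central tension of capturing in production: you need enough data to pinpoint the problem, but you must not cause a noticeable performance impact on the business.

Controlling the performance impact:

# 1. Truncate packets with -s and capture headers only (the first 96 bytes are enough for most network-layer problems)
tcpdump -i eth0 -s 96 -w /tmp/headers_only.pcap

# 2. Limit the packet count so the capture cannot run unbounded
tcpdump -i eth0 -c 10000 -w /tmp/limited.pcap

# 3. BPF filters run in the kernel and are far cheaper than capturing everything and grepping afterwards
# Capture only TCP traffic for the target IP and exclude SSH (to avoid capturing your own session)
tcpdump -i eth0 'tcp and host 10.0.1.50 and not port 22' -w /tmp/target.pcap

# 4. Combine them: truncation + packet limit + precise filter
tcpdump -i eth0 -s 128 -c 50000 \
    'tcp port 443 and host 10.0.1.50' \
    -w /tmp/precise.pcap

Capture file rotation (essential for long-running captures):

# -C 100: start a new file about every 100 MB
# -W 10: with -C, keep at most 10 files and overwrite the oldest; with -G, exit after that many files
# -G 3600: rotate every 3600 seconds (1 hour)
# -z gzip: compress each file after rotation

# Option 1: rotate by size (suits uneven traffic)
tcpdump -i eth0 -w /data/capture/trace.pcap \
    -C 100 -W 10 'tcp port 80'

# Option 2: rotate by time (suits scheduled analysis; stops after 24 hourly files)
tcpdump -i eth0 -w /data/capture/trace_%Y%m%d_%H%M.pcap \
    -G 3600 -W 24 'tcp port 80'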

4.1.2 An Efficient Analysis Method

Reading a pcap packet by packet is the least efficient way to analyze it. A four-step approach from macro to micro works better:

# Step 1: protocol distribution overview, a quick view of the traffic mix
tshark -r capture.pcap -q -z io,phs

# Step 2: conversation statistics, find the endpoints exchanging the most traffic
tshark -r capture.pcap -q -z conv,tcp

# Step 3: filter for anomalous flows and focus on the problem traffic
# Retransmission statistics
tshark -r capture.pcap -q -z io,stat,1,"tcp.analysis.retransmission"
# HTTP status code statistics
tshark -r capture.pcap -q -z http,stat,

# Step 4: once a specific flow is identified, inspect it packet by packet with a display filter
tshark -r capture.pcap -Y "tcp.stream eq 42" -T fields \
    -e frame.time_relative -e ip.src -e ip.dst \
    -e tcp.flags.str -e tcp.len -e tcp.analysis.retransmission

Typical cases where batch statistics in tshark replace the GUI:

# Requests per second (QPS)
tshark -r capture.pcap -q -z io,stat,1,"http.request"

# Slowest DNS queries
tshark -r capture.pcap -Y "dns.flags.response == 1" \
    -T fields -e dns.qry.name -e dns.time | \
    sort -t$'\t' -k2 -rn | head -20

# Export the URL and status code of every HTTP response
# (on response packets, recent Wireshark versions expose the matching request URI as http.response_for.uri)
tshark -r capture.pcap -Y "http.response" \
    -T fields -e http.response_for.uri -e http.response.code | \
    sort | uniq -c | sort -rn

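4.1.3 Security and Compliance

Capturing in production touches private user data, so a formal process is required:

Access control: tcpdump needs the CAP_NET_RAW capability. Do not hand ordinary users root for this. Grant the capability precisely with setcap instead (adjust the path if tcpdump is installed elsewhere, for example /usr/bin/tcpdump):

  # Grant capture privileges only, not full root
  sudo setcap cap_net_raw,cap_net_admin=eip /usr/sbin/tcpdump

Sensitive data handling: capture files may contain plaintext passwords, cookies, tokens, and so on. Clean them up promptly after analysis and encrypt them in transit:

  # Store the pcap file encrypted (-pbkdf2 needs OpenSSL 1.1.1+ and derives the key more safely than the legacy default)
  openssl enc -aes-256-cbc -salt -pbkdf2 -in capture.pcap -out capture.pcap.enc
  # Securely delete the original file
  shred -vfz -n 3 capture.pcap

Approval process: production captures should go through a ticket that records who captured, when, why, and which IP ranges were involved, and that is archived for later audit.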

4.1.4 Remote Capture Options

# Capturing in containers: use nsenter to enter the target Pod's network namespace
# Step 1: get the container PID
CONTAINER_ID=$(crictl ps --name <container-name> -q)
PID=$(crictl inspect "$CONTAINER_ID" | jq .info.pid)

# Step 2: capture inside the network namespace
nsenter -t "$PID" -n tcpdump -i eth0 -s 128 -c 5000 \
    -w /tmp/pod_capture.pcap 'tcp port 8080'

# One-liner (handy for quick checks)
nsenter -t $(crictl inspect $(crictl ps --name myapp -q) | jq .info.pid) \
    -n tcpdump -nn -i eth0 -c 1000 -w /tmp/pod.pcap

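4.2 Caveats

4.2.1 Configuration Caveats

⚠️ Warning: three major risks of capturing in production

❗ Disk exhaustion: on a high-traffic interface, tcpdump without -C/-W limits can fill the disk within minutes and take the service down. Always check df -h first and use the rotation options.
❗ CPU overhead: when capturing everything on a 10 GbE interface, BPF filtering and packet copying in tcpdump can consume roughly 5%-15% of CPU. Check the headroom before capturing at peak times, and prefer precise BPF filters to reduce copying from kernel space to user space.
❗ Promiscuous mode risk: the -p option turns promiscuous mode off. On a shared network, promiscuous mode captures traffic not addressed to the host and may trigger security audit alerts.

4.2.2 Common Mistakes

4.2.3 Compatibility Issues

Linux distribution differences: the tcpdump 4.9.x shipped with CentOS 7 does not support the --print option; Ubuntu 22.04+ ships tcpdump 4.99.x, which dissects more protocols. Standardizing on 4.99+ is recommended.
Containers/K8s: in Docker's default bridge network mode, capturing on docker0 on the host shows container traffic. K8s CNIs (Calico/Cilium) use veth pairs, so you need to find the matching veth interface or use nsenter to enter the Pod's network namespace.
Cloud VPCs: AWS VPC Flow Logs record only five-tuple metadata, not packet contents. For full captures, use VPC Traffic Mirroring to mirror traffic to a dedicated analysis instance. Alibaba Cloud and Tencent Cloud offer similar traffic mirroring features.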

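V. Troubleshooting and Monitoring

5.1 Troubleshooting

5.1.1 Classic Case 1: TCP Connection Timeout (No Response to SYN)

Symptom: the application log reports Connection timed out, and curl requests to the target service time out.

Locating it with a capture:

# On the client, capture SYN packets to and from the target
tcpdump -nn -i eth0 'tcp[tcpflags] & (tcp-syn) != 0 and host 10.0.2.100' \
    -c 50 -w /tmp/syn_debug.pcap

# Watch live (without writing a file)
tcpdump -nn -i eth0 'tcp[tcpflags] & (tcp-syn) != 0 and host 10.0.2.100'

Reading typical output:

# Only SYN, no SYN-ACK: the peer did not receive it or did not respond
14:23:01.001 IP 10.0.1.10.45678 > 10.0.2.100.8080: Flags [S], seq 123456
14:23:02.003 IP 10.0.1.10.45678 > 10.0.2.100.8080: Flags [S], seq 123456   # Retransmitted after 1 s
14:23:04.007 IP 10.0.1.10.45678 > 10.0.2.100.8080: Flags [S], seq 123456   # Retransmitted after 2 s
14:23:08.015 IP 10.0.1.10.45678 > 10.0.2.100.8080: Flags [S], seq 123456   # Retransmitted after 4 s (exponential backoff)

Investigation path: SYN retransmissions with exponentially growing intervals (1s → 2s → 4s) are default Linux kernel behavior. SYN without SYN-ACK means a packet is being dropped on the path. Check in order:

Firewall on the target host: iptables -L -n | grep 8080
ACLs on intermediate network devices: capture on the target host at the same time to confirm whether the SYN arrives
Security group/NACL rules (cloud environments)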

5.1.2 Classic Case 2: Intermittent Packet Loss (TCP Retransmission Analysis)

Symptom: occasional business timeouts, and monitoring shows the TCP retransmission rate rising periodically.

Capture and statistics:

# Capture for 5 minutes, TCP only (-G 300 -W 1 stops after one 300-second file)
tcpdump -i eth0 -s 96 -w /tmp/retrans.pcap \
    'tcp and host 10.0.2.100' -G 300 -W 1

# Count retransmissions and their distribution over time with tshark
tshark -r /tmp/retrans.pcap -q -z io,stat,10,"tcp.analysis.retransmission"

Reading the output:

| Interval       | tcp.analysis.retransmission |
|----------------|-----------------------------|
| 0  <>  10      | 2                           |
| 10 <>  20      | 3                           |
| 20 <>  30      | 47                          |  ← retransmissions spike in this interval
| 30 <>  40      | 52                          |  ← still abnormal
| 40 <>  50      | 5                           |

Narrowing it down:

# Extract source and destination of retransmitted packets to find the worst-affected pair
tshark -r /tmp/retrans.pcap \
    -Y "tcp.analysis.retransmission" \
    -T fields -e ip.src -e ip.dst -e tcp.srcport -e tcp.dstport | \
    sort | uniq -c | sort -rn | head -10

Investigation path: if retransmissions cluster in a specific time window, check the interface and switch port error counters for that window (ethtool -S eth0 | grep error); if they cluster on a specific IP pair, check whether the path MTU is consistent (ping -M do -s 1472 <target IP>).
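
The 1472 in the ping command is the standard 1500-byte Ethernet MTU minus the 20-byte IPv4 header (without options) and the 8-byte ICMP header. A small sketch, with illustrative function names, that derives the payload size for other candidate MTUs and prints the matching probe commands:

```shell
#!/usr/bin/env bash
# Sketch: compute the ICMP payload size that exactly fills a given MTU,
# assuming an IPv4 header without options (20 bytes) and an ICMP header (8 bytes).
icmp_payload_for_mtu() {
    local mtu="$1"
    echo $(( mtu - 20 - 8 ))
}

# Print (not run) one ping probe per candidate MTU; -M do sets the Don't Fragment bit.
print_mtu_probes() {
    local target="$1" mtu
    shift
    for mtu in "$@"; do
        echo "ping -c 3 -M do -s $(icmp_payload_for_mtu "$mtu") ${target}   # probes MTU ${mtu}"
    done
}

print_mtu_probes 10.0.2.100 1500 1450 1400
```

Running the printed commands from largest to smallest shows the first size that gets through without a "message too long" or fragmentation-needed error, which approximates the path MTU.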

5.1.3 Classic Case 3: HTTP 502/504 Gateway Errors

Symptom: Nginx returns 502/504, and its error log shows upstream timed out.

Capture on the Nginx host (the command below covers the Nginx-to-upstream side):

# Capture traffic from Nginx to the upstream (assuming upstream port 8080)
tcpdump -nn -i eth0 -s 0 \
    'tcp port 8080 and host 10.0.3.50' \
    -w /tmp/upstream.pcap -c 10000

Analyze the request-to-response time with tshark:

# Extract HTTP requests and responses together with the time between them
tshark -r /tmp/upstream.pcap \
    -Y "http.request or http.response" \
    -T fields -e frame.time_relative -e ip.src -e ip.dst \
    -e http.request.method -e http.request.uri \
    -e http.response.code -e http.time

Key judgment: if http.time (the interval from request to response) exceeds Nginx's proxy_read_timeout, slow upstream processing is causing the 504. If an RST from the upstream appears at the TCP layer, the upstream reset the connection, for example because its process crashed or restarted, which typically surfaces as a 502.

5.1.4 Classic Case 4: Slow DNS Resolution

Symptom: slow application startup, high latency on the first HTTP request, and dig queries taking more than 1 second.

Capture and analyze the DNS query path:

# Capture DNS traffic over UDP
tcpdump -nn -i eth0 'udp port 53' -w /tmp/dns.pcap -c 500

# List the time taken by each query with tshark
tshark -r /tmp/dns.pcap -Y "dns.flags.response == 1" \
    -T fields -e dns.qry.name -e dns.time -e dns.flags.rcode | \
    sort -t$'\t' -k2 -rn | head -20

Typical output (rcode shown by name for readability; -T fields normally prints the numeric code):

api.example.com         2.105    No error       ← 2.1 s to resolve, abnormal
cdn.example.com         0.003    No error       ← normal
db.internal.local       5.002    Server failure ← 5 s timeout, SERVFAIL

Investigation path: focus on records with dns.time > 1s. Server failure usually means the recursive DNS server could not resolve the name (misconfigured internal domain, unreachable upstream DNS). Check the nameserver order in /etc/resolv.conf and confirm that the first DNS server is reachable.
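
The "focus on dns.time > 1s" step can be automated on the tab-separated output that `tshark -T fields` produces. This is a minimal sketch; `slow_dns` is a hypothetical helper name, and the sample rows below simply reuse the illustrative values from the typical output above.

```shell
#!/usr/bin/env bash
# Sketch: keep only DNS rows whose response time (column 2, seconds)
# exceeds a threshold. Input: tab-separated "name<TAB>time<TAB>rcode".
slow_dns() {
    local threshold="${1:-1}"
    awk -F'\t' -v t="$threshold" '$2 + 0 > t + 0 { print }'
}

# Example with the sample rows from the typical output above
printf 'api.example.com\t2.105\tNo error\ncdn.example.com\t0.003\tNo error\ndb.internal.local\t5.002\tServer failure\n' \
    | slow_dns 1
```

In practice the printf line would be replaced by the tshark command from this case, piped straight into `slow_dns 1`.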

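5.2 Performance Monitoring

5.2.1 Baseline Network Quality Metrics

In day-to-day operations, a network quality baseline is what makes anomalies detectable. The following four metrics cover the vast majority of network-layer problems:

| Metric | Normal range | Alert threshold | Notes |
| --- | --- | --- | --- |
| RTT (round-trip time) | Same datacenter < 1 ms, cross-datacenter < 10 ms | > 50 ms or a sudden 5x increase | Computed from the SYN to SYN-ACK time difference in the TCP handshake |
| TCP retransmission rate | < 0.1% | > 1% | tcp.analysis.retransmission packets / total TCP packets |
| Zero-window frequency | Very rare | > 10 per minute | Receiver buffer is full; tcp.analysis.zero_window |
| RST ratio | < 0.5% | > 2% | Abnormal connection termination, possibly a port with no listener or a firewall block |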
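
The RTT row relies on the time difference between a SYN and its SYN-ACK. As a rough sketch, assuming both timestamps come from default tcpdump output on the same day with six fractional digits (the function names are illustrative), the difference can be computed like this:

```shell
#!/usr/bin/env bash
# Sketch: handshake RTT from two tcpdump timestamps (HH:MM:SS.ffffff).
# Assumes six-digit microseconds and that both packets fall on the same day.
ts_to_us() {
    local h m s us
    IFS=':.' read -r h m s us <<< "$1"
    # 10# forces base 10 so values such as "08" are not parsed as octal
    echo $(( ((10#$h * 60 + 10#$m) * 60 + 10#$s) * 1000000 + 10#$us ))
}

# Usage: rtt_ms <SYN timestamp> <SYN-ACK timestamp>  -> milliseconds with 3 decimals
rtt_ms() {
    local d=$(( $(ts_to_us "$2") - $(ts_to_us "$1") ))
    printf '%d.%03d\n' $((d / 1000)) $((d % 1000))
}

rtt_ms 14:32:05.123456 14:32:05.123956
```

Measured on the client side, this value approximates the network round trip plus the server's SYN handling time, so it is best compared against a baseline taken the same way.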

5.2.2 Collecting the Metrics

# Quick snapshot of current network quality
tshark -i eth0 -a duration:60 -q \
    -z io,stat,60,"tcp.analysis.retransmission","tcp.analysis.zero_window","tcp.flags.reset==1"

5.2.3 Continuous Network Quality Monitoring Script Based on tshark

#!/bin/bash
set -euo pipefail

# Network quality monitoring script: samples once per minute and prints
# metrics in Prometheus text format.
# Usage: ./net_monitor.sh <interface> [Prometheus Pushgateway address]

IFACE="${1:-eth0}"
PUSHGW="${2:-}"
INTERVAL=60
TMPFILE=$(mktemp /tmp/netmon.XXXXXX.pcap)

cleanup() { rm -f "$TMPFILE"; }
trap cleanup EXIT

while true; do
    # Capture for 60 seconds; -s 96 keeps headers only
    timeout "${INTERVAL}" tcpdump -i "${IFACE}" -s 96 -w "$TMPFILE" \
        'tcp' 2>/dev/null || true

    # Count packets for each metric
    TOTAL=$(tshark -r "$TMPFILE" -T fields -e frame.number 2>/dev/null | wc -l)
    RETRANS=$(tshark -r "$TMPFILE" -Y "tcp.analysis.retransmission" \
        -T fields -e frame.number 2>/dev/null | wc -l)
    ZERO_WIN=$(tshark -r "$TMPFILE" -Y "tcp.analysis.zero_window" \
        -T fields -e frame.number 2>/dev/null | wc -l)
    RST_COUNT=$(tshark -r "$TMPFILE" -Y "tcp.flags.reset==1" \
        -T fields -e frame.number 2>/dev/null | wc -l)

    # Compute ratios (guard against division by zero)
    if [ "$TOTAL" -gt 0 ]; then
        RETRANS_RATE=$(awk "BEGIN{printf \"%.4f\", $RETRANS/$TOTAL}")
        RST_RATE=$(awk "BEGIN{printf \"%.4f\", $RST_COUNT/$TOTAL}")
    else
        RETRANS_RATE="0"
        RST_RATE="0"
    fi

    # The Prometheus text format expects timestamps in milliseconds
    TIMESTAMP=$(( $(date +%s) * 1000 ))

    # Print metrics in Prometheus text format
    cat <<METRICS
# HELP net_tcp_total Total TCP packets in the sampling window
net_tcp_total{iface="${IFACE}"} ${TOTAL} ${TIMESTAMP}
# HELP net_tcp_retrans_rate TCP retransmission rate
net_tcp_retrans_rate{iface="${IFACE}"} ${RETRANS_RATE} ${TIMESTAMP}
# HELP net_tcp_zero_window Zero-window events
net_tcp_zero_window{iface="${IFACE}"} ${ZERO_WIN} ${TIMESTAMP}
# HELP net_tcp_rst_rate RST ratio
net_tcp_rst_rate{iface="${IFACE}"} ${RST_RATE} ${TIMESTAMP}
METRICS

    # Push to the Prometheus Pushgateway if an address was given
    # (pushed samples carry no timestamp)
    if [ -n "$PUSHGW" ]; then
        cat <<METRICS | curl -s --data-binary @- "http://${PUSHGW}/metrics/job/net_monitor/instance/$(hostname)"
net_tcp_total{iface="${IFACE}"} ${TOTAL}
net_tcp_retrans_rate{iface="${IFACE}"} ${RETRANS_RATE}
net_tcp_zero_window{iface="${IFACE}"} ${ZERO_WIN}
net_tcp_rst_rate{iface="${IFACE}"} ${RST_RATE}
METRICS
    fi

    # Remove the temporary file before the next round
    rm -f "$TMPFILE"
done

5.3 Backup and Recovery

5.3.1 pcap File Management Policy

pcap files are large and contain sensitive information, so they need explicit lifecycle management:

#!/bin/bash
set -euo pipefail

# Automatic pcap cleanup script
# Suggested crontab entry: 0 2 * * * /opt/scripts/pcap_cleanup.sh

PCAP_DIR="/data/capture"
ARCHIVE_DIR="/data/capture/archive"
RETAIN_DAYS=7
ARCHIVE_DAYS=30

mkdir -p "$ARCHIVE_DIR"

# Compress and archive pcap files older than 7 days
find "$PCAP_DIR" -maxdepth 1 -name "*.pcap" -mtime +${RETAIN_DAYS} | while read -r f; do
    gzip -9 "$f"
    mv "${f}.gz" "$ARCHIVE_DIR/"
    echo "[$(date)] archived: $(basename "$f")"
done

# Securely delete archives older than 30 days
# (-u is required: without it shred overwrites the file but does not remove it)
find "$ARCHIVE_DIR" -name "*.pcap.gz" -mtime +${ARCHIVE_DAYS} | while read -r f; do
    shred -fzu "$f"
    echo "[$(date)] deleted: $(basename "$f")"
done
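Before scheduling a cleanup job like this, it can help to preview which files a retention rule would touch. The sketch below is a dry run only; the function name `list_expired` is hypothetical, and the demo relies on GNU `touch -d` to create an artificially old file in a scratch directory.

```shell
#!/bin/bash
# Hypothetical helper: prints the files a retention rule would match,
# without compressing or deleting anything.
# Arguments: directory, glob pattern, age in days
list_expired() {
    local dir="$1" pattern="$2" days="$3"
    find "$dir" -maxdepth 1 -type f -name "$pattern" -mtime +"$days" -print | sort
}

# Self-contained demo in a scratch directory (GNU touch -d is assumed)
demo_dir=$(mktemp -d)
touch -d '10 days ago' "$demo_dir/old.pcap"
touch "$demo_dir/new.pcap"
list_expired "$demo_dir" '*.pcap' 7    # lists only old.pcap
rm -rf "$demo_dir"
```

Note that `-mtime +7` matches files whose age, counted in whole days, is greater than 7, so a file must be at least 8 days old before it is listed.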

5.3.2 Quick Deployment of a Capture Environment

#!/bin/bash
set -euo pipefail

# One-step deployment of a packet capture and analysis environment
# Supports Debian/Ubuntu and RHEL/CentOS

echo "=== Quick deployment of capture environment ==="

if command -v apt-get &>/dev/null; then
    sudo apt-get update -qq
    # noninteractive: the tshark package otherwise asks a debconf question during install
    sudo env DEBIAN_FRONTEND=noninteractive apt-get install -y -qq tcpdump tshark ngrep mtr-tiny
elif command -v yum &>/dev/null; then
    # ngrep may require the EPEL repository to be enabled
    sudo yum install -y -q tcpdump wireshark-cli ngrep mtr
else
    echo "Unsupported package manager, please install manually" >&2
    exit 1
fi

# Create the capture directory
sudo mkdir -p /data/capture
sudo chmod 750 /data/capture

# Verify the installation
echo "--- Version information ---"
tcpdump --version 2>&1 | head -1
tshark --version 2>&1 | head -1

echo "=== Deployment complete ==="

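6. Summary

6.1 Key Technical Takeaways

✅ BPF filters are the key to capture efficiency: filtering in the kernel is far cheaper than capturing everything and filtering afterwards, so production captures must always specify a precise filter
✅ Analyze from macro to micro: Protocol Hierarchy → Conversations → filter on abnormal flows → single-packet analysis, instead of falling into inefficient packet-by-packet reading
✅ tshark is the core tool for batch analysis: statistics, export, and filtering in a single command; combined with the -z statistics modules it can replace most GUI operations
✅ Production captures must be bounded: -s to truncate payloads, -c to limit packet count, -C/-W for file rotation; three safeguards against filling the disk
✅ In container environments, use nsenter rather than installing tools in the container: entering the Pod's network namespace from the host keeps the container image clean and does not depend on tooling inside the container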
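To make the "bounded capture" point concrete, the sketch below assembles a tcpdump command line with the -s and -C/-W limits and prints the approximate worst-case disk footprint of the ring buffer. It only prints the command and does not run it. The function name `bounded_capture_cmd` and the output file name `ring.pcap` are assumptions; /data/capture is the directory used elsewhere in this article.

```shell
#!/bin/bash
# Hypothetical helper: prints a tcpdump command line with snaplen and file
# rotation limits applied, plus the approximate worst-case disk usage of
# the ring buffer (file size in MB x number of files).
# Arguments: interface, filter, [snaplen], [file size in MB], [file count]
bounded_capture_cmd() {
    local iface="$1" filter="$2" snaplen="${3:-96}" size_mb="${4:-100}" files="${5:-10}"
    echo "# worst case on disk: about $(( size_mb * files )) MB"
    echo "tcpdump -nn -i $iface -s $snaplen -C $size_mb -W $files -w /data/capture/ring.pcap '$filter'"
}

bounded_capture_cmd eth0 'tcp port 443' 96 100 10
```

For a one-shot capture, -c with a packet count can take the place of -C/-W; the point in both cases is that the upper bound on disk usage is known before the capture starts.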

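6.2 Directions for Further Study

High-performance capture with eBPF/XDP

Traditional tcpdump is built on libpcap, and every matched packet still has to be copied from the kernel to user space, which is expensive at high packet rates. eBPF can filter and aggregate entirely in the kernel, and XDP processes packets at the NIC driver layer; the resulting speedup is often cited as 10x or more, depending on the workload
Recommended tools: bpftrace, pwru (a kernel network path tracing tool from Cilium)
Practical suggestion: start with bpftrace -e 'kprobe:tcp_retransmit_skb { printf("retrans: %s\n", ntop(((struct sock *)arg0)->__sk_common.skc_daddr)); }' (sk_daddr is a kernel macro, so the underlying field __sk_common.skc_daddr is used here)

Packet Capture as a Service

In large clusters, capturing by SSH-ing into hosts one at a time is very inefficient. A centralized capture service can be built instead: an agent runs on every node, captures are triggered through an API, and pcap files are uploaded automatically to object storage
Open-source reference: Arkime (formerly Moloch) provides full-traffic indexing and search

AI-assisted traffic analysis

Feed the structured data exported by tshark (JSON format) to an LLM to help identify abnormal patterns and generate analysis reports
After exporting with tshark -r capture.pcap -T json, the AI can be asked to analyze TCP session state machine anomalies, identify retransmission patterns, and so on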

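6.3 References

tcpdump official manual: authoritative reference for BPF filter syntax
Wireshark display filter reference: complete list of display filter fields
Wireshark Wiki, TCP Analysis: detailed explanation of the TCP analysis flags
Arkime full-traffic analysis platform: large-scale pcap indexing and search
pwru, eBPF network path tracing: visualizing packet paths through the kernel network stack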

Appendix

A. Command Cheat Sheet

tcpdump BPF capture filters (kernel-level filtering, used while capturing):

# Basic filters
tcpdump 'host 10.0.1.50'                  # a host (source or destination)
tcpdump 'src host 10.0.1.50'              # source address only
tcpdump 'dst host 10.0.1.50'              # destination address only
tcpdump 'net 10.0.1.0/24'                 # a subnet
tcpdump 'port 80'                         # a port (TCP or UDP)
tcpdump 'tcp port 443'                    # TCP 443 only
tcpdump 'portrange 8000-9000'             # a port range

# Protocol filters
tcpdump 'tcp'                             # TCP only
tcpdump 'udp'                             # UDP only
tcpdump 'icmp'                            # ICMP only

# TCP flag filters
tcpdump 'tcp[tcpflags] & (tcp-syn) != 0'  # packets with SYN set
tcpdump 'tcp[tcpflags] & (tcp-rst) != 0'  # packets with RST set
tcpdump 'tcp[tcpflags] == tcp-syn'        # SYN only (excludes SYN-ACK)
tcpdump 'tcp[tcpflags] & (tcp-syn|tcp-ack) == (tcp-syn|tcp-ack)'  # SYN-ACK

# Combined filters
tcpdump 'tcp port 80 and host 10.0.1.50 and not port 22'
tcpdump '(dst port 80 or dst port 443) and src net 192.168.0.0/16'
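When the list of ports comes from configuration rather than being typed by hand, a combined expression like the last one can be generated. The following is a minimal sketch; the function name `bpf_ports` is hypothetical, and it only builds the string without validating the ports.

```shell
#!/bin/bash
# Hypothetical helper: joins a list of ports into one parenthesised BPF
# expression, e.g. "(dst port 80 or dst port 443)".
# Arguments: direction keyword (src, dst, or "" for either), then ports
bpf_ports() {
    local dir="$1"; shift
    local expr="" p
    for p in "$@"; do
        # Prepend " or " for every term after the first
        expr+="${expr:+ or }${dir:+$dir }port $p"
    done
    printf '(%s)\n' "$expr"
}

bpf_ports dst 80 443
bpf_ports "" 53
```

The result can be embedded in a larger filter, for example `tcpdump -nn -i eth0 "$(bpf_ports dst 80 443) and src net 192.168.0.0/16"`. The parentheses matter because `and` and `or` have equal precedence in BPF filter expressions and group left to right.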

tshark display filters (user-space filtering, used during analysis):

# Basic filters
tshark -Y "ip.addr == 10.0.1.50"                  # an IP address
tshark -Y "tcp.port == 80"                        # a port
tshark -Y "tcp.stream eq 5"                       # a TCP stream number

# TCP analysis flags
tshark -Y "tcp.analysis.retransmission"           # retransmissions
tshark -Y "tcp.analysis.fast_retransmission"      # fast retransmissions
tshark -Y "tcp.analysis.zero_window"              # zero window
tshark -Y "tcp.analysis.duplicate_ack"            # duplicate ACKs
tshark -Y "tcp.analysis.lost_segment"             # previous segment not captured (possible loss)
tshark -Y "tcp.flags.reset == 1"                  # RST packets

# HTTP filters
tshark -Y "http.request"                          # HTTP requests
tshark -Y "http.response.code >= 400"             # HTTP error responses
tshark -Y "http.request.uri contains \"/api\""    # URI contains /api

# DNS filters
tshark -Y "dns.qry.name contains \"example.com\"" # queries for a given domain
tshark -Y "dns.flags.rcode != 0"                  # DNS error responses
tshark -Y "dns.time > 0.5"                        # DNS responses slower than 500 ms

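B. Configuration Parameters in Detail

Key tcpdump parameters:

| Parameter | Description | Common values | Example |
| --- | --- | --- | --- |
| -i | Interface to capture on | eth0, any | -i eth0 |
| -nn | Do not resolve host names or port names | – | Faster output; always use in production |
| -s | Snapshot length (bytes) | 0 = full packet, 96 = headers only | -s 96 reduces disk writes |
| -c | Maximum number of packets | As needed | -c 10000 |
| -w | Write to a pcap file | File path | -w /tmp/cap.pcap |
| -r | Read from a pcap file | File path | -r /tmp/cap.pcap |
| -C | Rotate by file size (MB) | 100 | -C 100 splits every 100 MB |
| -W | Maximum number of files (with -C) | 10 | -W 10 overwrites in a ring |
| -G | Rotate by time (seconds) | 3600 | -G 3600 splits every hour |
| -A | Print packet contents as ASCII | – | Quick look at plaintext HTTP |
| -X | Print as HEX + ASCII | – | Debugging binary protocols |
| -p | Disable promiscuous mode | – | Use in security-sensitive environments |

Key tshark parameters:

C. Glossary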



Reposted from 马哥网络安全: 《网络抓包分析实战：Wireshark与tcpdump定位网络故障》