别再只会wrk -t -c -d了！用Lua脚本模拟真实用户行为，让你的压力测试更贴近生产环境-平芜编程栈

用Lua脚本解锁wrk的隐藏能力：构建真实业务场景的压力测试方案

当你的电商网站在促销活动中崩溃，或是社交平台在热点事件中响应迟缓，这些往往不是简单的并发请求能模拟出来的问题。传统的wrk -t -c -d命令虽然能提供基础的性能数据，但真实的业务场景远比这复杂得多——用户需要先登录获取token，然后浏览商品列表，随机选择商品加入购物车，最后完成支付流程。这种多步骤、有状态的用户行为，正是大多数压力测试工具难以真实模拟的痛点。

1. 为什么需要基于业务逻辑的压力测试

在2023年的一次行业调研中，超过67%的技术团队承认他们的压力测试与真实用户行为存在显著差异。这种差异导致测试环境中的优异表现无法转化为生产环境的稳定性。一个典型的例子是，某知名电商平台在测试中能够轻松应对每秒10万次请求，但在双11活动中，实际流量只有测试流量的三分之一时系统就出现了严重延迟。

传统压力测试的三大局限性：

无状态请求：简单重复相同请求，无法模拟用户会话
固定参数：所有请求使用相同参数，忽略业务多样性
独立操作：无法构建请求间的依赖关系（如先登录后操作）

-- 典型的基础wrk测试脚本 wrk.method = "GET" wrk.path = "/api/products/123"

而真实的用户行为是这样的：

登录 → 浏览商品 → 随机选择商品 → 查看详情 → 加入购物车 → 结算

2. Lua脚本基础：从静态到动态测试

wrk的Lua脚本支持让我们能够突破这些限制。通过几个关键函数，我们可以构建复杂的测试场景：

setup(thread)：初始化线程级变量
init(args)：测试初始化阶段
request()：生成每个请求
response(status, headers, body)：处理响应

动态参数生成示例：

-- 生成随机用户ID和商品ID function random_string(length) local chars = "abcdefghijklmnopqrstuvwxyz0123456789" local result = "" for i = 1, length do local rand = math.random(#chars) result = result .. chars:sub(rand, rand) end return result end wrk.method = "GET" wrk.path = "/api/products/" .. random_string(8)

提示：使用math.randomseed(os.time())确保每次运行生成不同的随机序列

参数类型	生成方法	应用场景
用户标识	UUID/随机字符串	会话跟踪
数字范围	math.random(min,max)	分页、商品ID
时间戳	os.time()	时效性请求
枚举值	数组随机选择	分类查询

3. 构建有状态的用户旅程

真实的用户操作往往是一系列相关联的动作。通过Lua脚本，我们可以模拟这种有状态的行为：

local token = nil -- 登录并获取token function login() local req = wrk.format("POST", "/api/login", {["Content-Type"]="application/json"}, '{"username":"testuser","password":"p@ssw0rd"}') return req end -- 使用token查询用户信息 function get_profile() return wrk.format("GET", "/api/profile", {["Authorization"]="Bearer " .. token}) end -- 请求序列 function request() if not token then return login() else return get_profile() end end -- 处理响应，提取token function response(status, headers, body) if not token and status == 200 then token = string.match(body, '"token":"([^"]+)"') end end

这个脚本模拟了：

首次请求发送登录信息
从登录响应中提取token
后续请求携带token访问受保护资源

4. 复杂业务场景实战：电商购物流程

让我们构建一个完整的电商用户行为模型：

local user_steps = { "login", -- 1. 登录 "browse", -- 2. 浏览商品 "view_detail",-- 3. 查看详情 "add_to_cart",-- 4. 加入购物车 "checkout" -- 5. 结算 } local current_step = 1 local session_data = {} function create_request(step) if step == 1 then -- 登录 return wrk.format("POST", "/login", nil, json.encode({username="user_"..math.random(1000), password="123456"})) elseif step == 2 then -- 浏览商品 return wrk.format("GET", "/products?category=".. math.random(1,10).."&page="..math.random(1,5)) elseif step == 3 then -- 查看详情 return wrk.format("GET", "/products/"..math.random(1,1000)) elseif step == 4 then -- 加入购物车 return wrk.format("POST", "/cart", {["X-Auth-Token"]=session_data.token}, json.encode({product_id=math.random(1,1000), quantity=math.random(1,3)})) else -- 结算 return wrk.format("POST", "/checkout", {["X-Auth-Token"]=session_data.token}) end end function request() local req = create_request(current_step) return req end function response(status, headers, body) if current_step == 1 and status == 200 then session_data.token = headers["X-Auth-Token"] end -- 50%概率继续下一步，50%概率重复当前步骤或回退 if math.random() > 0.5 then current_step = math.min(#user_steps, current_step + 1) else current_step = math.max(1, current_step - math.random(0,1)) end end

这个脚本实现了：

完整的用户购物流程
步骤间的状态保持（通过token）
非线性的用户行为（可能回退或重复步骤）
动态参数生成（用户、商品、分类等）

5. 高级技巧与性能考量

当脚本变得复杂时，需要注意以下性能优化点：

1. 脚本预热策略

function init(args) -- 预先执行100次随机数生成，避免初始随机性不足 for i=1,100 do math.random() end -- 预加载常用数据 local f = io.open("common_products.json") common_products = json.decode(f:read("*a")) f:close() end

2. 连接复用优化

wrk.headers["Connection"] = "keep-alive" function response(status) -- 遇到5xx错误时关闭当前连接 if status >= 500 and status < 600 then wrk.thread:stop() end end

3. 结果分析与可视化

虽然wrk本身提供基础指标，但对于复杂测试，我们需要扩展结果分析：

local latency_stats = {} function response(status, headers, body, latency) local step = user_steps[current_step] if not latency_stats[step] then latency_stats[step] = { count = 0, total = 0, min = math.huge, max = 0 } end local stat = latency_stats[step] stat.count = stat.count + 1 stat.total = stat.total + latency stat.min = math.min(stat.min, latency) stat.max = math.max(stat.max, latency) end function done(summary, latency, requests) print("\nStep-wise Latency Analysis:") for step, stat in pairs(latency_stats) do local avg = stat.total / stat.count print(string.format("%-15s: min=%.2fms, avg=%.2fms, max=%.2fms", step, stat.min, avg, stat.max)) end end

6. 真实案例：社交平台的Feed流测试

某社交平台在测试其Feed流接口时，最初使用简单的固定参数测试：

wrk -t12 -c100 -d60s "https://api.social.com/feed?user_id=123&page=1"

迁移到基于Lua的智能测试后，发现了几个关键问题：

热点用户效应：某些大V的Feed加载性能显著低于普通用户
分页性能衰减：第5页以后的加载时间呈指数增长
缓存失效问题：随机访问模式导致缓存命中率低于预期

改进后的测试脚本核心逻辑：

-- 80%概率访问普通用户，20%概率访问大V function select_user_type() if math.random() < 0.2 then return math.random(1, 100) -- 大V用户ID范围 else return math.random(101, 1000000) -- 普通用户 end end -- 生成请求 function request() local user_id = select_user_type() local page = 1 -- 60%概率只看第一页，30%看2-3页，10%看更多 local r = math.random() if r < 0.6 then page = 1 elseif r < 0.9 then page = math.random(2,3) else page = math.random(4,10) end return wrk.format("GET", string.format("/feed?user_id=%d&page=%d", user_id, page)) end

这个案例最终帮助团队发现了数据库分片策略的缺陷，优化后使P99延迟降低了47%。