macros - 在 Julia 中使用 @generated 宏进行渐变的符号
问题描述
出于性能原因,我需要与用户定义的函数一样快的渐变和 Hessians(例如,ForwardDiff 库使我的代码明显变慢)。然后我尝试使用@generated
宏进行元编程,用一个简单的函数进行测试
using Calculus
hand_defined_derivative(x) = 2x - sin(x)
symbolic_primal = :( x^2 + cos(x) )
symbolic_derivative = differentiate(symbolic_primal,:x)
@generated functional_derivative(x) = symbolic_derivative
这正是我想要的:
rand_x = rand(10000);
exact_values = hand_defined_derivative.(rand_x)
test_values = functional_derivative.(rand_x)
isequal(exact_values,test_values) # >> true
@btime hand_defined_derivative.(rand_x); # >> 73.358 μs (5 allocations: 78.27 KiB)
@btime functional_derivative.(rand_x); # >> 73.456 μs (5 allocations: 78.27 KiB)
我现在需要将其推广到具有更多参数的函数。显而易见的推断是:
symbolic_primal = :( x^2 + cos(x) + y^2 )
symbolic_gradient = differentiate(symbolic_primal,[:x,:y])
的symbolic_gradient
行为符合预期(就像在一维情况下一样),但 @generated 宏不会像我认为的那样响应多个维度:
@generated functional_gradient(x,y) = symbolic_gradient
functional_gradient(1.0,1.0)
>> 2-element Array{Any,1}:
:(2 * 1 * x ^ (2 - 1) + 1 * -(sin(x)))
:(2 * 1 * y ^ (2 - 1))
也就是说,它不会将符号转换为生成的函数。有没有简单的方法来解决这个问题?
PS:我知道我可以将每个参数的导数定义为一维函数,并将它们捆绑在一起形成一个渐变(这就是我目前正在做的),但我确信一定有更好的方法。
解决方案
首先,我认为您不需要在@generated
这里使用:这是代码生成的“简单”案例,我认为使用@eval
更简单且不那么令人惊讶。
所以一维情况可以这样重写:
julia> using Calculus
julia> symbolic_primal = :( x^2 + cos(x) )
:(x ^ 2 + cos(x))
julia> symbolic_derivative = differentiate(symbolic_primal,:x)
:(2 * 1 * x ^ (2 - 1) + 1 * -(sin(x)))
julia> hand_defined_derivative(x) = 2x - sin(x)
hand_defined_derivative (generic function with 1 method)
# Let's check first what code we'll be evaluating
# (`quote` returns the unevaluated expression passed to it)
julia> quote
functional_derivative(x) = $symbolic_derivative
end
quote
functional_derivative(x) = begin
2 * 1 * x ^ (2 - 1) + 1 * -(sin(x))
end
end
# Looks OK => let's evaluate it now
# (since `@eval` is macro, its argument will be left unevaluated
# => no `quote` here)
julia> @eval begin
functional_derivative(x) = $symbolic_derivative
end
functional_derivative (generic function with 1 method)
julia> rand_x = rand(10000);
julia> exact_values = hand_defined_derivative.(rand_x);
julia> test_values = functional_derivative.(rand_x);
julia> @assert isequal(exact_values,test_values)
# Don't forget to interpolate array arguments when using `BenchmarkTools`
julia> using BenchmarkTools
julia> @btime hand_defined_derivative.($rand_x);
104.259 μs (2 allocations: 78.20 KiB)
julia> @btime functional_derivative.($rand_x);
104.537 μs (2 allocations: 78.20 KiB)
现在二维情况不起作用,因为输出differentiate
是一个表达式数组(每个组件一个表达式),您需要将其转换为构建组件数组(或元组,用于性能)的表达式。这是symbolic_gradient_expr
在下面的示例中:
julia> symbolic_primal = :( x^2 + cos(x) + y^2 )
:(x ^ 2 + cos(x) + y ^ 2)
julia> hand_defined_gradient(x, y) = (2x - sin(x), 2y)
hand_defined_gradient (generic function with 1 method)
# This is a vector of expressions
julia> symbolic_gradient = differentiate(symbolic_primal,[:x,:y])
2-element Array{Any,1}:
:(2 * 1 * x ^ (2 - 1) + 1 * -(sin(x)))
:(2 * 1 * y ^ (2 - 1))
# Wrap expressions for all components of the gradient into a single expression
# generating a tuple of them:
julia> symbolic_gradient_expr = Expr(:tuple, symbolic_gradient...)
:((2 * 1 * x ^ (2 - 1) + 1 * -(sin(x)), 2 * 1 * y ^ (2 - 1)))
julia> @eval functional_gradient(x, y) = $symbolic_gradient_expr
functional_gradient (generic function with 1 method)
与 1D 情况一样,这与手写版本的执行方式相同:
julia> rand_x = rand(10000); rand_y = rand(10000);
julia> exact_values = hand_defined_gradient.(rand_x, rand_y);
julia> test_values = functional_gradient.(rand_x, rand_y);
julia> @assert isequal(exact_values,test_values)
julia> @btime hand_defined_gradient.($rand_x, $rand_y);
113.182 μs (2 allocations: 156.33 KiB)
julia> @btime functional_gradient.($rand_x, $rand_y);
112.283 μs (2 allocations: 156.33 KiB)
推荐阅读
- objective-c - 我该如何处理这种欧式时间戳?
- java - What should be the name of the prev_node argument passed into the method given below?
- mysql - 更改端口号后phpMyAdmin登录不起作用
- batch-file - 为什么我的批处理文件在 Windows 10 上停止工作?
- pandas - 如何在熊猫数据框中提取组的最后一个值
- python - 使用 aiohttp 的多个请求,但每个请求单独超时
- ruby-on-rails - 我如何使用 rspec 运行测试,使用 simple_token_authentication 并设计
- http - 为什么以下代码中的最后一个错误处理程序存在无效参数错误?
- c - 除了字母值之外,错误字符串变量正在接收 0x603270
- javascript - 如何在 React Hook 中添加自定义类名?