Blender Git Loki
Git Commits -> Revision 9800837
Revision 9800837 by Brecht Van Lommel (master) February 15, 2019, 07:56 (GMT) |
Cycles: Support multithreaded compilation of kernels This patch implements a workaround to get the multithreaded compilation from D2231 working. So far, it only works for Blender, not for Cycles Standalone. Also, I have only tested the Linux codepath in the helper function. Depends on D2231. Patch by lukasstockner97, jbakker, brecht job | scene_name | compilation_time ----------+-----------------+------------------ Baseline | empty | 22.73 D2264 | empty | 13.94 Baseline | bmw | 56.44 D2264 | bmw | 41.32 Baseline | fishycat | 59.50 D2264 | fishycat | 45.19 Baseline | barbershop | 212.28 D2264 | barbershop | 169.81 Baseline | victor | 67.51 D2264 | victor | 53.60 Baseline | classroom | 51.46 D2264 | classroom | 39.02 Baseline | koro | 62.48 D2264 | koro | 49.03 Baseline | pavillion | 54.37 D2264 | pavillion | 38.82 Baseline | splash279 | 47.43 D2264 | splash279 | 37.94 Baseline | volume_emission | 145.22 D2264 | volume_emission | 121.10 This patch reduced compilation time as the split kernels and base kernels are compiled in parallel. In cycles debug mode (256) you can set unmark the opencl single program file, what reduces the compilation time even further (bmw 17 seconds, barbershop 53 seconds). Reviewers: brecht, dingto, sergey, juicyfruit, lukasstockner97 Reviewed By: brecht Subscribers: Loner, jbakker, candreacchio, 3dLuver, LazyDodo, bliblubli Differential Revision: https://developer.blender.org/D2264 |
Commit Details:
Full Hash: 9800837b987930e6152c2dc27cae5bd55873d306
Parent Commit: de0e456
Committed By: Jeroen Bakker
Lines Changed: +420, -45
1 Added Path:
/intern/cycles/kernel/kernels/opencl/kernel_split_bundle.cl (+28, -0) (View)
11 Modified Paths:
/intern/cycles/blender/blender_python.cpp (+30, -0) (Diff)
/intern/cycles/blender/CMakeLists.txt (+4, -0) (Diff)
/intern/cycles/device/device_intern.h (+1, -0) (Diff)
/intern/cycles/device/opencl/opencl.h (+17, -3) (Diff)
/intern/cycles/device/opencl/opencl_base.cpp (+5, -15) (Diff)
/intern/cycles/device/opencl/opencl_mega.cpp (+20, -4) (Diff)
/intern/cycles/device/opencl/opencl_split.cpp (+158, -13) (Diff)
/intern/cycles/device/opencl/opencl_util.cpp (+130, -10) (Diff)
/intern/cycles/kernel/CMakeLists.txt (+1, -0) (Diff)
/intern/cycles/util/util_system.cpp (+22, -0) (Diff)
/intern/cycles/util/util_system.h (+4, -0) (Diff)
/intern/cycles/blender/CMakeLists.txt (+4, -0) (Diff)
/intern/cycles/device/device_intern.h (+1, -0) (Diff)
/intern/cycles/device/opencl/opencl.h (+17, -3) (Diff)
/intern/cycles/device/opencl/opencl_base.cpp (+5, -15) (Diff)
/intern/cycles/device/opencl/opencl_mega.cpp (+20, -4) (Diff)
/intern/cycles/device/opencl/opencl_split.cpp (+158, -13) (Diff)
/intern/cycles/device/opencl/opencl_util.cpp (+130, -10) (Diff)
/intern/cycles/kernel/CMakeLists.txt (+1, -0) (Diff)
/intern/cycles/util/util_system.cpp (+22, -0) (Diff)
/intern/cycles/util/util_system.h (+4, -0) (Diff)