April 28, 2021, 17:56 (GMT) |
Cycles: add GPU queue kernel time stats * Print summary of kernel execution times for GPU queues with --verbose 3 * This is for combinations of kernels rather than individual ones, since measuring individual ones has a performance impact. * Refactor code to be independent of CUDA queue Differential Revision: https://developer.blender.org/D11112 |
April 28, 2021, 17:56 (GMT) |
Cleanup: add mechanism for devices to not have a megakernel |
April 28, 2021, 17:56 (GMT) |
Cleanup: move number of concurrent GPU paths to device queue So different devices can set it to different values, though it's still hardcoded as of now. |
April 28, 2021, 17:49 (GMT) |
Fix wrong order of graphics interop destruction The GPU display needs to be destroyed after path trace works, otherwise the CUDA interop resource destruction will error with an invalid graphics context. Additionally, added guards to ensure the CUDA graphics interop object is never copied. |
April 28, 2021, 17:19 (GMT) |
Fix wrong number of integrator kernels The value was off-by-one, causing buffer overflow on accessing the queue counters. |
April 28, 2021, 16:19 (GMT) |
Only apply edge panning in translation mode. |
April 28, 2021, 16:13 (GMT) |
Longer edge pan delay and higher max speed. |
April 28, 2021, 15:53 (GMT) |
Clean up comments. |
April 28, 2021, 15:45 (GMT) |
Remove unused include. |
April 28, 2021, 15:43 (GMT) |
fix |
April 28, 2021, 15:36 (GMT) |
Merge branch 'master' into temp-gpencil-bezier-stroke-type Conflicts: source/blender/editors/transform/transform_convert_gpencil.c |
April 28, 2021, 15:35 (GMT) |
Merge branch 'master' into greasepencil-object |
April 28, 2021, 15:13 (GMT) |
[WIP] Compositor: Full-frame base system This patch adds the base code needed to make the full-frame system work for both current tiled/per-pixel implementation of operations and full-frame. Two execution models: - Tiled: Current implementation. Renders execution groups in tiles from outputs to input. Not all operations are buffered. Runs the tiled/per-pixel implementation. - FullFrame: All operations are buffered. Fully renders operations from inputs to outputs. Runs full-frame implementation of operations if available otherwise the current tiled/per-pixel. Creates output buffers on first read and free them as soon as all its readers have finished, reducing peak memory usage of complex/long trees. This should allow us to convert operations to full-frame in small steps with the system already working and solve the problem of high memory usage. FullFrame breaking changes respect Tiled system, mainly: - Translate, Rotate, Scale, and Transform take effect immediately instead of next buffered operation. - Any sampling is always done over inputs instead of last buffered operation. Differential Revision: https://developer.blender.org/D11113 |
April 28, 2021, 15:09 (GMT) |
support vertical panning |
April 28, 2021, 14:38 (GMT) |
add profiling to a few more places |
April 28, 2021, 14:11 (GMT) |
Merge branch 'master' into profiler-editor |
April 28, 2021, 12:56 (GMT) |
Merge branch 'master' of https://git.blender.org/blender |
April 28, 2021, 12:36 (GMT) |
Remove unused code. |
April 28, 2021, 12:18 (GMT) |
Use attach vertex buffer. |
April 28, 2021, 11:18 (GMT) |
Merge branch 'temp-gpu-compute-shaders' into temp-gpu-compute-shader-hair |
|