Blender Git Commit Log
Git Commits -> Revision a87fb34
Revision a87fb34 by Sergey Sharybin (master) June 13, 2014, 18:38 (GMT) |
Use advantage of SSE2 instructions in gaussian blur node This gives around 30% of speedup for gaussian blur node. Pretty much straightforward implementation inside the node itself, but needed to implement some additional things: - Aligned malloc. It's needed to load data onto SSE registers faster. based on the aligned_malloc() from Libmv with some additional trickery going on to support arbitrary alignment (this magic is needed because of MemHead). In the practice only 16bit alignment is supported because of the lack of aligned malloc with arbitrary alignment for OSX. Not a bit deal for now because we need 16 bytes alignment at this moment only. Could be tweaked further later. - Memory buffers in compositor are now aligned to 16 bytes. Should be harmless for non-SSE cases too. just mentioning. Reviewers: campbellbarton, lukastoenne, jbakker Reviewed By: campbellbarton CC: lockal Differential Revision: https://developer.blender.org/D564 |
Commit Details:
Full Hash: a87fb34edaf1a10f5527b6dc8a506a1c9ecbc683
Parent Commit: b0708dd
Lines Changed: +362, -22
12 Modified Paths:
/intern/guardedalloc/intern/mallocn.c (+36, -0) (Diff)
/intern/guardedalloc/intern/mallocn_guarded_impl.c (+86, -9) (Diff)
/intern/guardedalloc/intern/mallocn_intern.h (+31, -0) (Diff)
/intern/guardedalloc/intern/mallocn_lockfree_impl.c (+105, -7) (Diff)
/intern/guardedalloc/MEM_guardedalloc.h (+6, -0) (Diff)
/source/blender/compositor/intern/COM_MemoryBuffer.cpp (+2, -2) (Diff)
/source/blender/compositor/operations/COM_BlurBaseOperation.cpp (+12, -0) (Diff)
/source/blender/compositor/operations/COM_BlurBaseOperation.h (+7, -0) (Diff)
/source/blender/compositor/operations/COM_GaussianXBlurOperation.cpp (+35, -2) (Diff)
/source/blender/compositor/operations/COM_GaussianXBlurOperation.h (+3, -0) (Diff)
/source/blender/compositor/operations/COM_GaussianYBlurOperation.cpp (+36, -2) (Diff)
/source/blender/compositor/operations/COM_GaussianYBlurOperation.h (+3, -0) (Diff)
/intern/guardedalloc/intern/mallocn_guarded_impl.c (+86, -9) (Diff)
/intern/guardedalloc/intern/mallocn_intern.h (+31, -0) (Diff)
/intern/guardedalloc/intern/mallocn_lockfree_impl.c (+105, -7) (Diff)
/intern/guardedalloc/MEM_guardedalloc.h (+6, -0) (Diff)
/source/blender/compositor/intern/COM_MemoryBuffer.cpp (+2, -2) (Diff)
/source/blender/compositor/operations/COM_BlurBaseOperation.cpp (+12, -0) (Diff)
/source/blender/compositor/operations/COM_BlurBaseOperation.h (+7, -0) (Diff)
/source/blender/compositor/operations/COM_GaussianXBlurOperation.cpp (+35, -2) (Diff)
/source/blender/compositor/operations/COM_GaussianXBlurOperation.h (+3, -0) (Diff)
/source/blender/compositor/operations/COM_GaussianYBlurOperation.cpp (+36, -2) (Diff)
/source/blender/compositor/operations/COM_GaussianYBlurOperation.h (+3, -0) (Diff)