Commits · master · einsteinathome / libclfft

Apr 25, 2023
- Merge branch 'override_cl_compile_options' into 'master' · 47f3a152
  Heinz-Bernd Eggenstein authored 2 years ago
  
  allow to override default OpenCL compile options by defining CLFFT_COMPILE_OPTIONS macro See merge request !6
  47f3a152
Aug 10, 2022
- allow to override default OpenCL compile options by defining CLFFT_COMPILE_OPTIONS macro · 278b5999
  Bernd Machenschalk authored 2 years ago
  
  278b5999
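
A compile-time override of this kind is commonly written as an `#ifndef` guard around the default. The following is only a minimal sketch of that pattern: the default string `"-cl-mad-enable"` and the helper `clfft_build_options()` are assumptions made for illustration, and the library's real default options are not shown in this log.

```c
#include <stdio.h>

/* Hypothetical default: used only when the build does not define the macro. */
#ifndef CLFFT_COMPILE_OPTIONS
#define CLFFT_COMPILE_OPTIONS "-cl-mad-enable"
#endif

/* Hypothetical helper: returns the option string that would be passed
 * as the "options" argument of clBuildProgram(). */
const char *clfft_build_options(void)
{
    return CLFFT_COMPILE_OPTIONS;
}
```

With this pattern, a build could replace the default by passing something like `-DCLFFT_COMPILE_OPTIONS='"-cl-opt-disable"'` to the C compiler.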
Mar 04, 2021
- Merge branch 'add-clFFT_GetSize-MR' into 'master' · a9efa1be
  Bernd Machenschalk authored 4 years ago
  
  Add clFFT_GetSize() for getting the estimated size of a plan See merge request !5
  a9efa1be
Mar 02, 2021
- Add clFFT_GetSize() for getting the estimated size of a plan · 46c5dc03
  Maximillian Bensch authored 5 years ago
  
  - similar to cufftGetSize()
  46c5dc03
Dec 03, 2019
- Merge branch 'remove-GPU-constraint' into 'master' · 0edfa5d2
  Bernd Machenschalk authored 5 years ago
  
  Remove GPU constraint See merge request !4
  0edfa5d2
Aug 21, 2019
- Remove GPU constraint · 8d760160
  Maximillian Bensch authored 5 years ago
  
  8d760160
Aug 12, 2019
- Merge branch 'improve_Makefile' into 'master' · 137b4784
  Bernd Machenschalk authored 5 years ago
  
  Makefile improvements See merge request !3
  137b4784
- example/Makefile: adapt the include path selection from src/Makefile here · 4967ca64
  Bernd Machenschalk authored 5 years ago
  
  4967ca64
Jun 17, 2019
- Makefile: renamed target 'sample' to avoid conflict with directory name · 06adfda0
  Bernd Machenschalk authored 5 years ago
  
  06adfda0
Jun 14, 2019
- don't force building the static version when installing · cec37b69
  Bernd Machenschalk authored 5 years ago
  
  cec37b69
- allow static and shared builds from top level · 6123b933
  Bernd Machenschalk authored 5 years ago
  
  6123b933
- fix selection of possible include paths · 9ba1bd5c
  Bernd Machenschalk authored 5 years ago
  
  9ba1bd5c
- Makefile: improve selection of possible include paths · e01a2fd1
  Bernd Machenschalk authored 5 years ago
  
  e01a2fd1
Jun 13, 2019
- fix MinGW build · aca23d27
  Bernd Machenschalk authored 5 years ago
  
  aca23d27
Feb 21, 2019

- allow to build shared and static versions separately

- fix OSX shared build

167e5b78

Feb 20, 2019
- Merge branch 'master' into 'master' · 7439e79e
  Bernd Machenschalk authored 6 years ago
  
  add a shared library version See merge request !2
  7439e79e
Feb 18, 2019
- add a shared library version · ad6ba6d3
  Maximillian Bensch authored 6 years ago
  
  ad6ba6d3
Apr 23, 2018

- Merge branch 'rename_to_eclfft' into 'master' · a7bd5410
  Oliver Bock authored 7 years ago
  
  renamed library and header to 'eclfft' to avoid conflicts with clFFT See merge request !1
  a7bd5410
- add a target 'install' · 3bb1a461
  Bernd Machenschalk authored 7 years ago
  
  - this installs the header in $PREFIX/include/eclfft and the lib in $PREFIX/lib/eclfft.a
  3bb1a461

Jun 07, 2016
- Added README · eafab99f
  Oliver Bock authored 8 years ago
  
  eafab99f
Sep 21, 2012
- added comments in header file for extended plan generation function · 9c5a4b48
  Heinz-Bernd Eggenstein authored 12 years ago
  
  9c5a4b48
Jul 26, 2012
- Ensure static linking of libgcc and libstdc++ (the latter requires GCC 4.5) · dc050e3a
  Oliver Bock authored 12 years ago
  
  dc050e3a
- Switching to archive files (OpenCL.lib for 64 bit seems to be compiled to a... · 227182c4
  Oliver Bock authored 12 years ago
  
  Switching to archive files (OpenCL.lib for 64 bit seems to be compiled to a different/incompatible format)
  227182c4
Jul 25, 2012
- Added MinGW 64 bit build target · 52add4a0
  Oliver Bock authored 12 years ago
  
  52add4a0
- Update APP SDK version to 2.6 · 3a684d72
  Oliver Bock authored 12 years ago
  
  3a684d72
- Bug #1608: clFFT use of native_sin , native_cos can cause validation problems · d7ecf3a2
  Heinz-Bernd Eggenstein authored 12 years ago
  
  fixed previous commit for C99 compliant float printf format
  d7ecf3a2
- Bug #1608: clFFT use of native_sin , native_cos can cause validation problems · 233caaf3
  Heinz-Bernd Eggenstein authored 12 years ago
  
  - fixed compilation warning
  - fixed problem of using the "%a" printf format, which is only supported in C99 and later and cannot be assumed for MinGW cross-compiles for Windows. Now uses this format only conditionally if supported; otherwise falls back to %f for generated float literals
  233caaf3
Jul 24, 2012
- Allow overriding of default build tools · 214ec2d8
  Oliver Bock authored 12 years ago
  
  214ec2d8
Jul 23, 2012

added file comment headers to express that this is now derived work and not... · 9a1e9f83

Heinz-Bernd Eggenstein authored 12 years ago

added file comment headers to express that this is now derived work and not the original Apple source code
The original Apple comment headers with (c) and license info are retained

9a1e9f83

Jul 13, 2012

Bug #1649: wrong results for transform lengths > 2^24 · a04104cb

Heinz-Bernd Eggenstein authored 12 years ago

Prevent integer overruns for long transforms in Taylor approx of sin cos.
Still to do: check all uses of mad24 etc in generated code where overruns could occur as well

a04104cb
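
To illustrate why 24-bit multiply-add operations are a concern for lengths above 2^24: OpenCL's `mad24`/`mul24` use only the low 24 bits of their factors, and the result is implementation-defined when an input is out of range. The host-side sketch below models one possible outcome (silently dropping the upper bits). The function names are hypothetical, and this is not the library's generated code.

```c
#include <stdint.h>

/* Emulates an unsigned mad24 that keeps only the low 24 bits of x and y.
 * Assumption: this models just one possible behavior for out-of-range inputs. */
uint32_t mad24_emulated(uint32_t x, uint32_t y, uint32_t z)
{
    return (x & 0xFFFFFFu) * (y & 0xFFFFFFu) + z;
}

/* Full 32-bit multiply-add, for comparison. */
uint32_t mad32(uint32_t x, uint32_t y, uint32_t z)
{
    return x * y + z;
}
```

For an index such as 2^24 + 5 the two functions disagree, whereas they agree for any factors below 2^24.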

Jul 07, 2012
- Bug #1641: double fp literals cause compilation errors with OpenCL · 2df1083e
  Heinz-Bernd Eggenstein authored 12 years ago
  
  fix: use compiler flag to globally convert all double constants to floats
  2df1083e
Jun 26, 2012

Bug #1608: clFFT use of native_sin , native_cos can cause validation problems · 7d3bdca6

Heinz-Bernd Eggenstein authored 12 years ago

added plan class creation method that allows to set flags to direct code generation
currently limited to select among 4 methods to compute twiddle factors:
-native_sin,native_cos function
-sincos() function
-set of two LUTs in global memory
-Taylor series approx via a smaller LUT in shared memory

7d3bdca6

Jun 25, 2012

Bug #1608: clFFT use of native_sin , native_cos can cause validation problems · 8d6fe913

Heinz-Bernd Eggenstein authored 12 years ago

experimental: improved Taylor series approx by copying LUT to shared mem.
TODO: cleanup, expose sin/cos method on plan creation interface,
do proper calculation of available shared mem for sin cos LUT

8d6fe913

Jun 22, 2012

Bug #1608: clFFT use of native_sin , native_cos can cause validation problems · 20314512

Heinz-Bernd Eggenstein authored 12 years ago

experimental: -added alternative method for twiddle factor calc, using a smaller LUT (256 * float2 )
               via Taylor series to 3rd order, seems to be almost as accurate as method with 2 bigger LUTs, but faster.
              -improved method w/ 2 bigger LUTs to use LUTs of float2
              -improved method using slow sin/cos functions (now uses sincos combined function), still slow
              - prepared plan struct to have method switchable at plan creation time.

              TODO: load smaller LUT for Taylor series approx into shared mem.

20314512

Jun 08, 2012

Bug #1608: clFFT use of native_sin , native_cos can cause validation problems · 48a3c019

Heinz-Bernd Eggenstein authored 12 years ago

Still experimental: replace calls to native_sin in clFFT
This change explores the performance impacts of using a set of LUTs, precomputed on the CPU
to perform sin(x_i) and cos(x_i) in a grid x_i= +/- 2*pi *i/N , N fixed.

On a 6770M, this code is still ca. 3% slower than the original native_sin/native_cos variant
for a BRP4-like transform

This variant should have a very high accuracy, versions with lesser accuracy but
higher performance should be explored next. Eventually the method should be selectable
by a parameter to the plan creator as suggested by Bernd.

TODO: - remove some diagnostic code,
      - optimize total size of LUTs perhaps by using
        cos(x) = sin(x+pi/2), so no need to keep separate LUTs for sin and cos, just one slightly longer with
        an additional alias pointer
      - try caching the LUTs in shared memory (using constant memory didn't help)

48a3c019

Oct 20, 2011
- Updated Win32 build to APP SDK 2.5 · ac856b1c
  Oliver Bock authored 13 years ago
  
  ac856b1c
Oct 17, 2011
- Fixed arch handling · 9bb73e1c
  Oliver Bock authored 13 years ago
  
  9bb73e1c
Sep 13, 2011
- Updated AMD APPSDK environment settings/defaults · 95fdd54e
  Oliver Bock authored 13 years ago
  
  95fdd54e
May 20, 2011
- Added library import file for Windows OpenCL runtime · af519111
  Oliver Bock authored 13 years ago
  
  af519111
- Added top-level convenience Makefile · a92bfaeb
  Oliver Bock authored 13 years ago
  
  * Supported targets: linux (default), macos, win32, clean
  a92bfaeb