Click here to Skip to main content
15,887,746 members
Everything / OpenCL

OpenCL

OpenCL

Great Reads

by Nick Kopp
This article builds upon the earlier High Performance Queries: GPU vs. PLINQ vs. LINQ and ports this to also support OpenCL devices and adds benchmarking so you can easily compare performance.
by tugrulGtx
Multi-device OpenCL load balancer and pipeliner for C# in few lines of code.
by John Michael Hauck
It has never been easier for C# desktop developers to write code that takes advantage of the amazing computing performance of modern graphics cards. In this post I will share some techniques for solving a simple (but still interesting) image analysis problem. Source Code https://www.assembla.com/co
by Matt Scarpino
Using GPU Acceleration to Compute Ray-Triangle Intersection

Latest Articles

by aroman
In this post I explore Lattice Boltzmann methods and build a related project
by tugrulGtx
Header-only C++ tool that supports basic array-like usage pattern and uses multiple graphics cards in system as storage with LRU caching
by tugrulGtx
Accessing VRAM-cached nucleotide sequences in FASTA formatted files (*.fna, *.faa) by index
by Arthur V. Ratz
In this article I will thoroughly discuss about the several aspects of using the revolutionary new Intel® oneAPI HPC Toolkit to deliver a modern code that implements a parallel “stable” sort

All Articles

Sort by Updated

OpenCL 

6 Jan 2014 by Adam Wojnar
Simple .jp2/.j2k viewer using Kakadu executables demonstration pack for decoding
8 Aug 2011 by Adnan Boz
An entry level example of how to use NVIDIA CUDA technology to achieve better performance within C# with minimum possible amount of code
13 Oct 2012 by Alesiani Marco
A Wave PDE simulation using GPGPU capabilities
14 Aug 2014 by Android on Intel
The standard API for 3D graphics on Android is OpenGL ES, which is the most widely used 3D graphics API on all mobile devices today.
17 Nov 2014 by Android on Intel
In the conclusion of this two-part series, I detail the best 3D game engine and middleware solutions for Android* tablets, including free, open source, and proprietary options. I also note which have native support for x86 Intel® processors.
2 Feb 2015 by Android on Intel
This tutorial will guide you through Intel® INDE 2015 installation and demonstrate how to develop native Android* applications that target either x86 based or ARM based processors.
20 Jan 2015 by Android on Intel
This tutorial shows how to use two powerful features of OpenCL™ 2.0: enqueue_kernel functions that allow you to enqueue kernels from the device and work_group_scan_exclusive_add and work_group_scan_inclusive_add
20 Jan 2015 by Android on Intel
This tutorial will guide you through writing a native “Hello World” Android* app in Visual Studio* through the IDE Integration feature of Intel® INDE 2015.
7 Apr 2015 by Android on Intel
The intention of this guide is to provide quick steps to create, build, debug, and analyze OpenCL™ applications with the OpenCL™ Code Builder, a part of Intel® Integrated Native Development Environment (Intel® INDE)
1 Oct 2015 by Android on Intel
This article walks through an example Android application that offloads image processing using OpenCL™ and RenderScript programming languages.
1 Dec 2015 by Android on Intel
Using OpenCL™ 2.0 Read-Write Images
1 Mar 2016 by Android on Intel
In this article we are going to do a walkthrough of how to do CPU-bound offline analysis of the workflow.
19 Apr 2016 by Android on Intel
In this guide, we will show a variety of tools to use as well as features in the Unity software that can help you enhance the performance of your Unity project.
1 Jun 2016 by Android on Intel
Intel® System Studio 2017 Beta has been released. This is the Beta program page which guides you further on Intel® System Studio 2017 Beta new features and enhanced usability experience.
18 Feb 2019 by Apriorit Inc, ruksovdev
A detailed description of an FPGA-specific framework called ISE Design Suite, and the main steps you need to take in order to create a VGA driver using FPGA
21 Jul 2021 by aroman
In this post I explore Lattice Boltzmann methods and build a related project
7 May 2020 by Arthur V. Ratz
In this article I will thoroughly discuss about the several aspects of using the revolutionary new Intel® oneAPI HPC Toolkit to deliver a modern code that implements a parallel “stable” sort
14 Apr 2014 by Bartlomiej Filipek
Flexible Particle System - Start
2 Aug 2014 by Bartlomiej Filipek
How to start optimizing the particle system code.
20 Sep 2015 by Bartlomiej Filipek
A little guide about modern OpenGL and why it gives us so much value.
26 Aug 2013 by Buddhi Chaturanga
I want to distinguish of these two technologies relevant to their technological aspects.What are major differences and usage of each one of them?Pros and Cons.How we can handle process through GPU using each one of them?How those technologies can be implemented for 3D game programming?
28 Jul 2011 by caglarozbek89
Is there anybody who can give me a direction to write a code that calculate the pi number by using OpenCL..If you have any Pi calculator sample code, please share with me..Thanks for your great interest..
15 Aug 2011 by caglarozbek89
Is there any one who can give me some hints about writing the OpenCL internet checksum algorithm..Or who has the OpenCL code of this algorithm??
1 Sep 2009 by ChaoJui
Image processing with a burst of performance from CUDA
6 Nov 2014 by Colleen Culbertson
This article, aimed at developers, will provide a glimpse into this 64-bit, multi-core SOC processor, and gives an overview of the available Intel® technologies, including Intel® HD Graphics 5300.
3 Jan 2014 by CPallini
You might write a useful parallelized sorting algorithm.
24 Dec 2015 by Dave Kreskowiak
OK, so when do you start writing it? If you came here looking for someone to just hand over completed code to you you've come to the wrong site.
10 Sep 2021 by Dave Kreskowiak
You already have an expert. It's your teacher.
28 Aug 2018 by DaveAuld
The pursuit of Serenity, it's new build time!
9 Jan 2013 by Denis Demidov
This article is an introduction to VexCL. VexCL is vector expression template library created for ease of C++ based OpenCL development.
29 Jan 2013 by Dilan Shaminda
Hi, i want to know what are the techniques that i can follow in order to reduce noises in webcam images in low,high and normal lighting conditions.I am going to extract the features in the image.Any suggestions? Thank you
30 Nov 2016 by Dino Konstantopoulos
Running Theano with an Nvidia 1070 GPU on Windows 10, with CUDA 8 and Visual Studio 2015
30 May 2013 by Doug Wyrembek
A fun utility to apply blend modes to an image.
10 Nov 2016 by Farhad Reza
This article will show you how you can use the OpenGL graphics library in Google's Go language.
19 Oct 2011 by headmyshoulder
odeint v2 - Solving ordinary differential equations in C++
26 Jul 2012 by headmyshoulder, Denis Demidov
This article shows how ordinary differential equations can be solved with OpenCL. In detail it shows how odeint - a C++ library for ordinary differential equations - can be adapted to work with VexCL - a library for OpenCL. The resulting performance is studied on two examples.
3 Dec 2012 by Ilya Suzdalnitski
Image processing basics on the GPU using OpenCL.NET.
18 Mar 2015 by Intel
This tutorial demonstrates how to share surfaces between OpenCL™ and DirectX 11 with Intel ® Processor Graphics on Microsoft Windows, using the surface sharing extension in OpenCL.
26 May 2015 by Intel
In this article we are going to demonstrate how to optimize Single precision floating General Matrix Multiply (SGEMM) kernels for the best performance on Intel® Core™ Processors with Intel® Processor Graphics.
16 Sep 2015 by Intel
In this article, we will introduce the components of INDE and show how developers can use them to create new applications and optimize existing applications. To start with Intel® INDE provides support for IDE integration.
23 Feb 2017 by Intel
From safe roads to enjoyable commutes, automated driving is poised to change lives and society for the better.
11 Apr 2017 by Intel
As IoT demand drives increases in data volume, a more powerful processor is required, as well as additional storage.
11 Apr 2017 by Intel
Digital displays and signs are all around you. You may have seen them cropping up at shopping centers and doctors’ offices. From video walls, to AR fitting mirrors, to ordering menus, digital signs are pervasive and are becoming a part of everyday shopping experience.
1 Jun 2017 by Intel
This paper introduces Intel software tools recently made available to accelerate deep learning inference in edge devices (such as smart cameras, robotics, autonomous vehicles, etc.) incorporating Intel® Processor Graphics solutions across the spectrum of Intel SOCs.
14 Aug 2017 by Intel
Intel is uniquely positioned for AI development—the Intel’s AI Ecosystem offers solutions for all aspects of AI by providing a unified front end for a variety of backend technologies, from hardware to edge devices.
18 Aug 2017 by Intel
This paper addresses how the Smart Video (SV) system architecture is increasing in complexity and evolving into new industries and use cases.
18 Aug 2017 by Intel
The Intel® Computer Vision SDK is an Intel-optimized and accelerated computer vision software development kit based on the OpenVX standard. The SDK integrates pre-built OpenCV with deep learning support using an included Deep Learning (DL) Deployment toolkit.
31 Aug 2017 by Intel
Intel® GO™ SDK Offers Automotive Solution Developers an Integrated Solutions Environment
18 Sep 2017 by Intel
OpenCL™ Drivers and Runtimes for Intel® Architecture
10 Oct 2017 by Intel
The Face Access Control application is one of a series of IoT reference implementations aimed at instructing users on how to develop a working solution for a particular problem.
24 Oct 2017 by Intel
The Intel® Computer Vision SDK is a new software development package for development and optimization of computer vision and image processing pipelines for Intel System-on-Chips (SoCs).
25 Jan 2018 by Intel
This tutorial will walk you through the basics of using the Deep Learning Deployment Toolkit's Inference Engine (included in the Intel® Computer Vision SDK).
13 Feb 2018 by Intel
The SDK includes components to develop applications: IDE integration, offline compiler, debugger, and other tools.
13 Feb 2018 by Intel
Intel just released Intel® System Studio 2018, an all-in-one, cross-platform, comprehensive tool suite for system and IoT device application development.
3 Apr 2018 by Intel
The Retail Workshop: Hands on Learning with Intel®-based Retail Solutions
23 Dec 2013 by Jake Drew
ABSTRACT This article comprises a detailed overview of the various multicore and parallel programming options available within the C# programming language.
21 Mar 2016 by Javier Luis Lopez
Many people have a lot of troubles installing the Microsoft SDKs, then discovered that I must uninstall the faulty VS2010 runtimes and installing of the directx_Jun2010_redist.exe instead of DXSDK_Jun10.exe The questions are: - It is needed to reinstall VS2010 runtime or if it is better...
29 Aug 2017 by Javier Luis Lopez
I have done a VS2013 project to test opencl at github OpenCL dir: GitHub - jlopez2022/cpp_utils: Example of c++ programs[^] In that example I calculated differential rms of a big vector (200mega size), then on CPU and debug mode it calculated at 100 Megaops/data At CPU and release mode the...
31 Aug 2017 by Javier Luis Lopez
To obtain data resume from results like following code: int k = get_global_id(0); double result=d[k]*d[k]; It must be used reductions that is very difficult to perform and reduces the code cleariness as said in following link:...
31 Aug 2017 by Javier Luis Lopez
I tried the posted code. My idea was to obtain partial sums of input data on array rms, then make barriers (GLOBAL and LOCAL) to wait until all rms[k] are filled, then sum all them to obtain the media value. I placed some printf to advises if there are errors in the calculus. I obtained errors...
4 Sep 2017 by Javier Luis Lopez
I am not very happy with this solution: Opencl 1.2 does not allow synchronize across all work groups, as I stated, so it must be going out of kernel and enter in a new one to use data from all work items. If somebody know how to do it in the new openCL 2.x standard I would appreciate it. ...
13 Sep 2017 by Javier Luis Lopez
It is possible to copy float arrays to float4? I do not know if a float4 array elements can be aligned with a float array to copy them. I tried this but failed to compile: What I have tried: #define WD2 WIDTH/4 __global float A[WIDTH*HEIGHT]; ... __local float4 B[WD2];...
13 Sep 2017 by Javier Luis Lopez
Finally vloadn works, but unfortunately I have to copy to only one vector, not an array of them: __global float* imagen0 ... long pix = get_global_id(0); if (pix==0) { float16 vv=vload16(0,imagen0); printf("===GPU vv: %6v16f \n",vv); } if (pix
16 May 2018 by Javier Luis Lopez
It is very hard to use the GPU because the user has to do memory segmentation and transfer, the use of local memory and in the most applications very low performance increase 10-20x is reached. In other hand using multithreads is easy and fast. It would be better use 1280 threads in parallel...
16 May 2018 by Javier Luis Lopez
The only one solution is to RUN C++ MULTITHREAD on GPU and completely ABANDON OPENCL or cuda forever. I am speaking about increase the performance of a simple PC with multithread by 41x And a improvement over a very complex opencl sw by 12x Of course a lot of modifications on HW and drivers...
11 Nov 2018 by Javier Luis Lopez
I program in GPUs using OpenCL but I would be happy with a easier system to parallelize the program What of them implies less code to be changed to introduce in GPU? C++ amp and Trust allows run several functions sequentially inside the GPU before returning results? What I have tried: I made...
9 Jun 2010 by JeffHeaton
Learn the basics of how to install and use OpenCL with Java, unleash the power of your GPU.
18 May 2012 by jfriedman
An article on string manipulation in the .NET Micro Framework.
29 Aug 2017 by Jochen Arndt
The speed doubling in release mode is not sourced by parallel processing. It is sourced by the compiler optimising the code in release mode and omitting additional checks which are done in debug builds. You have to explicitly write code for parallel processing. Which method is finally faster...
22 May 2013 by John Michael Hauck
It has never been easier for C# desktop developers to write code that takes advantage of the amazing computing performance of modern graphics cards. In this post I will share some techniques for solving a simple (but still interesting) image analysis problem. Source Code https://www.assembla.com/co
22 May 2013 by John Michael Hauck
Some ad hoc performance test results for a simple program written in C# as obtained from my current desktop computer: Dell Precision T3600, 16GB RAM, Intel Xeon E5-2665 0 @ 2.40GHz, NVidia GTX Titan.
18 May 2013 by John Michael Hauck
“Programming Massively Parallel Processors (second edition)” by Kirk and Hwu is a very good second book for those interested in getting started with CUDA.
22 Nov 2015 by John Michael Hauck
It has never been easier for C# desktop developers to write code that takes advantage of the amazing computing performance of modern graphics cards.
21 Mar 2016 by KarstenK
If you really need that installation you must reinstall it. Install the newest SDK which fulfills your needs.Tip 1: You shouldnt touch the registry, but first run once the official deinstallation.Tip 2: deactivate the UAC or set it at a low level for the installation.
16 May 2018 by KarstenK
It is depending on what you want to do. Even multithreading isnt optimal, when a lot of short threads are running because multithreading means also overhead in the CPU. Graphical output and low level computations are best done on GPU, computations also when the usage of the GPU leads to less...
17 Dec 2013 by kdgupta87
A 2D analog clock designed using OpenTK in C# and WinForms.
4 Aug 2011 by Ken Domino
Please visit my web site @ http://domemtech.com/?p=669[^] This page from my blog is a cursory comparison of CUDA vs. OpenCL using just your example, estimating the value of pi. Basically, the solution is via numerical integration using the Composite Simpson's Rule. The solution uses IEEE...
14 Aug 2014 by kyleK89
The original C code isvoid GaussianArray(float* Gauss, int variable){ for(int i = 1; i 0; j-- ) { Gauss[j] += Gauss[j - 1]; } ...
18 Aug 2014 by kyleK89
I'm trying to make kernel.cl with opencvfor (i = 0; i
19 Dec 2023 by M Imran Ansari
Creating a flowchart for OpenCL code involves representing the logical flow of the program with steps. Note that OpenCL programming typically involves both host (CPU) and device (GPU) code flow. Check the below link as guide and try to build of...
14 May 2017 by Mahdi Nejadsahebi
Have a good timei have a problem in OpenCL 1.2.Look, i have an array as global in the kernel and the group size is 1000.The problem is that the atomic_add() function doesn't work correctly.My kernel code is :buffer[3] = 100;atomic_add(&buffer[3], 1);if i create 1000...
17 Sep 2010 by manythreads
Curious about GPGPU programming? Read Rob Farber’s Massively Parallel Programming series. Learn how to get more from your CPU, GPU, APU, DSP, and more.
27 Oct 2010 by manythreads
In his second tutorial, GPGPU expert Rob Farber discusses OpenCL™ memory spaces and the OpenCL memory hierarchy, and how to start thinking in terms of work items and work groups. This tutorial also provides a general example to facilitate experimentation with a variety of OpenCL kernels.
6 Jan 2011 by manythreads
In his third tutorial, GPGPU expert Rob Farber will introduce the OpenCL™ execution model and discuss how to coordinate computations among the work items in a work group
10 Mar 2011 by manythreads
Read Rob Farber’s Massively Parallel Programming series. This fourth article in a series on portable multithreaded programming using OpenCL™ will discuss the OpenCL™ runtime and demonstrate how to perform concurrent computations among the work queues of heterogeneous devices.
24 May 2011 by manythreads
This fifth article in a series on portable multithreaded programming using OpenCL™ Rob Farber discusses OpenCL™ buffers and demonstrates how to tie computation to data in a multi-device, multi-GPU environment.
2 Apr 2012 by manythreads
This sixth article in a series on portable multithreaded programming using OpenCL™ where Rob Farber discusses how to calculate data in OpenCL™ and render it with OpenGL within the same application.
13 Feb 2012 by manythreads
This article will demonstrate how to create C/C++ plugins that can be dynamically loaded at runtime to add massively parallel OpenCL capabilities to an already running application
13 Feb 2012 by manythreads
This article will demonstrate how to incorporate OpenCL into heterogeneous workflows via a general-purpose “click together tools” framework that can stream arbitrary messages within a single workstation, across a network of machines, or within a cloud computing framework.
14 Feb 2012 by manythreads
This article discusses OpenCL extensions that provide programmers with additional capabilities such as double-precision arithmetic and Device Fission
23 Jul 2014 by martin_bisson
How to use OpenCL with OpenGL to implement algorithms on GPU.
12 Sep 2013 by Matt Scarpino
Using GPU Acceleration to Compute Ray-Triangle Intersection
11 Jul 2013 by Matthew Faithfull
Querysoft Open Runtime: Architecture compatibility aspect.
16 Feb 2016 by Max R McCarty
OWASP's #6 most vulnerable security risk has to do with keeping secrets secret.
6 Nov 2014 by Maxim_Shevtsov
This article is an overview of the OpenCL support provided in System Analyzer and Platform Analyzer on the Windows* OS
2 Jan 2014 by Member 10501094
Hello,I am student on high school (not University) and technically i dont study programming but i can do my project on programming too. My teachers suggested me to make some useless sorting algorithms programs but that not really something i would like to do. I would like to create some...
26 Jun 2015 by Member 11794279
When processor is not enough
15 Sep 2020 by Member 12087553
Hello, I have been reviewing OpenCL Library in C# for what will use in my project. But I don't know how to get calculated data from OpenCL Library. public static void RunGPU() { try { EasyCL cl = new EasyCL() { ...
5 Oct 2020 by Member 12087553
I have been developing a project. it is developed with C#, OpenCL.NET. Aforge, OpenCV. I have a big problem. it is I can't release Memory on OpenCL.NET. I tried release() and dispose() but there weren't released. So I need the method that can...