The Chronicles of Phi - part 5 - Plesiochronous phasing barrier – tiled_HT3
For the next optimization, I knew what I wanted to do; I just didn’t know what to call it. In looking for words that describes loosely-synchronous, I came across plesiochronous:In telecommunications, a...
View ArticleCompare Intrinsics
Intel® Streaming SIMD Extensions 2 (Intel® SSE2) intrinsics for integer comparison operations are listed in this topic. The prototypes for Intel® SSE2 intrinsics are in the emmintrin.h header file.The...
View ArticleMiscellaneous Intrinsics
The intrinsics listed here are designed for the Intel® Pentium® 4 processor with Streaming SIMD Extensions 3 (Intel® SSE3).The prototypes for these intrinsics are in the pmmintrin.h header...
View ArticlePacked Compare for Equal Intrinsic
The prototype for this Intel® Streaming SIMD Extensions (Intel® SSE4) intrinsic is in the smmintrin.h file._mm_cmpeq_epi64__m128i _mm_cmpeq_epi64(__m128i a, __m128i b);Performs a packed integer 64-bit...
View ArticleTemplated user-defined type conversion to abstract reference type
I have the following (very simplified) "container" class to store objects of "any" type in shared pointers: class container { public: template<typename T> container(const...
View ArticleVectorization Issue with loop iterations
Hi All,I am trying to compile following sample kernel with Intel (ICC) 14.0.0 20130728 (or version > 12 ). I see strange behaviour with vectorization. I have following questions:If I change _iml...
View ArticleIssues building MPI
Hi all,I recently installed both Intel Fortran and Intel C++ compilers for Linux on my system (Ubuntu 12.04 LTS with IA32). When trying to install MPICH, I'm unable to configure the installation, and I...
View Articleinternal error with -fno-rounding-math
$ icc -fno-rounding-math icc: internal error: Assertion failed (shared/driver/options.c, line 1684) $ icc --version icc (ICC) 14.0.2 20140120
View ArticlePerformance BKMs: Introduction and Super-secret Intel Tools
At SC13 (Super Computing 2013)*, someone commented that Intel seems to have some super-secret set of tricks in its pocket, allowing us to optimize “far beyond those of mortal man”+. We don’t really...
View ArticleOptimization icpc 13 vs icpc 14
Currently I am working on a simulation project which I develop in my own pc (Chakra linux, intel core i5). When I compile using icpc v 14.0.2 I get really nice results in terms of speed. The problem is...
View ArticleError: A license for BetaSTMCCompL could not be obtained.
Hi,I have installed Intel® C++ Composer XE 2013 for Linux and Intel® C++ STM Compiler Prototype Edition 4.0 with a non-commercial/student license. When I try to compile using icc, I get the following...
View ArticleHow to uninstall Intel® C++ STM Compiler Prototype Edition 3.0?
Hi,I have installed Intel® C++ STM Compiler Prototype Edition 3.0. What is the procedure to uninstall it. I could not find any uninstall script.Thanks!
View ArticleIntel MPI Library and Composer XE Compatibility
The following table lists all supported versions of the Intel® MPI Library and the Intel® Composer XE. Use this as a reference on the cross-compatibility between the library and associated...
View ArticleTrouble with main, header and C ++ compiler
I made all the modification needed to work with x64 material.I had a header file, a main and a rpogrammI had the following message : 1>------ Build started: Project: Archi-1D-Rld-V.4, Configuration:...
View ArticleOpenMP / memory saturation
Hi,I am working on an application which I'm pretty sure is memory bound. I tried doing some simple OpenMP, but there was no speedup, which seems to confirm that the kernel is indeed memory...
View ArticleGuide to Profile-guided Optimization of Computational Fluid Dynamics with...
Intel® C++ Compiler has supported Profile-guided Optimization (PGO) for a long history, PGO improves application performance by reorganizing code layout to reduce instruction-cache problems,...
View ArticleICC benchmarking and ICC vs GCC
Hi,as you see I am new here in this forum.I have a project comparing ICC vs GCC. I am looking for the best way to start this project. any links/ tutorial/ documentations are more than welcome.thinking...
View ArticleAVX is slower than serial execution ?
I write a simple program and build with icpc to examine the performance of AVX in my mathine. The code snippet is as following, #define T 2000000 #define X 16 #define Y 16 #define Z 16 for(int...
View ArticleIntel® Developer Zone at Microsoft //Build/ 2014
Software developers by the thousands visited the Intel Software booth at the sold-out Microsoft Build conference April 2 – 4 in San Francisco’s Moscone Center. They were able to learn about how the...
View ArticleOpenMP 4.0 target directives
I tried to get an OpenMP 4.0 test case to run where the code calls a user library that is compiled for host and mic (-mmic)The host library is in current directory (pwd), the MIC lib is in pwd/mic and...
View Article