Blog

Nov 14, 2017   |   Wei Wang, Peng Zhang, Rong Zhang, Jayaram Bobba

neon v2.3.0: Significant Performance Boost for Deep Speech 2 and VGG models

We are excited to announce the release of neon™ 2.3.0. It ships with significant performance improvements for Deep Speech 2 (DS2) and VGG models running on Intel® architecture (IA). For the DS2 model, our tests show up to 6.8X improvement¹,⁴ with the Intel® Math Kernel Library (Intel® MKL) backend over the NumPy CPU backend with…

Read more

#Release Notes

[Figure: BDW-SKX Normalized Throughput]

Sep 18, 2017   |   Jayaram Bobba

neon v2.1.0: Leveraging Intel® Advanced Vector Extensions 512 (Intel® AVX-512)

We are excited to announce the availability of the neon™ 2.1 framework. An optimized backend based on the Intel® Math Kernel Library (Intel® MKL) is enabled by default on CPU platforms with this release. neon™ 2.1 also uses a newer version of the Intel® MKL for Deep Neural Networks (Intel® MKL-DNN), which features optimizations for…

Read more

#Release Notes

Jun 28, 2017   |   Jayaram Bobba

neon™ 2.0: Optimized for Intel® Architectures

neon™ is a deep learning framework created by Nervana Systems with industry-leading performance on GPUs thanks to its custom assembly kernels and optimized algorithms. Since Nervana joined Intel, we have been working together to bring superior performance to CPU platforms as well. Today, as the result of a great collaboration between the teams, we…

Read more

#Release Notes

Dec 29, 2016   |   Jennifer Myers

neon v1.8.0 released!

Highlights from this release include:
* Skip Thought Vectors example
* Dilated convolution support
* Nesterov Accelerated Gradient option to SGD optimizer
* MultiMetric class to allow wrapping Metric classes
* Support for serializing and deserializing encoder-decoder models
* Allow specifying the number of time steps to evaluate during beam search
* A new community-contributed Docker image…

Read more

#Release Notes

Nov 22, 2016   |   Jennifer Myers

neon v1.7.0 released!

Highlights from this release include:
* Update Data Loader to aeon for flexible, multi-threaded data loading and transformations. More information can be found in the docs, but in brief, aeon:
  * provides an easy interface to adapt existing models to your own custom datasets
  * supports images, video and audio and is easy to extend with your own providers for custom…

Read more

#Release Notes

Sep 22, 2016   |   Jennifer Myers

neon v1.6.0 released!

Highlights from this release include:
* Faster RCNN model
* Sequence to Sequence container and char_rae recurrent autoencoder model
* Reshape Layer that reshapes the input [#221]
* Pip requirements in requirements.txt updated to latest versions [#289]
* Remove deprecated data loaders and update docs
* Use NEON_DATA_CACHE_DIR envvar as archive dir to store DataLoader ingested data
* Eliminate type conversion for FP16…

Read more

#Release Notes

Jul 01, 2016   |   Jennifer Myers

neon v1.5 released!

We’re excited to release neon v1.5 with Python 2 and Python 3 support, support for Pascal GPUs (GTX 1080) and performance enhancements such as persistent RNN kernels (based on the paper by Greg Diamos at Baidu), bringing a 12x performance gain compared to v1.4.0. Highlights from this release include:
* Python2/Python3 compatibility [#191]
* Support for Pascal…

Read more

#Release Notes

Apr 30, 2016   |   Jennifer Myers

neon v1.4.0 released!

Highlights from this release include:
* VGG16 based Fast R-CNN model using winograd kernels
* new, backward compatible, generic data loader
* C3D video loader model trained on UCF101 dataset
* Deep Dream example
* make conv layer printout more informative [#222]
* fix some examples to use new arg override capability
* improve performance…

Read more

#Release Notes

Feb 02, 2016   |   Arjun Bansal

neon v1.2 release: Kepler & AWS support are back, Deep ResNets, and more

We are excited to share neon’s v1.2 release with the community, which has several major features (Kepler support, new macrobatch and serialization enhancements) and examples, along with an expanded Model Zoo to help users get started with their use cases. New storage format (docs) and data loader for loading datasets that do not fit in…

Read more

#Release Notes #Scene Recognition

Jan 19, 2016   |   Scott Leishman

neon v1.1.5 released!

Highlights from this release include:
* CUDA kernels for lookuptable layer. This results in a 4x speedup for our sentiment analysis model example
* support for deterministic Conv layer updates
* custom dataset walkthrough utilizing bAbI data
* reduced number of threads in deep reduction EW kernels [#171]
* additional (de)serialization routines [#106]
* CPU…

Read more

#Release Notes
