Jennifer Myers

Senior Director Deep Learning Frameworks, Artificial Intelligence Products Group

We’re excited to release neon v1.5, which adds Python 2 and Python 3 support, support for Pascal GPUs (GTX 1080), and performance enhancements such as persistent RNN kernels (based on the paper by Greg Diamos at Baidu) that bring a 12x performance gain compared to v1.4.0.

Highlights from this release include:

  • Python2/Python3 compatibility [#191]
  • Support for Pascal GPUs
  • Persistent RNN kernels [#262]
  • Dataloader enhancements (audio loader with examples)
  • HDF5 file data iterator
  • Convolution kernel improvements
  • API documentation improvements [#234, #244, #263]
  • Cache directory cleanup
  • Reorganization of all unit tests
  • Bug fixes [#182, #183, #231, #241, #252, #253, #257, #259]

The RNN kernels were benchmarked on a Titan X with batch size 4 and 1152 activations. The Baidu kernels are the public ones from GitHub.

As always, you can grab this release from GitHub: https://github.com/NervanaSystems/neon
