Oregon State University Provides Power9 GPU Resources


By: Chris Sullivan, assistant director for biocomputing, Oregon State University Center for Genome Research and Biocomputing

The Oregon State University Open Source Lab (OSUOSL) and Center for Genome Research and Biocomputing (CGRB) are excited to now provide access to POWER9 AC922 Newell Systems (8335-GTG).

The AC922 is the newest in the IBM set of AI-based servers used by many of the Oregon State research groups to overcome limits when processing large data sets. To ensure developers can take full advantage of these exciting new machines, we are allowing free access to several of these AC922 setups. We believe these new machines significantly change the way we can address limits in scope and remove bias in the work we currently do. The only limit we see is having access to all the great open source tools available on other platforms –  providing developers with access can help overcome that problem.

The systems accessible to developers are set up with two processor sockets, offering 20-core (with 160 thread) at 3.0 GHz, four Tesla V100 with NVLink GPUs, 1TB of system memory, two 1.6TB CAPI-enabled NVMe SSD Controller and 40G network cards. These are the standard setups we look at for processing data as the high thread count on the CPU side allows us to process quickly along with the ability to do massive deep-learning and AI processing.

Using GPU’s to Classify Oceans of Data

For example, we currently take video from various locations in the ocean and process that data to identify all plankton to help manage ocean health. These AC922 machines are able to do all the video processing using FFMPEG with threading on the CPU side, generate images and then directly send the data to the GPUs with NVLink to process the images using a Convolutional Neural Network (CNN) to identify the plankton.

This is only one example where we can treat this machine as a cluster in a box and do all the work starting with video files and ending with CSV output with counts. We have found that the higher the threading the better the return when using the Power9 (as well as the Power8) processors.

Below is a list of processors we have available to test and some quick numbers showing the benefits of threading on these machines.

  EPYC 7601
32-Core 64 thread
Xeon E5-2620
8 core 16 thread
20 core 40 thread
  1200 MHz 3400 MHz 2016 MHz
  seconds s * MHz seconds s * MHz seconds s * MHz
Fibonacci 76.4435 91732.2000 53.8354 183040.3600 47.7507 96265.4112
Pi 154.2242 185069.0400 105.5235 358779.9000 129.1436 260353.4976
Float math 41.2044 49445.2800 34.5253 117386.0200 47.7137 96190.8192
Factorize 1 process 69.0709 82885.0800 58.8655 200142.7000 71.8679 144885.6864
Factorize 2 process 71.9220 86306.4000 48.7508 165752.7200 52.2643 105364.8288
Factorize 8 process 22.2354 26682.4800 18.2673 62108.8200 15.2357 30715.1712
Factorize 16 process 16.4457 19734.8400 15.1000 51340.0000 11.3186 22818.2976
Factorize 32 process 23.9592 28751.0400 23.7475 80741.5000 11.9565 24104.3040
Factorize 36 process 24.2955 29154.6000 25.7965 87708.1000 11.6990 23585.1840

Table 1:
Processing time for different calculations showing the lower times for Power9 machines. The big return on this hardware is the threading and this table shows over 2 times faster times on Power9 as we increase threads. Many groups have achieved an order of 4 times greater return when running against the most current x86-based machines.  

The CGRB is focused on working with processor companies that are changing the threading on CPUs and bringing GPUs into play, like IBM and the new AC922. Right now for workloads that take months to complete on x86 boxes we are working with developers to move tools to Power9 so we can take advantage of these returns. Because the value around these machines is centered on threading and AI, we invite developers to come and get free access to a few Power9 and other Power8 machines to port tools and optimize performance.

To get access, simply sign up for an account at the link below and we will get back to you.


AC922 Hardware: