IBM Watson
A Power 775 High Performance Computing Cluster
Dan Christiani
Ben Mihevc
Agenda
● What is Watson
● Power 775
● Software Stack
● Communication
● Applications and Motivation
● Wrap-up
[8]
What is IBM Watson?
● Watson is a project to use natural
language processing to analyse data.
● The goal is to search through data
and draw meaning from it.
● Made famous on Jeopardy in 2011,
where it defeated top opponents.
● As an adaptive system, it learns when it is wrong
and corrects itself
● A cognitive AI system for understanding complex data
and information.
● Multiple Incarnations in IBM research facilities
● Involved Universities: RPI, MIT, Carnegie Mellon, UT Austin, ...
[1][3]
The Power 775 Details
● 10 Drawers per rack
o 8 Nodes per drawer
o 4 Processors per node
▪ 3.83 GHz
o 8 Cores per processor
▪ 64 KB L1 cache (32 KB data, 32 KB instruction)
▪ 256 KB L2 cache
▪ 4 MB private L3 cache
▪ 32 MB shared L3 cache
[1][3]
The Power 775 Details cont.
● Makes for an impressive rack
o 80 Nodes
o 320 processors
o 2560 compute cores
o 4 hardware threads per core
o Theoretical computational limit: 78.43 Teraflops
● Interconnected racks can add power
o A small setup could utilize 10 Racks
▪ That is 25,600 compute cores.
[2][7][9][10]
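The per-rack totals above follow from multiplying the hierarchy out. The sketch below reproduces that arithmetic. The counts and clock speed come from the slides; the figure of 8 floating-point operations per core per cycle is an assumption that is not stated in the slides, chosen because it gives roughly 78.44 Teraflops, which agrees with the quoted 78.43 to within rounding. The name `cluster_totals` is purely illustrative.

```python
# Figures taken from the slides above.
DRAWERS_PER_RACK = 10
NODES_PER_DRAWER = 8
PROCESSORS_PER_NODE = 4
CORES_PER_PROCESSOR = 8
THREADS_PER_CORE = 4
CLOCK_GHZ = 3.83

# Assumption (not stated in the slides): each core completes
# 8 double-precision floating-point operations per clock cycle.
FLOPS_PER_CYCLE = 8


def cluster_totals(racks=1):
    """Return component counts and theoretical peak TFLOPS for `racks` racks."""
    nodes = racks * DRAWERS_PER_RACK * NODES_PER_DRAWER
    processors = nodes * PROCESSORS_PER_NODE
    cores = processors * CORES_PER_PROCESSOR
    threads = cores * THREADS_PER_CORE
    # cores * GHz * flops/cycle gives GFLOPS; divide by 1000 for TFLOPS.
    peak_tflops = cores * CLOCK_GHZ * FLOPS_PER_CYCLE / 1000.0
    return {
        "nodes": nodes,
        "processors": processors,
        "cores": cores,
        "threads": threads,
        "peak_tflops": peak_tflops,
    }


if __name__ == "__main__":
    for racks in (1, 10):
        print(racks, cluster_totals(racks))
```

Calling `cluster_totals(10)` gives the 25,600 cores quoted for a ten-rack setup.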
Software Stack
● The IBM Parallel Environment
o MPI, LAPI, OpenSHMEM
▪ Cross-core communication
o Parallel Operating Environment
▪ Provides job support across nodes
o LoadLeveler scheduler
▪ Manages job scheduling based on priority and available resources
o IBM Parallel Debugger (pdb)
▪ Similar to gdb, but for parallel applications
o Engineering and Scientific Subroutine Library (ESSL) and Parallel ESSL
▪ ESSL is a library of optimised mathematical functions
▪ Parallel ESSL is ESSL implemented with MPI for scalable parallel computation
[2][7][10]
IBM Parallel Environment
● Parallel Operating Environment (POE)
o Scalably manages over 1,000,000 tasks
o Organizes core dumps
o Supports many communication protocols
▪ MPI, LAPI, OpenSHMEM, etc.
o Runs on Linux or AIX 7.1
● IBM Parallel Debugger (pdb)
o Uses most of the same commands as gdb
o Capable of observing the state of running tasks
o Enables the user to start jobs under debug control or take running jobs and place them under debug control
[2][7][10]
IBM Parallel Environment
● LoadLeveler
o A parallel job scheduling application
o Creates a priority queue to determine which job runs next
▪ Considers: execution time, available resources, time of day, and administrative instructions
o Also ensures that jobs are only scheduled to functional machines
[2]
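The scheduling idea above can be illustrated with a toy priority queue. This is only a sketch under simplifying assumptions, not LoadLeveler's actual algorithm or interface: jobs are ordered by a hypothetical numeric priority and then by estimated run time, machines are plain dictionaries with a `functional` flag, and time of day and administrative instructions are left out. The function name `schedule` and all field names are invented for the example.

```python
import heapq


def schedule(jobs, machines):
    """Assign jobs to functional machines in priority order.

    Returns (assignments, waiting): a dict of job name -> machine names,
    and a list of job names that did not fit this round.
    """
    # Only machines flagged as functional are eligible to receive work.
    usable = [m["name"] for m in machines if m["functional"]]

    # heapq is a min-heap, so negate priority to pop the highest first.
    # Ties are broken by shorter estimated run time, then by name.
    heap = [(-j["priority"], j["est_minutes"], j["name"], j["nodes"])
            for j in jobs]
    heapq.heapify(heap)

    assignments, waiting = {}, []
    while heap:
        _, _, name, nodes = heapq.heappop(heap)
        if nodes <= len(usable):
            assignments[name] = [usable.pop(0) for _ in range(nodes)]
        else:
            waiting.append(name)
    return assignments, waiting


if __name__ == "__main__":
    machines = [
        {"name": "n1", "functional": True},
        {"name": "n2", "functional": True},
        {"name": "n3", "functional": False},  # broken node, never used
        {"name": "n4", "functional": True},
    ]
    jobs = [
        {"name": "A", "priority": 5, "est_minutes": 60, "nodes": 2},
        {"name": "B", "priority": 9, "est_minutes": 30, "nodes": 2},
        {"name": "C", "priority": 9, "est_minutes": 10, "nodes": 1},
    ]
    print(schedule(jobs, machines))
```

In this toy version a lower-priority job may still start if it fits in the machines left over, which is one possible policy among many.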
Physical Communication
● We will start from the lowest physical
component and work our way up
○ Processor level 0
○ Node level 1
○ Cabinet level 2
○ Rack level 3
● Collective Acceleration Unit (CAU)
○ A coprocessor for parallel applications
Physical Communication Level 0
[2]
Physical Communication Level 1
[2]
Physical Communication Level 2
[2]
Physical Communication Level 3
[2][6]
Basic Model and CAU flow
[2][6]
CAU Step 1
[2][6]
CAU Step 2
[2][6]
CAU Step 3
[2][6]
CAU Step 4
[14][4][5]
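The CAU flow is only outlined above. As a rough illustration, and assuming the CAU's role is to combine contributions from many participants along a tree (a common way to speed up collective operations such as reductions), the sketch below performs a pairwise tree reduction in plain Python. `tree_reduce` is a hypothetical helper written for this example; it is not an IBM interface and does not model the hardware.

```python
from operator import add


def tree_reduce(values, op=add):
    """Combine values pairwise, level by level, as in a reduction tree.

    Returns (result, levels), where levels is the number of combining
    rounds needed to reach a single value.
    """
    if not values:
        raise ValueError("need at least one value")
    level = list(values)
    levels = 0
    while len(level) > 1:
        # Combine neighbours (0,1), (2,3), ... in one round.
        nxt = [op(level[i], level[i + 1])
               for i in range(0, len(level) - 1, 2)]
        # An odd element out is carried up to the next round unchanged.
        if len(level) % 2:
            nxt.append(level[-1])
        level = nxt
        levels += 1
    return level[0], levels


if __name__ == "__main__":
    # Eight contributions are combined in three rounds instead of seven
    # sequential additions.
    print(tree_reduce(list(range(8))))
```

The point of the tree shape is that the number of rounds grows with the logarithm of the number of participants rather than linearly.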
What does this all create?
● Watson systems were designed for Data
Intensive Supercomputing (DISC)
● Search and understand massive stores of
data
○ Telescope Imagery
○ Medical Records
○ Financial Transaction Records
○ Web Pages or any other text
● DISC influences unique research
○ System design, resource management
○ Programming models, Algorithms
[13][4][5]
Motivation
● Pattern Recognition
o Focused on large unstructured data sources
● Healthcare
○ Patient Records
○ Diagnostics
○ Nurses' Notes
○ Discharge Papers
● Outcomes: +Diagnosis +Treatment +Care Intervention +Machine Learned Research
[12]
Oh! And we Forgot Jeopardy!
● The Jeopardy Challenge
o Competed against historic champions
▪ Ken Jennings and Brad Rutter
o 3 Day Tournament in 2011.
Results:
Ken Jennings: $24,000
Brad Rutter: $21,600
Watson: $77,147
Outcome: Definitive Win for Watson
“I for one welcome our new computer overlords” - Ken Jennings
[5][12]
To Wrap Up...
● Much more than Jeopardy
○ Cross-industry solution for understanding huge amounts of structured and unstructured data.
○ Machine Learning combined with advanced Natural Language Processing allows for powerful and accurate Question-Answer applications.
● Watson's special software is driven by parallel high-performance POWER 775 systems developed for Data Intensive Supercomputing.
Citations
[1] Maciej Cytowski. (2014, Feb.) PRACE. [Online]. http://www.prace-ri.eu/Best-Practice-GuideIBM-Power-775-HTML#id-1.3
[2] IBM. (2012, Oct.) IBM RedBooks. [Online]. http://www.redbooks.ibm.com/redbooks/pdfs/sg248003.pdf
[3] IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/hardware/775/specs.html
[4] IBM. IBM.com. [Online]. http://www03.ibm.com/systems/power/software/i/db2/products/smp.html
[5] Mike Cain. (2011, Dec.) IBM.com. [Online]. http://public.dhe.ibm.com/partnerworld/pub/pdf/courses/4aea.pdf
[6] (2013, Apr.) wikipedia.com. [Online]. http://en.wikipedia.org/wiki/IBM_RSCT
[7] IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/software/aix/resources.html
[8] IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/hardware/hpc.html
[9] IBM, "MPI Programming Guide. Parallel Environment Runtime Edition," 2013.
[10] IBM. (2013) IBM.com. [Online]. http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp?topic=%2Fcom.ibm.cluster.pe.doc%2Fpebooks.html
[11] Manoj Saxena, "IBM Watson Progress and 2013 Roadmap," IBM, 2014. [Online]. http://www.slideshare.net/manojsaxena2/ibm-watson-progress-androadmap-saxena
[12] David Ferrucci, Eric Brown, Jennifer Chu-Carroll, James Fan, David Gondek, Aditya A. Kalyanpur, Adam Lally, J. William Murdock, Eric Nyberg, John Prager, Nico Schlaefer, Chris Welty, "Building Watson: An Overview of the DeepQA Project," Association for the Advancement of Artificial Intelligence, 2010.
[13] IBM Corporation, "IBM Content and Predictive Analytics for Healthcare," IBM Corporation, Somers, 2012.