IBM Watson A Power 775 High Performance Computing Cluster Dan Christiani Ben Mihevc Agenda ● ● ● ● ● ● What is Watson Power 775 Software Stack Communication Applications and Motivation Wrap-up [8] What is the IBM Watson ● Watson is a project to use natural language processing to analyse data. ● The goal is to search through data and draw meaning from it. ● Made famous on Jeopardy in 2011 where it successfully defeated top opponents. ● As an adaptive system, it learns when it is wrong and corrects itself ● A cognitive AI system for understanding complex data and information. ● Multiple Incarnations in IBM research facilities ● Involved Universities: RPI, MIT, Carnegie Mellon, UT Austin, ... [1][3] The Power 775 Details ● 10 Drawers per rack o 8 Nodes per drawer o 4 Processors per node 3.83 GHz o 8 Cores per processor 64 KB L1 cache (32 data, 32 instruction) 256 KB L2 cache 4 MB private L3 cache 32 MB shared L3 cache. [1][3] The Power 775 Details cont. ● Makes for an impressive rack o 80 Nodes o 320 processors o 2560 compute cores o 4 hardware threads per core o Theoretical computational limit: 78.43 Teraflops ● Interconnected racks can add power o A small setup could utilize 10 Racks That is 25600 compute cores. [2][7][9][10] Software Stack ● The IBM Parallel Environment o MPI, LAPI, OpenSHMEM Cross Core Communication o Parallel Operating Environment Provides job support across nodes o Load Leveler Scheduler Manages the job scheduling based on priority and available resources o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications o Engineering and Scientific Subroutine Library (ESSL) and Parallel ESSL ESSL is a library of optimised mathematical functions that have been optimised. Parallel ESSL is ESSL implemented with MPI for scalable parallel computation. [2][7][10] IBM Parallel Environment ● Parallel Operating Environment (POE) o Scalably manages over 1,000,000 tasks. o Organizes core dumps o Supports many communication protocols o MPI, LAPI, OpenSHMEM, etc… Runs on Linux or AIX 7.1 ● IBM Parallel Debugger (pdb) o Uses most of the same commands as gdb o Capable of observing the state of running tasks o It enables the user to start jobs under debug control or take running jobs and place them under debug control. [2][7][10] IBM Parallel Environment ● LoadLeveler o A parallel job scheduling application. o Creates a priority queue to determine which job is running. o Considers: execution time, available resources, time of day, and administrative instructions. Also ensures that jobs are only scheduled to functional machines. [2] Physical Communication ● We will start from the lowest physical component and work our way up ○ Processor level 0 ○ Node level 1 ○ Cabinet level 2 ○ Rack level 3 Collective Acceleration Unit (CAU) ○ A Coprocessor for parallel applications. ● Physical Communication Level 0 [2] Physical Communication Level 1 [2] Physical Communication Level 2 [2] Physical Communication Level 3 Physical Communication Level 3 Basic Model and CAU flow [2][6] [2][6] CAU Step 1 [2][6] CAU Step 2 [2][6] CAU Step 3 [2][6] CAU Step 4 [14][4][5] What does this all create? ● Watson system were designed for Data Intensive Super Computing (DISC) ● Search and understand massive stores of data ○ Telescope Imagery ○ Medical Records ○ Financial Transaction Records ○ Web Pages or any other text ● DISC influences unique research ○ System design, resource management ○ Programming models, Algorithms [13][4][5] Motivation ● Pattern Recognition o Focused on large unstructured data sources ● Healthcare ○ ○ ○ ○ ● Patient Records Diagnostics Nurses Notes Discharge Papers Outcomes: +Diagnosis +Treatment +Care Intervention +Machine Learned Research [12] Oh! And we Forgot Jeopardy! ● The Jeopardy Challenge o Competed against historic champions Ken Jennings and Brad Rutter o 3 Day Tournament in 2011. Results: Ken Jennings: $24,000 Brad Rutter: $21,600 Watson: $77,147 Outcome: Definitive Win for Watson “I for one welcome our new computer overlords” - Ken Jennings [5][12] To Wrap Up... ● Much more than Jeopardy ○ Cross industry solution for understanding huge amounts of structured and unstructured data Information. ○ ● Machine Learning combined with advanced Natural Language Processing allows for powerful and accurate Question-Answer applications. Watson’s special software is driven by Parallel High Performance POWER 775 systems developed for Data Intensive Supercomputing. Citations [1] [6] [7] Maciej Cytowski. (2014, Feb.) PRACE. [Online]. http://www.prace-ri.eu/Best-Practice-GuideIBM-Power-775-HTML#id-1.3 IBM. (2012, Oct.) IBM RedBooks. [Online]. http://www.redbooks.ibm.com/redbooks/pdfs/sg248003.pdf IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/hardware/775/specs.html IBM. IBM.com. [Online]. http://www03.ibm.com/systems/power/software/i/db2/products/smp.html Mike Cain. (2011, Dec.) IBM.com. [Online]. http://public.dhe.ibm.com/partnerworld/pub/pdf/courses/4aea.pdf (2013, Apr.) wikipedia.com. [Online]. http://en.wikipedia.org/wiki/IBM_RSCT IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/software/aix/resources.html [8] IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/hardware/hpc.html [9] IBM, "MPI Programming Guide. Parallel Environment Runtime Edition," 2013. [10] IBM. (2013) IBM.com. [Online]. http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp?topic=%2Fcom.ibm.cluster.pe.doc%2Fpebooks.html [11] Manoj Saxena, "IBM Watson Progress and 2013 Roadmap," IBM, 2014. [Online]. http://www.slideshare.net/manojsaxena2/ibm-watson-progress-androadmap-saxena [12] Eric Brown, Jennifer Chu-Carroll, James Fan, David Gondek, Aditya A. Kalyapur, Adam Lally, J. Willian Murdock, Eric Nyberg, John Pragen, Nico Schlaefer, Chris Welty David Ferrucci, "Building Watson: An Overview of the DeepQA Project," Association for the Advancement of Artificial Intelligence, 2010. [13] IBM Corporation, "IBM Content and Predictive Analytics for Healthcare," IBM Cororation, Somers, 2012. [2] [3] [4] [5]
© Copyright 2024 ExpyDoc