### What is a Good PhD?

### - some «common sense» and personal views

Lasse Natvig Professor in <u>computer architecture</u> Lasse@computer.org



EANUTS reprinted by permission of United Feature Syndicate, Inc.

The Review Process (How to Deal with Editors) 123

From: How to Write and Publish a Scientific Paper, Robert A. Day 5<sup>th</sup> ed.

and Technology Norwegian University of Science

□ NTNU

### **Presentation Overview**



- What is a Good PhD?
  - Context
    - PhD theses I have supervised (6 + 3?) or evaluated (13)
  - <u>Quality</u>  $\rightarrow$  Importance of focus
    - Research group / Supervisor / PhD student
  - Reproducibility and testing (Method)
  - Quantity
    - 6 papers «The Reidar Model»
  - From NTNU regulations
  - Surprise



## From the official NTNU regulations

Underlining and coloring by Lasse

- From *Guidelines for the Assessment of Candidates for Norwegian Doctoral Degrees,* Section 3.2 Assessment of the thesis [NTNU12b]:
  - A Norwegian doctoral degree is awarded as proof that <u>the candidate's</u> research qualifications are of a certain standard
  - ... the academic standard and quality of the work submitted
  - ...the candidate must satisfy the *minimum requirements to qualify as a* researcher demonstrated through requirements related to the formulation of <u>research questions</u>, <u>precision and logical stringency</u>, <u>originality</u>, a good command of current methods of analysis and be able to <u>reflect</u> on their possibilities and limitations.
  - ... thesis must <u>contribute new knowledge</u> to the discipline and be of an academic standard <u>appropriate for publication</u> as part of the scientific literature in the field



– And more! ©



### CONTEXT AND FOCUS



### What is Computer Architecture?

- Computer architecture "is a specification detailing how a set of <u>software</u> and <u>hardware</u> <u>technology</u> …<u>interact</u> to form a computer <u>system</u> … ... determining the needs of the <u>user/system/technology</u>, and creating a logical <u>design</u> and standards based on those requirements" [Techop]
  - + performance evaluation
  - Includes parallel processing (My personal interest in 30 years)
- Broad knowledge vs. deep knowledge
- … There is an old saying, "Architects know a little about almost everything and an engineer knows a lot about almost nothing."[Career]



### How to focus within architecture?





### A PhD student must focus even more!

• JUMP to

The illustrated guide to a Ph.D by Matt Might





### **Research Workflow**



From: How to Write a Computer Architecture Paper, lecture about miniproject report writing in TDT4260 comp.arch [Jahre-14]



### REPRODUCIBILITY



### Abstraction/Models & Reproducibility

- Model of a system
  - Model the interesting parts with high accuracy
  - Model the rest of the system with sufficient accuracy
- "The Danger of Abstraction"
  - George E. P. Box:
    - "All models are wrong but some are useful"
    - "Remember that all models are wrong; the practical question is how wrong do they have to be to not be useful"
- Abstractions and simplifications
  - Even more important for small countries/groups!



• Hm...,

how to get people to trust our research?

- 100% precise documentation!
- <u>Reproducibility</u>





### Give "all" experimental details

| -                                         | Crossbar Based Architecture |           |           | Ring Based Architecture |           |           |
|-------------------------------------------|-----------------------------|-----------|-----------|-------------------------|-----------|-----------|
|                                           | 4-core                      | 8-core    | 16-core   | 4-core                  | 8-core    | 16-core   |
| ITRS Year of Production                   | 2007                        | 2010      | 2013      | 2007                    | 2010      | 2013      |
| Feature Size (nm)                         | 65                          | 45        | 32        | 65                      | 45        | 32        |
| Shared Cache Size (MB)                    | 8                           | 16        | 32        | 8                       | 16        | 32        |
| Memory Bus Channels                       | 1, 2 or 4                   | 1, 2 or 4 | 1, 2 or 4 | 1, 2 or 4               | 1, 2 or 4 | 1, 2 or 4 |
| Interconnect Latency (End-to-End/Per Hop) | 8/-                         | 16/-      | 30/-      | -/4                     | -/4       | -/8       |

#### Table III CACHE PARAMETERS

| Cache                  | Size                    | Associativity | Access Latency | Cycle Time | MSHRs / WB  | Banks | Area           |
|------------------------|-------------------------|---------------|----------------|------------|-------------|-------|----------------|
|                        | (4-core/8-core/16-core) |               | (cycles)       | (cycles)   | (per bank)  |       | $(mm^2)$       |
| Level 1 Private Cache  | 64KB                    | 2             | 3/2/2          | 2          | 16MSHRs/4WB | 1     | 2.3/1.1/0.5    |
| Level 2 Private Cache  | 1 MB                    | 4             | 9/6/5          | 4/3/2      | 16          | 1     | 14.6/7.0/3.6   |
| Level 2/3 Shared Cache | 8/16/32 MB              | 16            | 16/12/12       | 4          | 16/32/64    | 4     | 94.0/91.9/84.7 |

#### Table IV PROCESSOR CORE PARAMETERS

#### Table V INTERCONNECT AND DRAM INTERFACE

| Parameter               | Value                                 | Parameter             | Value                                                                                         |  |  |
|-------------------------|---------------------------------------|-----------------------|-----------------------------------------------------------------------------------------------|--|--|
| Clock frequency         | 4 GHz                                 | Crossbar Interconnect | 8/16/30 cycles end-to-end transfer                                                            |  |  |
| Reorder Buffer          | 128 entries                           |                       | latency, 32 entry request queue,<br>Dipatined (2006 pipe stages)                              |  |  |
| Store Buffer            | 32 entries                            | Dine Interconnect     | A/A/8 cycles par hop transfar latancy                                                         |  |  |
| Instruction Queue       | 64 instructions                       | King Interconnect     | $\frac{4}{4}$ cycles per hop traisfer fatency,<br>$\frac{1}{12}$ nine states per hop 32 entry |  |  |
| Instruction Fetch Queue | 32 entries                            |                       | request queue, 1/2/2 request rings, 1                                                         |  |  |
| Load/Store Queue        | 32 instructions                       |                       | response ring                                                                                 |  |  |
| Issue Width             | 4 instructions/cycle                  | Point to Point Link   | 4/3/2 transfer latency, 32 entry                                                              |  |  |
| Functional units        | 4 Integer ALUs, 2 Integer             |                       | request queue                                                                                 |  |  |
|                         | Multipy/Divide, 4 FP ALUs, 2 FP       | Main memory           | DDR2-800, 4-4-4-12 timing, 64 entry                                                           |  |  |
|                         | Multiply/Divide                       |                       | read queue, 64 entry write queue, 1                                                           |  |  |
| Branch predictor        | Hybrid, 2048 local history registers, |                       | KB pages, 8 banks, FR-FCFS                                                                    |  |  |
|                         | 4-way 2048 entry BTB                  |                       | scheduling [21], Closed page policy                                                           |  |  |

From: A Quantitative Study of Memory System Interference in Chip Multiprocessors, Jahre et al., HPCC09

## Reproducibility

*Ten Simple Rules for Reproducible Computational Research*, by Geir Kjetil Sandve et.al. [SNTH13]

- 1: For Every Result, Keep Track of How It Was Produced
- 2: Avoid Manual Data Manipulation Steps
- 3: Archive the Exact Versions of All External Programs Used
- 4: Version Control All Custom Scripts
- 5: Record All Intermediate Results, When Possible in Standardized Formats
- 6: For Analyses That Include Randomness, Note Underlying Random Seeds
- 7: Always Store Raw Data behind Plots Matplotlib, gnuplot
- 8: Generate Hierarchical Analysis Output, Allowing Layers of Increasing Detail to Be Inspected
- 9: Connect Textual Statements to Underlying Results
- 10: Provide Public Access to Scripts, Runs, and Results



Parallel computers using random numbers might execute nondeterministically



## More on reproducibility

- 4'th Int'l Workshop on Adaptive Self-tuning Computing Systems [ADAPT'14]
  - Two papers got the quality mark reproducible
- 1st ACM SIGPLAN Workshop on Reproducible Research Methodologies and New Publication Models in Computer Engineering [TRUST14]

### See also:

http://ctuning.org/reproducibility (Grigori Fursin)





### More on reproducibility

- *Repeatability* in Computer Science
- Techn. Report (68 pages)
- http://reproducibility.cs.arizona.edu/





### TESTING



### The importance of testing

- (Industry typically use 50% of work force for testing)
  - They cannot afford low quality
- Running benchmarks in computational comp.arch.
  - Common practice has not been perfect: Assumed OK if simulator does not crash





### From ADEPT workshop January 2014

Presented by David Black-Schaffer, Uppsala [SHBS14]:

|                                                                                                                                                                                                       | ic         | d Black-Schaffer Uppsala University / Department of Information Technology 1/21/14   24 |                                                      |                                                       |                                |                       |                            |  |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-----------------------------------------------------------------------------------------|------------------------------------------------------|-------------------------------------------------------|--------------------------------|-----------------------|----------------------------|--|
|                                                                                                                                                                                                       |            | An Aside: the Importance of Verification                                                |                                                      |                                                       |                                |                       |                            |  |
|                                                                                                                                                                                                       |            | Benchmark                                                                               |                                                      |                                                       | Verifies in<br>Reference       | Verifies<br>using VFF | Verifies when<br>Switching |  |
|                                                                                                                                                                                                       | 1          | 400.perlbench<br>433.milc<br>458.sjeng<br>471.omnetpp<br>483.xalancbmk                  | 401.bzip2<br>453.povray<br>462.libquantum<br>481.wrf | 416.gamess<br>456.hmmer<br>464.h264ref<br>482.sphinx3 | Yes                            | Yes                   | Yes                        |  |
|                                                                                                                                                                                                       |            | 410.bwaves<br>436.cactusADM<br>470.lbm                                                  | 434.zeusmp<br>444.namd                               | 435.gromacs<br>459.GemsFDTD                           | No                             | Yes                   | Yes                        |  |
|                                                                                                                                                                                                       |            | 445.gobmk                                                                               | 450.soplex                                           | 454.calculix                                          | Fatal Error <sup>1</sup>       | Yes                   | Yes                        |  |
|                                                                                                                                                                                                       |            | 429.mcf                                                                                 | 473.astar                                            |                                                       | Fatal Error <sup>2</sup>       | Yes                   | Yes                        |  |
|                                                                                                                                                                                                       |            | 437.leslie3d                                                                            |                                                      |                                                       | Fatal Error                    | Yes                   | Yes                        |  |
|                                                                                                                                                                                                       |            | 403.gcc                                                                                 |                                                      |                                                       | Fatal Error*                   | Yes                   | res                        |  |
|                                                                                                                                                                                                       |            | 447.dealII                                                                              |                                                      |                                                       | Fatal Error                    | Yes                   | Ves                        |  |
|                                                                                                                                                                                                       |            | 465.tonto                                                                               |                                                      |                                                       | 13/20 verified                 | 29/29 verified        | 28/29 verified             |  |
| 1. Simulator get                                                                                                                                                                                      | s stuck    |                                                                                         |                                                      |                                                       | 9/29 fatal                     | 29729 Vermed          |                            |  |
| <ol> <li>Triggers a memory leak causing the simulator crash.</li> <li>Terminates prematurely for unknown reason.</li> </ol>                                                                           |            |                                                                                         |                                                      |                                                       | Only 13 verify<br>when doing a |                       |                            |  |
| <ol> <li>Fails with internal error. Likely due to unimplemented instructions.</li> <li>Benchmark segfaults due to unimplemented instructions.</li> <li>30B instruction OoO simulation ons.</li> </ol> |            |                                                                                         |                                                      |                                                       |                                |                       |                            |  |
| 6. Terminated by                                                                                                                                                                                      | y internal | benchmark sanit                                                                         | y check.                                             |                                                       |                                |                       |                            |  |







# When is 6 papers good enough?

- First/main author of most
  - "If the thesis consists primarily of papers, the candidate must normally be the main author or first author of at least half the papers" [NTNU12a]
- At least 2 4 in high quality conferences or good journals
- All in acceptable journals, conferences or good workshops
  - IDI Relevant Conferences (357), A and B rating (can have weaknesses) [IDI-AB]
  - 1 (or maybe 2) can be in state submitted, if ...
- Watch out!
  - There are "fake conferences" and "bogus journals" (and websites)
    - Accepting papers written by paper-automata
  - You can easily get papers published that NEVER should have been published
  - Your and (your supervisors) responsibility



# PhD as a collection of papers

 If the thesis consists of several interrelated minor pieces of work, the candidate must document the integrated nature of the work and the assessment committee <u>must decide whether the content comprises a coherent entity</u>. In such cases, the candidate must compile a separate part of the thesis that not only summarizes but also compares the research questions and conclusions presented in the separate pieces... [NTNU12b]











Figure 3.1: Research process and relation of papers



### SURPRISE



# How to supervise within a topic you do <u>not</u> know?

- ... or know only to some extent
- Case b) Change of main supervisor (not common)
- Case a) Your own student working efficiently and independently/self-driven
  - A normal case, or ideal case
  - How well can the PhD student answer your questions?
  - Clear and precise descriptions?
  - "General attitude"
    - from "maximum quality" to ... (worst case) "don't care attitude"



### Motivate your supervisor!

- Use the time with the supervisor efficiently
- Be prepared
  - Bring results, ideas, questions
- Take notes
- Give your supervisor time to prepare
- Help him/her supervise
  - Write readable
  - Use figures, visualizations
  - Use abstraction
  - Be precise and pedagogical
- You have one project, your supervisor might have 10-30 "projects"



### Scientific writing, precision

- Notation/concepts
  - Often new concepts
  - Use best/most common terminology --- if it exist
  - Define your terminology precisely
  - Stick to it, be consistent!
- "help the reader"
- More (in Norwegian)
  - Lasse's enkle tips om rapportskriving



### References

Disclaimer: Some of these are "low-value references" (All are incomplete, but contain hyperlinks)

[ADAPT14] The 4'th Int'l Workshop on Adaptive Self-tuning Computing Systems, Vienna, January 2014

[Career] What Is The Difference Between Architecture And Civil Engineering?

[Djup08] Evolving Static Hardware Redundancy for Defect Tolerant FPGAs, PhD thesis by Asbjørn Djupdal

[Dybd07] Architectural Techniques to Improve Cache Utilization. Dr.ing. thesis by Haakon Dybdahl, 2007

[Hart05] Evolution of Fault and Noise Tolerant Digital Circuits, PhD thesis by Morten Hartmann, 2005

[IDI-AB] IDI Relevant Conferences (list for travel grants, A and B rating)

[Jahre10] Managing Shared Resources in Chip Multiprocessor Memory Systems, PhD thesis by M. Jahre, 2010

[Jahre14] <u>How to Write a Computer Architecture Paper</u>, lecture about miniproject report writing in course TDT4260 comp.arch, given by Nico this spring

[JN10] <u>Computational Computer Architecture Research at NTNU</u>, ERCIM News April 2010

[NTNU12a] Regulations For The Philosophiae Doctor Degree (PhD) at NTNU, 23 January 2012.

[NTNU12b] Guidelines for the Assessment of Candidates for Norwegian Doctoral Degrees, NTNU 13 June 2012

[SHBS14] Full Speed Ahead: Detailed Architectural Simulation at Near-Native Speed,

Andreas Sandberg, Erik Hagersten, and David Black-Schaffer. March 2014, Tech.report 2014-005

[SNTH13] Ten Simple Rules for Reproducible Computational Research, Geir Kjetil Sandve et.al., 2013

[Techop] Computer Architecture, from Techopedia

[TRUST14] <u>1st ACM SIGPLAN Workshop on Reproducible Research Methodologies and New Publication Models</u> in Computer Engineering, Edinburgh, 12 June 2014.



### Questions





### Visit the EECS website: http://www.ntnu.edu/ime/eecs/

**Contact:** *Lasse.Natvig@idi.ntnu.no* 

http://research.idi.ntnu.no/multicore/

