



# cs470 - Computer Architecture 1 Fall 2001

# **Midterm Exam**

open books, open notes

Starts: 18:25

Ends: 20:00

Name:\_\_\_\_\_(please print)

ID:\_\_\_\_\_

| Problem | Max points | Your mark | Comments |
|---------|------------|-----------|----------|
| 1       | 10         |           | 5+5      |
| 2       | 25         |           | 20+5     |
| 3       | 20         |           | 10+10    |
| 4       | 25         |           | 15+5+5   |
| 5       | 5          |           |          |
|         | 80         |           |          |



**1.** You know the following about the frequency of instructions in your favorite application (which may be a word processor, a spreadsheet or maybe a database):

|            | f <sub>i</sub> | CPI <sub>i</sub> |
|------------|----------------|------------------|
| ALU        | 50%            | 4                |
| Load/store | 40%            | 6                |
| Control    | 10%            | 5                |

a) compute the average CPI for your application;

b) compute the MIPS rating for your machine using the above table; assume a clock rate of 500 MHz.

**2.** Given the following piece of MIPS assembly:

```
.data 0x1000000
var1: .word 0x8192a3b4
var2: .word 0
      .text 0x400040
main: subu $sp, $sp, 4
            $ra, 0($sp)
      sw
      jal
            Mistery
      lw
            $ra, 0($sp)
      addu
            $sp, $sp, 4
      jr
            $ra
      .text 0x400100
Mistery:
      lui
            $t0, 8192
      lb
            $t1, X($t0)
                               \# X is the last digit of your SSN modulo 4
            $t1, 4($t0)
      sw
      jr $ra
```



a) Show the sequence of addresses issued by the CPU to execute this code. The initial value of the stack pointer (sp) is 0x7ffffec.

| Instruction | Address (in hexadecimal) | Read/Write |
|-------------|--------------------------|------------|
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |
|             |                          |            |

b) What is the final value of var2?

**3.** You have decided to upgrade your computer's hardware with a new disk and controller. This new disk makes every disk access two times faster. With the new disk *in use*, disk accesses account for 20% of the running time.

a) what is the overall speedup?

b) what is the overall speedup if you also improve the graphics system with a new graphics card (you'll have both the new disk and the new graphics card)? The new card will make all graphics 10 times faster. Graphics represent 40% of the workload of the original machine (before any improvment is done).

- **4.** You have been asked to compare the memory efficiency of two different styles of instruction sets, one for an *accumulator* architecture and the second for a two-address *load-store* architecture with 16 general purpose registers. The following can be assumed:
  - the opcode is always one byte
  - all memory addresses are 16 bit wide
  - all data operands are four bytes
  - all instructions are an integral number of bytes in length

There are no optimizations to reduce memory traffic, and the variables a, b, c, and d are initially in memory.

a) Write the two code sequences for the following C code:

a = b + c; b = a + c;

d = a - b;

# 

### Accumulator Architecture

### **Load-Store Architecture**

| Assembly Instructions | Instruction Bytes Fetched | Data Bytes Transferred |
|-----------------------|---------------------------|------------------------|
|                       |                           |                        |
|                       |                           |                        |
|                       |                           |                        |
|                       |                           |                        |



## Load-Store Architecture

| Assembly Instructions | Instruction Bytes Fetched | Data Bytes Transferred |
|-----------------------|---------------------------|------------------------|
|                       |                           |                        |
|                       |                           |                        |
|                       |                           |                        |
|                       |                           |                        |
|                       |                           |                        |
|                       |                           |                        |

b) Which architecture is more efficient as measured by code size?

c) Which architecture is most efficient as measured by total memory bandwidth required (code + data)?

**5.** What does *re-entrant code* mean?

6 of 6