Practical 3: RISC-V Coding and Procedures
1 Objectives
Following completion of this practical you should be able to:
- Write loops in RISC-V programs.
- Understand the limitations of
beq/bne/blt/bge
,jal
, andjalr
. - Properly access memory in RISC-V programs.
- Understand some of the issues surrounding register allocation.
Your Tasks
This practical will be done alone, each student is expected to write their own code and demonstrate an understanding of assembly programming.
0 Get your repo
Follow this link to get your new repository for this practical. Follow the instructions in Practical 1 if you have trouble getting access or setting up authentication.
1 Setup RARS
Install
RARS is a Java program and should work on any machine that supports a Java Virtual Machine.
If you need java, you can get it here.
Download RARS from here or from the main RARS webpage:
https://github.com/TheThirdOne/rars/releases
.
On Windows and MacOS you should be able to double click the .jar
file.
Sometimes you need to launch it from a command line. Example for running it from a linux command line:
> java -jar rars1_6.jar
Note some MacOS machines will not allow normal file navigation unless you launch the jar from the command line using the linux instructions above.
Some help and info pages are available on the RARS github page.
Navigation
Launch RARS. A window should appear named similar to "RARS 1.5" with three panes.
- The right-most pane shows the contents of the registers.
At the bottom of the register list is the
pc
register, which tracks the address of the current instruction. The tabs at the top of the pane allow you to look at control and status registers. If an error occurs in your code, the simulator might automatically show you the values of the "control and status" registers. We will not use these for a while, so you should switch back to the general registers for now. - The large upper-left pane shows the contents of the currently loaded file. It should be blank when you start up RARS. There are two tabs at the top of the pane.
- The
Edit
tab allows you to edit the currently loaded assembly file. - The
Execute
tab shows the assembled code and processor state as the code runs.
- The
- The bottom panel is the output console. The 'Messages' tab displays messages from the simulator. The 'Run I/O' tab shows any messages generated from the running program.
Running a program
-
There should be a
practical03
directory in your repository. Inside, there should be ap1
folder. Inside that directory there should be ap1.asm
file. -
Open it in RARS, by going to File > Open and browse to your
practical01/p1
folder. The file's contents should appear in the editor panel. Look over the file.- This file contains two main parts the
.globl
part defines what is in memory, this file simply defines a global variablemain
which allows us to refer to the labelmain:
in this file from other files. We can define and add new data to memory here, including lists, constants, etc. - The
.text
part defines the actual assembly code in this program. Labels can be added arbitrarily to this code (no need to declare a global variable for most labels). You can refer to registers by number or name.
- This file contains two main parts the
-
Click the Assemble button or go to Run > Assemble to assemble the file. The assembler will translate the assembly instructions into machine code ready for execution. RARS will automatically switch to the
Execute
tab. The execute tab shows two views of memory: the "text segment" (the program's instructions) and the "data segment" (the data stored for the program).-
The text segment window looks messy, but each instruction has five columns of data. The first column is a checkbox that allows us to set a breakpoint. The second is the address of the instruction. The third item is the hexadecimal representation of the machine instruction. The fourth item is (basically) the assembly language representation of the machine instruction. The fifth item (if it's present) is the actual line of code (and line number) from the source file that generated the machine instruction shown in the "Basic" column.
-
The data segment window shows the contents of memory, including the stack contents. You can jump to different regions in memory using the drop-down selector. For now, we'll only be concerned with the (.data) region.
Look at the text segment tab. Notice that instructions start at address
0x00400000
and continues to address0x00400018
. Also notice that the PC (program counter) register has the value 0x0400000. -
-
When the program is assembled, the "ori x2, x0, 0x00000028" instruction will be highlighted.
-
Notice the value of the PC.
-
Notice that register 0 currently contains zero and register 2 currently contains a big number.
-
Click Run > Step or use the Step button to run the currently highlighted line. Notice that the contents of register
2
have changed, and that the new value is shown in hexadecimal form. Finally, notice the value of the PC. -
Click Run > Reset or use the reset button. The register and text segment contents will return to the state prior to you running the program. This allows you to rerun the program easily.
-
Practice editing, navigating through the code, and executing it until you are comfortable using RARS.
2 Summing an array
In writing these and other assembly language programs you should follow this process:
-
Solve the problem before coding the solution. Usually this means writing the code in a high-level language or pseudo-code first, then converting it to assembly language.
-
IMPORTANT: Write down (perhaps in a comment in your code) the purpose that you have in mind for each register that you use. Here's an example of how you might document register use in your code.
; x3 = i ; x4 = j ; x5 = n ; x6 = key ; x7 = ptr (address of array) ; x8 = test ; x9 = temp
-
These programs involve many loops, so it will be helpful to set breakpoints. Breakpoints mark positions in the code where execution should pause. After assembling a program, click the Bkpt checkbox to set a breakpoint at that line. Execution will stop before that line is run and you can observe the state of the program. Note: All breakpoints are deleted when you assemble the program!
-
All of these programs require RARS to be configured to assemble all files in the directory. To do so, enable Settings > Assemble all files in directory.
-
IMPORTANT: Do not use registers 1 and 2 for this practical.
-
Your solutions to these practicals should not use any pseudoinstructions. The only exception is the
li
pseudoinstruction.
Items below marked with a Q are to be answered on the practical worksheet.
- Open the
p2/p2.asm
file in RARS. This goal of this program is to sum the integers between 0 and an \(N\). - Q Where are
main
,loop
,done
,N
andSum
located (i.e. what address)? Hint: Assemble and use Settings > Show Labels Window in the Execute view. - Before you run the program, calculate the number of instructions
that will be executed within
p2
and the final value ofSum
. Only count the instructions that run between the test setup and teardown (thejal
and thejalr
). - Execute the program and verify that the program sums the values between 0 and
N
correctly. - Modify
p2.asm
so that it will still calculate the sum correctly ifN
is equal to 0. Be sure to test your modifications to make sure they work. You can do this by changing the value of thetest0
variable from 0 to 1. You should also test that it works withN
equal to values 1 to 5. Set thetestN
variable to 1 to test this. - Q Assemble the code and set a breakpoint at address
0x00400004
. Use Run > Go to run the program and it should stop at the breakpoint. Single step to the end of the program. How many instruction does your modified program execute whenN
is equal to 5? Can this be improved? If so, how? Hint: If your modified program executes more than 1 additional instruction, you can do better. - Q Will your modified program work if
N
is less than 0? You do not need to make any additional modifications.
3 Swap max with last
- Note: You can use whatever registers you want in this part, but you will have an easier time on future parts if you avoid using any s-registers now.
- Open the
p3/p3.asm
file. This goal of this program is to find the maximum element in an array and put it at the end of the array. - Assemble the code and set a breakpoint at address
0x00400004
. This is the start of the program. Set another breakpoint at address0x00400050
. This is the end of your program (should be a jalr). - Q Run
p3.asm
until your second breakpoint. What is the value of max and maxindex at the end of the program? Are they what you expect? - Q Comment out
slli x10, x7, 2
and rerun the program. What happens? Compare the address table between the program with and withoutslli
. Which is correct and why? - Undo the changes made in Step 4.
- Modify
p3.asm
so that the largest element ofA
is swapped with the last element ofA
. Be sure to adequately test your modified program. Hint: change the elements and size ofA
, then check your results in the Data segment. - Q If you repeatedly apply your modified program to the subarrays of
A
from 0 to \( N-i \) where \(i\) is the number of times you've applied your program, what is the final state ofA
? - Q Like
p2.asm
this program doesn't work ifN
is equal to 0. It is brittle in other ways as well. For example, what happens if all of the elements are less than -1? What if they are all -2^{31}, the max negative value? Is there a more robust way to ensure the max value is identified? - Try to fix
p2.asm
so that it works with all values (including all negative values).
4 Making a procedure
- NOTE: You may not use the callee-saved (s) registers for this portion of the practical. Make sure to avoid x8-x9 (s0-s1), or x18-x25 (s2-s11). We recommend you use register names (t0-t6 and a0-a7) instead of numbers (xN).
-
Open the files
p4/p4-loop.asm
andp4/p4-swap.asm
in RARS.Note the two locations marked for adding your code.
-
In the previous part of this practical, you worked with a program for swapping the maximum element of an array with the last element. Copy your code for "swapping the maximum value of an array with the last element of the array". You'll need all the code between the label
p3:
and the 'jalr'. Don't copy the label or the 'jalr'.If you have not already addressed the issue discussed in the last step (7) of "Swap max with last", you'll need to do so now.
-
Add your code to
p4-swap.asm
in the spot indicated below the labelSwapMaxWithLast:
. -
Modify your swap code so that it complies with the documentation and specifications in
p4-swap.asm
. Note,SwapMaxWithLast
is a procedure which takes 2 arguments - the location (address) of an array of words in memory and the length (in words) of the array - and order matters. Be sure your code conforms with the RISC-V procedure calling conventions. Specifically, you will need to:-
Get the address of "
A
" from an argument register rather than doing a "la
" on a label. -
Get the value of "
N
" from an argument register rather than loading it from memory. -
Return from the procedure by doing a "
jalr x0, 0(x1)
" (this should already be done in the code provided in p4-swap.asm). -
You may need to reallocate registers or manage the stack to comply with the RISC-V procedure calling convention. Especially to avoid s registers x8-x9 or x18-x25.
-
Note that you will not be able to assemble and run this file on it's own now. Since you made this a procedure the arguments must be set up by some other piece of code. You'll write that next. If you get weird behavior while debugging stop and check if you tried to assemble this file on its own. Always check what is in the argument registers first while debugging.
-
-
Modify the
p4
procedure inp4-loop.asm
so that it callsSwapMaxWithLast
a single time withA
andN
as arguments. What output do you expect when you runp4-loop.asm
? -
Run
p4-loop.asm
. Is the actual output what you expected? -
Replace your call to "
SwapMaxWithLast
" with a call to the procedure "ProcedureConventionTester
". This procedure takes the same arguments asSwapMaxWithLast
and calls "SwapMaxWithLast
", but also checks for compliance with the RISC-V procedure call convention. Run the program with the new call. If the test fails, fix your code so it can pass the test. -
Modify the procedure
p4
inp4-loop.asm
so that it callsSwapMaxWithLast
\( N-1 \) times and with each successive call the length of the array passed is decreased by 1. See the comments inp4-loop.asm
for exactly where to put your code. Do not use s registers. In pseudocode:for (i=N; i>1; i--) { SwapMaxWithLast(A, i); }
What output do you expect when you run
p4-loop.asm
? -
Run
p4-loop.asm
. Is the actual output what you expected? -
Again test your compliance with the RISC-V procedure calling convention by calling "
ProcedureConventionTester
" instead of "SwapMaxWithLast
" -
Q If
SwapMaxWithLast
needed to return a value how would that be accomplished? What if it needed to return more than 2 values? -
Q What changes would you need to make if
SwapMaxWithLast
called another procedure in order to follow procedure calling conventions? What changes would you need to make top4-loop.asm
if the entirety ofp4-loop.asm
needed to be converted into a procedure (which in turn callsSwaMaxWithLast
itself)?
5 Fixing a broken procedure
You have been given a broken implementation of a recursive procedure. The procedure call example on page 108-110 of the book and the factorial example posted on Moodle may be helpful in understanding recursive procedures.
The Fibonacci sequence is defined over nonnegative integers as follows:
$$ \begin{align*} F(0) = 0\\ F(1) = 1\\ F(i) = F(i-1) + F(i-2), i \geq 2\\ \end{align*}$$While there are several ways to calculate the Fibonacci sequence, for this practical you must use a recursive procedure.
The sequence should be:
N | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | .. |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Fibonacci number | 0 | 1 | 1 | 2 | 3 | 5 | 8 | 13 | 21 | 34 | 55 | .. |
Note: You can (and must) use s-registers for this part of the practical.
-
Open
practical03/fib/fib.asm
in RARS. -
This file contains code to execute a recursive procedure which takes one argument i and returns F(i). Pseudo code is provided below - your final code must follow the algorithm presented below.
int fib(int n) { if (n == 0) { return 0; } else if (n == 1) { return 1; } else { return fib(n-1)+fib(n-2); } }
-
The provided code does not follow the RISC-V calling conventions, therefore it loops infinitely. You need to edit this code to follow the conventions.
- We recommend you begin by renaming all the registers from x1-x30 to their
named counterparts (e.g.,
t0
,sp
,ra
,s3
, etc). - Q Next, look at the registers being used and notice that
s
registers are being used to store bothn
andfib(n-1)
. Couldt
registers be used instead? Which would you choose and explain your choice. - Implement your choice of saving strategy using a stack
frame inside your
fib
procedure. Note thatfib
is both caller and callee because it is recursive! - The comment block at the top of
fib.asm
describes how you should arrange stack frames for this code. Your stack frame should be exactly 4 words, specified in the comments at the top of the file. Be sure any changes you make to your code conforms with these specifications as well as the RISC-V procedure calling conventions. Your code will work with alternative stack frames, but you will not receive full credit if you do not follow these requirements. You should only have to modify the code where the comments suggest allocation and deallocation of the stack.
-
Q When you have correctly implemented the calling conventions run
fib.asm
. Does it behave as expected? If you callfib(4)
, how many times isfib
recursively called? -
In your
fib
procedure, replace the calls tofib
with calls tofibtest
. This is similar to theProcedureConventionTester
from the previous question. -
Test your program with the new
fibtest
calls. If there are any issues, fix them. Once your program works correctly, restore the original calls tofib
. Note: this test confirms that your code follows most of the calling conventions, but doesn't test them all. You should check the green sheet to make sure you haven't missed anything. -
Complete the practical question sheet. You will need to trace the execution of the program and record values from the stack. You can view the stack in RARS: in Execute view, the Data Segment window has a drop down that allows you to check memory at
current sp
. Use this to see the stack frames.
Grading Rubric
All the practicals for CSSE232 have these general requirements:
General Requirements for all Practicals
- The solution fits the need
- Aspects of performance are discussed
- The solution is tested for correctness
- The submission shows iteration and documentation
Some practicals will hit some of these requirements more than others. But you should always be thinking about them.
Fill out the Practical Worksheet
In the worksheet, explain how you satisfy each of these items. Some guidelines:
-
None of these answers should be more than 100 words. (Unless otherwise indicated on the worksheet.)
Practical 3 Rubric items Possible Points Practical Worksheet 50 Sum (p2) 5 Swap (p3) 15 Sort (p4) 15 Fib 15 Total out of 100
- Submit your completed worksheet to gradescope. You will not upload code to gradescope for this practical. For the "Code" question just select the first page of your worksheet if gradescope prompts you to assign one.
- Push your code to your github repo. (Make sure you push, dont just commit.)