ROM Addresses: Getting BASIC to do What You Want

From Bitchin100 DocGarden
Jump to navigationJump to search

How does BASIC do what you tell it to do? Clues to the language's subservience lie in ROM.

By JAKE COMMANDER
Portable l00 September 1983, pg. 24

How on earth does Basic know what to do? All those state­ments, commands, and functions, yet the interpreter untiringly plods through your code always knowing what's required next. Just how does it do it?

Well, if the answer were simple, everybody would be writing BASIC interpreters and putting Microsoft out of business. But it is possible to follow at least some of the pathways Basic uses to perform its duties.

Most addresses of the ROM routines which comprise BASIC are held in two tables. These can be unraveled to give a list of routines used to perform various tasks.

Jump Addresses

One table contains jump addresses for the commands (or verbs, as it were) which will always be the first thing the interpreter picks up from a statement. The whole repertoire of such commands is catered for the table located at 0262 hex.

BASIC gets the appropriate jump address by using the token number for the command it's about to execute. All tokens are numbers from 128 to 255; therefore subtracting 128 gives numbers from zero to 127. As each jump address in the table is two bytes long, the token (minus 128) is multiplied by two to give an offset into the table. This points straight at the address which is needed. The two-byte address is picked up and jumped to ? and we're now executing a BASIC command in pure machine code.

What happens next depends entirely on the machine code for the command itself. Various syntaxes are allowed for some commands but not for others. For instance, the print command would allow an expression such as TAB(22);l/3, so would an LPRINT. But a LET would have none of that. LET X = TAB(22); 1/3 would have you on the carpet in no time.

Also various combinations of tokens can do different things. The comparison operators, for example, can be used pretty much interchangeably. These operators, >? <> = < , etc are all OK syntactically. This versatility means a table for such a wide set of possibilities is nigh impossible.

Second Table

However, there is a second table at location 004E in the ROM. This contains many addresses used in the evaluation of Basic math functions and expressions. These are extracted and jumped to in a similar fashion to the first table.

Any BASIC word excluded from either of these tables is handled separately by the interpreter according to its particular use. However, out of a possible 128 tokens, these two tables give us a mechanism by which we can follow the machine-code execution of many of them. It is the combination of these routines and the syntax checking required to logically execute them that makes up an interpreter.

The following list has been compiled from the two tables I've described and a disassembly of other parts of the ROM. It shows the entry points for all important BASIC statements and functions. Certain functions can have more than one possible syntactic use and the list does not cover all such uses. (An example is the statement OFF, which can be SOUND OFF or MOTOR OFF etc.) The list is in four columns. The first is the address in ROM where the BASIC word occurs in the vocabu­lary table. The second entry is the word itself. Third is the token assigned to that word when it is encoded by the BASIC interpreter.

Fourth Column

The fourth column contains the address the interpreter jumps to to execute the token representing the statement or function desired. Once again, some statements can have more than one use such as MID$(LH$)=RH$, and LH$=MID$(RH$). In these cases, two addresses are given: one for use on the left hand side of the equals sign and one for the right hand side of the sign.

Perhaps unsurprisingly, things get a little more complicated with the mathematical functions in BASIC. It's not simply a matter of taking an address for, say, a multiply routine and then jumping to it. The BASIC interpreter has to know the numeric type of operator it has to work on. For instance, with the addition operator, BASIC has four choices:

  1. Signed integer
  2. Single precision
  3. Double precision
  4. String

None of the other binary operators allow string manipulation, so they're limited to the numeric variable types only.

The addresses of these binary operators can be confirmed (if you need confirmation) from three short tables in ROM, one each for double precision, single precision, and integer numbers respectively.

The tables contain six addresses apiece for addition, subtraction, multiplication, division, exponentiation, and comparison. Rather than clutter the token-addresses table, these addresses are contained separately at the end.

In a following article, I'll be looking at ways to use some of these addresses in your own machine-code programs. For the more adventur­ous, an experiment will probably prove irresistible. Remember, though, in a RAM-file machine such as the Model 100, a lock-up may cost you all your files. Use caution.

Jake's ROM Addresses For BASIC Keywords

0080END80409F
0083FOR810726
0086NEXT824174
008ADATA83099E
008EINPUT840CA3
0093DIM85478B
0096READ86OCD9
009ALET8709C3
009DGOTO880936
OOA1RUN89090F
OOA4IF8AOB1A
OOA6RESTORE8B407F
00ADGOSUB8C091E
0082RETURN8D0966
OOB8REM8E09AO
OOBBSTOP8F409A
OOBFWIDTH9D1DC3
OOC4ELSE9109AO
OOC8LINE92OC45
OOCCEDIT935E51
OODOERROR94OBOF
00D5RESUME95OABO
OODBOUT9611OC
OODEON97OA2F
OOEODSKO$985071
OOE5OPEN994CCB
OOE9CLOSE9A4E20
OOEELOAD984D70
OOF2MERGE9C4D71
OOF7FILES9D1F3A
00FCSAVE9E4DCF
0100LFILES9F506F
0106LPRINTAOOB4E
010CDEFA10872
010FPOKEA2128B
0113PRINTA30B56
0118CONTA440DA
011CLISTA51140
0120LLISTA6113B
0125CLEARA740F9
012ACLOADA82377
012FCSAVEA92280
0134TIME$AA19AB 1904
0139DATE$AB19BD 1924
013EDAY$AC19F1 1955
0142COMAD1A9E
0145MDMAE1A9E
0148KEYAF1BB8
014BCLSBO4231
014EBEEPB14229
0152SOUNDB21DC5
0157LCOPYB31E5E
015CPSETB41C57
0160PRESETB51C66
0166MOTORB61DEC
016BMAXB77FOB 19DB
016EPOWERB81419
0173CALLB91DFA
0177MENUBA5797
017BIPLBB1A78
017ENAMEBC2037
0182KILLBD1F91
0186SCREENBE1E22
018CNEWBF20FE
018FTAB(COOC01
0193TOC1076B
0195USINGC24991
D19AVARPTRC3OF7E
01AOERLC4OF56
01A3ERRC5OF47
01A6STRING$C6296D
01ADINSTRC72A37
0182DSKI$C85073
01B7INKEY$C94BEA
01BDCSRLINCA1D90
01C3OFFC8various
01C6HIMEMCC1DB9
01CBTHENCDOB2A
01CFNOTCE1054
01D2STEPCFO783
0106+D0
01D7-D1
01D8*D2
01D9/D3
010A\D4
01DBANDD51097
01DEOR06108C
O1EOXORD710A2
01E3EQVD810AD
01E6IMPD910B5
01E9MODDA37DF
01EC\DB377E
01ED>DCOE29
01EE=DDOE29
01EF<DEOE29
01FOSGNDF3407
01F3INTE03654
OIF6ABSE133F2
01F9FREE22B4C
01FCINPE31100
01FFLPOSE410C8
0203POSE510CE
0206SQRE6305A
0209RNDE7313E
020CLOGE82FCF
020FEXPE930A4
0212COSEA2EEF
0215SINEB2F09
0218TANEC2F58
021BATNED2F71
021EPEEKEE1284
0222EOFEF1889
0225LOGF0506D
0228LOFF1506B
022BCINTF23501
022FCSNGF3352A
0233CDBLF435BA
0237FIXF53645
023ALENF62943
023DSTR$F7273A
0241VALF82A07
0244ASCF9294F
0247CHR$FA295F
0248SPACE$FB298E
0251LEFT$FC29AB
0256RIGHT$FD29DC
025CMID$FE2AC2 29E6
0260'FFOA90
	+       -      *      /        >    Cmpr
D.P.   2B78    2B69   2CFF   2DC7   3D8E    34FA
S.P.   37F4    37FD   3803   380E   3D7F    3498
INT    3704    36F8   3725   OFOD   3DF7    34C2
String 28CC                                 270C