A Stack-based String Library for TurboForth

[ Author's Note: This paper is adapted from a paper that I wrote describing a string library that I developed for ANS Forth systems. The code presented at the end of this paper has been modified where appropriate for compatibility with the Forth-83 standard, and, specifically, TurboForth V1.2. The original ANS paper can be downloaded as a PDF. ]

Introduction – The Concepts behind the Library

Coding Conventions

Stack Notation

Normal Forth stack notation conventions are used. Where words have an effect on the string stack, the string stack effects are shown alongside the normal data stack effects.
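
For instance, the header of VAL$ in the source listing at the end of this paper carries both kinds of stack comment. It is quoted here as a comment, purely to illustrate the notation:

```forth
\ word    data stack     string stack
\ val$    ( -- ud )      ( ss: str -- )
```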

The above example indicates that the word VAL$ takes a string from the string stack and results in an unsigned double being pushed to the data stack.

String Stack Library Download

The String Stack Library is supplied on the TurboForth Tools disk, in block format, ready to load. With the Tools disk in DSK1, boot TurboForth and load the menu from the disk, or simply type 8 LOAD to load the library directly.

Acknowledgements

Whilst the code presented here is original, the concepts used in it are based on concepts developed by Brian Fox, who developed a string stack library originally for TI-Forth, and also HsForth for DOS, circa 1988. Brian was kind enough to correspond with me on the subject of string stacks, and kindly shared his code. This author extends his sincere thanks to Brian for his generosity.

String Constant Words

Since only a handful of words are associated with string constants, they will be documented first:

$CONST ( max_len tib:"name" -- ) ( runtime: -- $Caddr) “string constant”

The word $const declares a string constant. Declared at compile time, string constants require a maximum length and a name. For example:
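
A declaration matching the description that follows might look like this, assuming the library has been loaded:

```forth
50 $const welcome    \ string constant named welcome, at most 50 characters
```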

The above example declares a string with a maximum size of 50 characters. It shall be referenced in code using the name welcome.

Note the runtime stack effect: when the name of a string constant is referenced at run-time, it pushes its address to the data stack. The label $Caddr indicates that it is the address of a string constant. String constants push the address of their maximum length field, which can be read with the word maxLen$.

MAXLEN$ ( $Caddr -- max_len ) “maximum length of string”

Given the address of a string constant on the data stack, the word maxLen$ returns the maximum allowed string length for that string constant.

:=" ( $Caddr tib:"string" -- ) “assign string constant”

Given the address of a string constant on the data stack, the word :=" initialises the string constant with the string from the terminal input buffer.

.$CONST ( $Caddr -- ) “display string constant”

Given the address of a string constant on the data stack the word .$CONST shall display the string.
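
A short sketch of assignment followed by display, assuming the library is loaded and a 50-character string constant named welcome already exists. The greeting text is made up for illustration:

```forth
welcome :=" Hello and welcome!"   \ store the text in the string constant
welcome .$const                   \ displays: Hello and welcome!
```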

CLEN$ ( $Caddr -- len ) “string constant length”

Given the address of a string constant on the data stack, the word clen$ returns its actual length on the data stack. The word maxLen$ can be used to determine the maximum length of a string constant.
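
The two lengths can be compared as follows. This is a sketch that assumes welcome was declared with a maximum length of 50 and has since been assigned some text:

```forth
welcome clen$ .      \ displays the number of characters currently stored
welcome maxLen$ .    \ displays: 50
```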

>$ ( $Caddr -- ) ( ss: -- str) “to string stack”

Given the address of a string constant on the data stack the word >$ copies the contents of the string to the string stack where it can be manipulated.
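
For example, assuming the string constant welcome has been declared and assigned a value:

```forth
welcome >$    \ copy the contents of welcome to the string stack
```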

Note that the string stack has received a copy of the string contained within welcome. The string welcome still exists as a string constant.

String Stack Words

The convention within this document is to refer to strings that exist on the string stack as transient strings. They are referred to as transient strings because they generally only exist for a short time on the string stack. Strings are placed on the string stack (which is separate from the data and return stacks) and then manipulated in some way before being consumed. Memory allocation and de-allocation is managed by virtue of the strings being on the stack, in the same way that the size of the data stack is managed by simply adding or removing values on the data stack.

$" ( tib:"string" -- ) ( ss: -- str) “string to string stack”

The word $" takes a string from the terminal input buffer and pushes it to the string stack. The end of the string is indicated by a quotation mark.
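
Typed at the command line, with the library loaded, this might look as follows:

```forth
$" Hello, world!"    \ the text up to the closing quote goes to the string stack
```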

In this example the string “Hello, world!” is pushed directly to the string stack, thus becoming the top item on the string stack.

Note that $" is a state-smart word. It can be used in both colon definitions and also directly at the command line. The correct action will be taken in either case.
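
A small sketch of the compiled case, assuming the library is loaded. The word name greet is hypothetical and chosen only for this illustration:

```forth
: greet ( -- ) ( ss: -- )
    $" Hello, world!" .$ ;    \ the string is compiled into the definition

greet                         \ displays: Hello, world!
```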

In order that the run-time actions of $" may be compiled into a definition if so desired, the run-time action of this word is encapsulated within the word ($"). Therefore if the run-time behaviour of this word is to be compiled into another word one must compile, or postpone, the word ($").

DUP$ ( -- ) ( ss: s1 -- s1 s1) “duplicate string”

The word DUP$ pushes a copy of the topmost string to the string stack.

DROP$ ( -- ) ( ss: str -- ) “drop string”

The word DROP$ removes the topmost string from the string stack.
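
The following sketch, assuming the library is loaded, pushes two strings and immediately drops the second:

```forth
$" Hello, World!"    \ pushed to the string stack
$" How are you?"     \ pushed on top of it
drop$                \ removes "How are you?" again
```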

At this point the string “Hello, World!” is the topmost string on the string stack. “How are you?” was pushed onto the string stack, but it was immediately dropped.

SWAP$ ( -- ) ( ss: s1 s2 -- s2 s1) “swap string”
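
As a starting point for a swap, two strings might be pushed like this (a sketch, assuming the library is loaded):

```forth
$" Hello, World!"    \ pushed first
$" How are you?"     \ pushed second
```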

At this point the string “How are you?” is the topmost string on the string stack. If SWAP$ is executed, the two strings are exchanged on the string stack.
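
Continuing the sketch, and assuming “Hello, World!” and “How are you?” were pushed in that order:

```forth
swap$    \ "Hello, World!" is now the topmost string
.$       \ displays: Hello, World!
```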

NIP$ ( -- ) ( ss: s1 s2 -- s2) “nip string”

The word NIP$ removes the string underneath the topmost string from the string stack.
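
A sketch of the starting position, assuming the library is loaded:

```forth
$" red" $" blue"    \ push "red", then "blue"
```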

At this point, “blue” is on the top of the string stack, with “red” underneath it.
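
With “red” and “blue” on the string stack as described, the next step would be:

```forth
nip$    \ remove the string underneath the topmost string
```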

At this point, “red” has been removed from the string stack. “blue” is the topmost string.

OVER$ ( -- ) ( ss: s1 s2 -- s1 s2 s1 ) “over string”

The word OVER$ pushes a copy of the string s1 to the top of the string stack, above s2.
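
For example, as a sketch with the library loaded:

```forth
$" red" $" blue"    \ ss: red blue
over$               \ ss: red blue red
.$                  \ displays: red
```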

ROT$ ( -- ) ( ss: s3 s2 s1 -- s2 s1 s3) “rotate strings”

The word ROT$ rotates the top three strings to the left. The third string (prior to the execution of ROT$) moves to the top of the string stack.
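
A sketch of the rotation, assuming the library is loaded and the string stack has room for the temporary copies:

```forth
$" red" $" green" $" blue"    \ ss: red green blue
rot$                          \ ss: green blue red
.$                            \ displays: red
```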

Note: For ease of implementation, this routine copies (using PICK$) the strings to the top of the string stack in their correct final order, then removes the 3 original strings underneath. Consequently, it is possible to run out of string stack space. If this happens, the condition will be correctly trapped in (set$SP).

-ROT$ ( -- ) ( ss: s3 s2 s1 -- s1 s3 s2) “rotate strings”

The word -ROT$ rotates the top three strings to the right. The top string (prior to the execution of -ROT$) moves to the third position.

Note: For ease of implementation, this routine copies (using PICK$) the strings to the top of the string stack in their correct final order, then removes the 3 original strings underneath. Consequently, it is possible to run out of string stack space. If this happens, the condition will be correctly caught in (set$SP).
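
The same three strings rotated the other way, again as a sketch with the library loaded:

```forth
$" red" $" green" $" blue"    \ ss: red green blue
-rot$                         \ ss: blue red green
.$                            \ displays: green
```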

>$CONST ( $Caddr -- ) ( ss: str -- ) “to string constant”

The word >$CONST takes the topmost string from the string stack and moves it into the string constant whose address is on the data stack.
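
A minimal sketch, assuming the library is loaded. The maximum length of 10 given to the constant colour is chosen here purely for illustration:

```forth
10 $const colour          \ declare a string constant
$" red" colour >$const    \ move "red" from the string stack into colour
```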

At this point, the string constant colour has the value “red”. To verify, display the string using .$CONST as follows:
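
Assuming colour has been declared and assigned as described:

```forth
colour .$const    \ displays: red
```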

+$ ( -- ) ( ss: str1 str2 -- str1&str2 ) “concatenate strings”

The word +$ replaces the top two strings on the string stack with their concatenated equivalent.
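
For example, as a sketch with the library loaded:

```forth
$" red" $" blue"    \ ss: red blue
+$                  \ ss: a single string built from the two
```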

At this point, “red” and “blue” have been removed from the string stack. The topmost string on the string stack has the value “redblue”, as noted in the comments of the source listing. Note that the topmost string goes to the right of the newly concatenated string.

LEN$ ( -- len ) ( ss: -- ) “length of string”

The word LEN$ returns the length of the topmost string on the data stack. The string is retained.
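
A one-line sketch, assuming the library is loaded:

```forth
$" Hello" len$ .    \ displays: 5
```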

MID$ ( start len -- ) ( ss: str1 -- str1 str2 ) “mid-string”

The word mid$ produces a sub-string on the string stack, consisting of len characters taken from the topmost string, starting at character start.

Note, as indicated in the string stack signature, the original string (str1) is retained. Note also that the first character in the string (the leftmost character) is character number 0.
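
The sketch below follows the ( start len -- ) signature used in the source listing and assumes the library is loaded:

```forth
$" redgreenblue"    \ characters are numbered from 0
3 5 mid$            \ 5 characters, starting at character 3
.$                  \ displays: green
```

The original string is still on the string stack after the sub-string has been displayed.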

LEFT$ ( len -- ) ( ss: str1 -- str1 str2 ) “left of string”

The word left$ pushes the leftmost len characters to the string stack as a new string. The original string is retained.
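
For example, as a sketch with the library loaded:

```forth
$" redgreenblue"
3 left$ .$          \ displays: red
```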

RIGHT$ ( len -- ) ( ss: str1 -- str1 str2 ) “right of string”

The word right$ causes the rightmost len characters to be pushed to the string stack as a new string. The original string is retained.
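
For example, as a sketch with the library loaded:

```forth
$" redgreenblue"
4 right$ .$         \ displays: blue
```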

FINDC$ ( char -- pos|-1 ) ( ss: haystack -- haystack ) “find character in string”

The word findc$ returns the position of the first occurrence of the character char, beginning at the left side of the topmost string, with the search proceeding towards the right. If the character is not found, -1 is returned.
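
A sketch of a search for the letter b, assuming the library is loaded. The character is given here by its ASCII code to avoid relying on any particular character-literal word:

```forth
$" redgreenblue"
98 findc$ .         \ 98 is the ASCII code for b
```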

Displays the value 8, as the character b is found in character position 8 (where the first character is character 0). The stack contents are retained for convenience to allow further searches to be performed if desired.

FIND$ ( start -- pos|-1 ) ( ss: haystack needle -- haystack ) “find string in string”

The word find$ searches the second string on the string stack, starting from position start, for the first occurrence of the topmost string and pushes its position, relative to start, to the data stack. If the substring is not found, -1 is returned. As the source listing shows, the searched string is retained on the string stack, while the substring being searched for is consumed.
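
A sketch of a search from the start of the string, assuming the library is loaded:

```forth
$" redgreenbluegreen"    \ the string to search
$" green"                \ the substring to look for
0 find$ .                \ search from character 0
```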

Displays the value 3, as the substring is found at character position 3 (the leftmost character being character 0). The string “redgreenbluegreen” remains on the string stack, so the second instance of “green” could be found with a further search if desired.

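REPLACE$ ( -- pos ) ( ss: found: s1 s2 s3 -- s4 | not found: s1 s2 s3 -- s1 s2 ) “replace string”

The word replace$ searches string s2 for the first occurrence of string s3. If it is found, s1, s2 and s3 are replaced on the string stack by s4, a copy of s2 in which that occurrence of s3 has been replaced with s1. If it is not found, s3 is removed and s1 and s2 remain on the string stack.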

.$ ( -- ) ( ss: str -- ) “display string”

The word .$ removes the topmost string from the string stack and displays it.

REV$ ( -- ) ( ss: s1 -- s2 ) “reverse string”

The word rev$ replaces the topmost string on the string stack with its reversed equivalent.
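
For example, as a sketch with the library loaded:

```forth
$" red" rev$ .$     \ displays: der
```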

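LTRIM$ ( -- ) ( ss: str1 -- str2 ) “trim left of string”

The word ltrim$ removes leading spaces from the topmost string.

RTRIM$ ( -- ) ( ss: str1 -- str2 ) “trim right of string”

The word rtrim$ removes trailing spaces from the topmost string.

TRIM$ ( -- ) ( ss: str1 -- str2 ) “trim string”

The word trim$ removes both leading and trailing spaces from the topmost string.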
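
A sketch using a string with three trailing spaces, assuming the library is loaded:

```forth
$" hello   "        \ 8 characters: hello plus three spaces
trim$               \ the trailing spaces are removed
len$ .              \ displays: 5
```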

UCASE$ ( -- ) ( ss: str1 -- str2 ) “convert to upper case”

The word ucase$ converts all lower case characters in the topmost string to upper case.

LCASE$ ( -- ) ( ss: str1 -- str2 ) “convert to lower case”

The word lcase$ converts all upper case characters in the topmost string to lower case.
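
Both conversions in a short sketch, assuming the library is loaded. Characters that are not letters are left unchanged:

```forth
$" Hello, World!" ucase$ .$    \ displays: HELLO, WORLD!
$" Hello, World!" lcase$ .$    \ displays: hello, world!
```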

==$? ( -- flag ) ( ss: -- ) “is equal to string?”

The word ==$? performs a case-sensitive comparison of the topmost two strings on the string stack and returns true if both their lengths and contents are identical. If the lengths or the contents differ, false is returned. The strings are retained.
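
For example, as a sketch with the library loaded:

```forth
$" hello" $" HELLO"    \ same letters, different case
==$? .
```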

Displays 0 (false) since the strings are different (the comparison is case sensitive).
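
A case-insensitive comparison can be built from words already described. The following sketch assumes that two strings, s1 and s2, are already on the string stack:

```forth
over$ lcase$    \ ss: s1 s2 s1'       lower-case copy of s1
over$ lcase$    \ ss: s1 s2 s1' s2'   lower-case copy of s2
==$?            \ compare the two lower-case copies, flag to data stack
drop$ drop$     \ ss: s1 s2
```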

The above code creates copies of s1 and s2 (using over$) then converts them both to lower case. ==$? then compares the strings, placing the appropriate flag on the data stack. Finally, the lower-case versions of s1 and s2 are removed from the string stack, thus s1 and s2 are retained, unchanged.

PICK$ ( index -- ) ( ss: -- str ) “pick string”

Given the index of a string on the string stack, copy the indexed string to the top of the string stack. 0 pick$ is equivalent to DUP$, 1 pick$ is equivalent to OVER$, etc.
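
The strings and index in this sketch are chosen for illustration, assuming the library is loaded:

```forth
$" blue" $" green" $" red"    \ ss: blue green red   (red is index 0)
2 pick$                       \ ss: blue green red blue
```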

The above causes the string “blue” to be copied to the top of the string stack.

VAL$ ( -- n ) ( ss: str -- )

The word VAL$ interprets the topmost string on the string stack as a number, and returns it on the data stack as an integer. An error occurs if the string cannot be represented as a number.
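
A sketch of a simple conversion, assuming the library is loaded and the number base is decimal:

```forth
$" 9900" val$ .     \ displays: 9900
```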

Note that a double value can be returned by pre-pending the number within the string with a period.

$.S ( -- ) ( ss: -- )

The word $.s displays a non-destructive string stack dump to the output device. The length of each string is given, along with the total number of strings on the string stack. The amount of space allocated to the string stack, the amount of space in use, and the amount of free space is also reported.
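
The source listing ends with a similar test. As a sketch, assuming the library is loaded:

```forth
reset$                        \ start from an empty string stack
$" RED" $" GREEN" $" BLUE"    \ push three strings
$.s                           \ dump them without removing them
depth$ .                      \ displays: 3
```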

DEPTH$ ( -- n ) ( ss: -- )

Returns the current depth of the string stack, with 0 meaning the string stack is empty.

RESET$ ( -- ) ( ss: -- )

Resets the string stack, leaving it empty.
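
For example, as a sketch with the library loaded:

```forth
$" red" $" blue"    \ two strings on the string stack
reset$              \ discard everything
depth$ .            \ displays: 0
```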

The String Stack

The string stack is ALLOTted from dictionary space. The constant ($sSize) determines the amount of space reserved.

Error Checking

Error checking is included in all words that could cause a string stack underflow or overflow. In the event that an underflow or overflow is detected, the code aborts with an error message.

Other words such as DUP$ also perform checks. For example, DUP$ checks that there is at least one item on the string stack. SWAP$ checks that there are at least two items on the string stack, etc.
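
A sketch of such a check being triggered, assuming the library is loaded:

```forth
reset$    \ make sure the string stack is empty
dup$      \ nothing to duplicate: aborts with "String stack is empty"
```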

String Stack Format

String Constant Format

String Constants have the same format, but are preceded by a maximum length cell in order to check that a requested string can be accommodated within the string constant:
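
The layout below is a sketch inferred from the definitions of $const, clen$ and ($") in the source listing, assuming TurboForth's 2-byte cells. It should not be read as the author's own diagram:

```forth
\ Transient string (on the string stack):
\   +0  length cell            number of characters in the string
\   +2  characters             padded to an even number of bytes
\
\ String constant (address returned by its name, $Caddr):
\   +0  maximum length cell    read by maxLen$
\   +2  actual length cell     read by clen$
\   +4  characters
```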

Throw Codes

The words in the library perform sanity checks on input parameters where necessary. In particular, the string stack, being statically ALLOTted from dictionary space, is carefully guarded, since the string stack is very likely to have code and/or data on either side of it, resulting in catastrophic software failure in the event of a string stack underflow or overflow. Where errors are detected, the library throws the following THROW codes:

It should be noted that this author has not checked that the THROW codes listed here are used in other systems or libraries elsewhere.

Throw Code	Nature of Error	Thrown By
9900	String stack underflow	(set$SP)
9901	String too large to assign	:="
9902	String stack is empty	PICK$ DUP$ LEN$ >$CONST MID$ LEFT$ RIGHT$ FINDC$ .$ REV$ LTRIM$ RTRIM$ UCASE$ LCASE$ DROP$
9903	Need at least 2 strings on string stack	SWAP$ NIP$ OVER$ +$ FIND$ ==$?
9904	String too large for string constant	>$CONST
9905	Illegal LEN value	MID$ LEFT$ RIGHT$
9906	Need at least 3 strings on string stack	ROT$ -ROT$ REPLACE$
9907	String is not a legal number	VAL$
9908	Illegal start value	MID$

Dependencies

Word	ANS Library	ANS Reference	Referenced In
-ROT	Non-ANS. Defined as follows: : -ROT ( a b c -- c a b ) ROT ROT ;		:="
.R	Core Ext	6.2.0210	$.S
HERE	Core	6.1.1650	SWAP$ +$ REV$ LTRIM$ REPLACE$
PARSE	Core Ext	6.2.2008	:=" $"
PICK	Core Ext	6.2.2030	FINDC$
WITHIN	Core Ext	6.2.2440	UCASE$ LCASE$

Author Information

The library was developed by Mark Wills in February 2014. The code is hereby released to the public domain. The author can be contacted by email via: markwills1970@gmail.com. Please also see the acknowledgements section for further attribution information.

Portable String Library Source Code

\ Portable, Stack Based String Library for TurboForth V1.2
\ Version 1.0 - Mark Wills February 2014.
\ Based on a string stack concept developed by Brian Fox circa 1988.

base @ \ save system's current number base
decimal

256 \ maximum string stack size in bytes.
\ Adjust to your own needs. Choose a value that is a multiple 
\ of your system's cell size.
constant ($sSize)          \ store stack size
here ($sSize) allot        \ reserve space for string stack 
constant ($sEnd)           \ bottom of string stack
variable ($sp)             \ pointer to top of string stack
($sEnd) ($sSize) + ($sp) ! \ initialise it
variable ($depth)          \ count of items on the string stack
variable ($temp0)          \ reserved for internal use
variable ($temp1)          \ reserved for internal use
variable ($temp2)          \ reserved for internal use
variable ($temp3)          \ reserved for internal use

\ Throw codes used by this library:
: (throw) ( code -- )
    case 
        9900 of ." String stack underflow" endof
        9901 of ." String too large to assign" endof 
        9902 of ." String stack is empty" endof 
        9903 of ." Need at least 2 strings on string stack" endof
        9904 of ." String too large for string constant" endof 
        9905 of ." Illegal LEN value" endof 
        9906 of ." Need at least 3 strings on string stack" endof 
        9907 of ." String is not a legal number" endof
        9908 of ." Illegal start value" endof
    endcase
    cr abort ;

: ($depth+) ( -- )
    \ increments the string stack item count
    1 ($depth) +! ;

: ($sp@) ( -- addr ) ($sp) @ ;

: ($rUp) ( n -- n|n+1)
    \ rounds n up to the next even value
    1+ -2 and ;

: cell+ ( n -- n+2) compile 2+ ; immediate

: (sizeOf$) ( $addr - $size)
    \ given an address of a transient string, compute the stack
    \ size in bytes required to hold it, rounded up to the
    \ nearest even cell size, and including the length cell.
    @ ($rUp) cell+ ;

: (set$SP) ( $size -- ) 
    \ given the stack size of a transient string set the string
    \ stack pointer to the new address required to accommodate it.
    negate dup ($sp@) + ($sEnd) < if 9900 (throw) then 
    ($sp) +! ;
    
: (addrOf$) ( index -- addr )
    \ given an index into the string stack, return the start
    \ address of the string. addr points to the length cell.
    \ topmost string is index 0
    \ next string is index 1 and so on
    ($sp@) swap dup if 0 do 
        dup (sizeOf$) + loop else drop then ;
    
: (lenOf$) ( $addr -- len )
    \ given the address of a transient string on the string
    \ stack (the address of the length cell), return the length
    \ of the string.
    state @ if compile @ else @ then ; immediate
    
: reset$ ( -- ) ( "reset i.e. empty the string stack")
    0 ($depth) !  ($sEnd) ($sSize) + ($sp) ! ;

: depth$ ( -- $sDepth) \ "depth of string stack"
    \ returns the current depth of the string stack.
    ($depth) @ ;

: $const ( max_len tib:"name" -- ) ( runtime: -- $Caddr) \ "string constant"
    \ creates a string constant
    \ when name is referenced the address of the max_len field 
    \ is pushed to the stack.
    \ e.g. 100 $const fred \ create a string constant called fred
    create  dup ( max_len) , ( actual_len) 0 ,  allot align ;
    
: clen$ ( $Caddr -- len ) \ "string constant length"
    \ given the address of a string constant, returns its 
    \ length.
    cell+ @ ;
    
: maxLen$ ( $Caddr -- max_len ) \ "string constant maximum length"
    \ given the address of a string constant, returns its 
    \ maximum length
    (lenOf$) ;

: .$const ( $Caddr -- ) \ "display string constant"
    \ displays the string constant. e.g. fred .$const
    cell+ dup (lenOf$) swap cell+ swap type ;
    
: :=" ( $Caddr tib:"string" -- ) \ "assign string constant"
    \ assigns the string "string" to the string constant
    \ e.g. fred :=" hello mother!"
    dup @ ascii " word swap >r  2dup < if 9901 (throw) then
    nip 2dup swap cell+ !
    >r [ 2 cells ] literal + r> r> -rot cmove ;

: ($") ( addr len -- ) ( ss: -- str )
    \ run-time action for $" (see below)
    dup ($rUp) cell+ (set$SP)
    dup ($sp@) !  ($sp@) cell+ swap cmove  ($depth+) ;

: $" ( tib:"string" -- ) ( ss: -- str) \ "string to string stack"
    \ pushes a string directly to the string stack
    \ e.g. $" hello world" .$
    [compile] s"  state @ if compile ($") else ($") then ; immediate

: >$ ( $Caddr -- ) ( ss: -- str) \ "string constant to string stack"
    \ moves a string constant to the string stack
    \ e.g. fred >$
    cell+ dup (lenOf$) swap cell+ swap ($") ;

: pick$ ( n -- ) ( ss: -- strN) \ "pick string"
    \ given an index into the string stack, copy the indexed
    \ string to the top of the string stack.
    \ 0 pick$ is equivalent to dup$
    \ 1 pick$ is equivalent to over$ etc.
    depth$ 0= if 9902 (throw) then 
    (addrOf$) dup (lenOf$) swap cell+ swap ($") ;

: dup$ ( -- ) ( ss: s1 -- s1 s1) \ "duplicate string"
    \ duplicates a string on the string stack
    depth$ 0= if 9902 (throw) then  0 pick$ ;

: drop$ ( -- ) ( ss: str -- ) \ "drop string"
    \ drops the top string from the string stack
    depth$ 0= if 9902 (throw) then
    ($sp@) (sizeOf$) negate (set$SP)   -1 ($depth) +! ;
    
: swap$ ( -- ) ( ss: s1 s2 -- s2 s1) \ "swap strings"
    \ swaps the top two string items on the string stack
    depth$ 2 < if 9903 (throw) then 
    ($sp@) dup (sizeOf$) here swap cmove
    1 (addrOf$) dup (sizeOf$) ($sp@) swap cmove
    here dup (sizeOf$)  ($sp@) dup (sizeOf$) + swap cmove ;

: nip$ ( -- ) ( ss: s1 s2 -- s2) \ "nip strings"
    \ remove the string under the top string
    depth$ 2 < if 9903 (throw) then  swap$ drop$ ;
    
: over$ ( -- ) ( ss: s1 s2 -- s1 s2 s1) \ "over string"
    \ move a copy of s1 to top of string stack
    depth$ 2 < if 9903 (throw) then  1 pick$ ;
    
: (rot$) ( -- ) ( ss: s6 s5 s4 s3 s2 s1 -- s3 s2 s1)
    ( internal factor of rot$ and -rot$. See below. )
    ( source:) ($sp@)  ( destination:) 3 (addrOf$)
    ( #bytes to move: ) 
    ($sp@) (sizeOf$)   1 (addrOf$) (sizeOf$)   2 (addrOf$) (sizeOf$) + + 
    ( move s1 to s3 into the space occupied by s4 to s6:) CMOVE
    ( adjust string stack pointer:) 3 (addrOf$) ($sp) !  -3 ($depth) +! ;

: rot$ ( -- ) ( ss: s3 s2 s1 -- s2 s1 s3) \ "string rotate left"
    \ rotates the top three strings to the left.
    depth$ 3 < if 9906 (throw) then 
    1 pick$  1 pick$  4 pick$ (rot$) ;

: -rot$ ( -- ) ( ss: s3 s2 s1 -- s1 s3 s2) \ "string rotate right"
    \ rotates the top three strings to the right.
    depth$ 3 < if 9906 (throw) then
    0 pick$  3 pick$  3 pick$ (rot$) ;
    
: len$ ( -- len ) ( ss: -- ) \ "length of string"
    \ returns the length of the topmost string.
    depth$ 1 < if 9902 (throw) then  ($sp@) @ ;

: >$const ( $Caddr -- ) ( ss: str -- ) \ "to string constant"
    \ move top of string stack to the string constant
    \ e.g. $" blue" fred >$const  fred .$const 
    \ displays "blue"    
    >r  depth$ 1 < if 9902 (throw) then
    len$ r@ @ > if 9904 (throw) then
    ($sp@) dup (sizeOf$) r> cell+ swap cmove drop$ ;

: +$ ( -- ) ( ss: s1 s2 -- s1+s2) \ "concatenate strings"
    \ replaces the top most two strings on the string stack
    \ with their concatenated equivalent.
    \ eg: $" red" $" blue" +$ .$
    \ displays "redblue"
    depth$ 2 < if 9903 (throw) then 
    1 (addrof$) cell+  here   1 (addrof$) (lenof$)  cmove
    ($sp@) cell+   1 (addrof$) (lenof$) here +  len$ cmove
    here len$ 1 (addrof$) (lenof$) +  drop$ drop$  ($") ;

: mid$ ( start len -- ) ( ss: str1 -- str1 str2) \ "mid string"
    \ the characters from start to start+len are pushed to the string stack 
    \ as a new string. the original string is retained.
    depth$ 1 < if 9902 (throw) then 
    dup len$ >  over 1 < or  if 9905 (throw) then
    over dup len$ >  swap 0< or if 9908 (throw) then 
    swap ($sp@) cell+ +  swap  ($") ;

: left$ ( len -- ) ( ss: str1 -- str1 str2) \ "left string"
    \ the leftmost len characters are pushed to  the string 
    \ stack as a new string. The original string is retained.
    depth$ 1 < if 9902 (throw) then 
    dup len$ > over 1 < or if 9905 (throw) then 
    0 ($sp@) cell+ +  swap  ($") ;
   
: right$ ( len -- ) ( ss: str1 -- str1 str2) \ "right string"
    \ the rightmost len characters, pushed to the string stack
    \ as a new string. the original string is retained.
    depth$ 1 < if 9902 (throw) then 
    dup len$ > over 1 < or if 9905 (throw) then 
    ($sp@) (lenOf$) over - ($sp@) cell+ +  swap  ($") ;

: findc$ ( char -- pos|-1 ) ( ss: -- ) ( "find char")
    ( returns the first occurrence of the character char in )
    ( the top string. The string is retained. )
    ( returns -1 if the char is not found )
    depth$ 1 < if 9902 (throw) then
    -1 ($temp0) ! ( assume not found )
    ($sp@) cell+  ($sp@) (lenOf$) 0 do
        dup c@ 2 pick = if i ($temp0) ! leave then 1+ loop
    drop drop ($temp0) @ ;

: find$ ( offset -- pos|-1 ) ( ss: s1 s2 -- s1) \ "find string"
    \ searches string s1, beginning at offset, for the substring s2.
    \ if the string is found, returns the position of the string relative
    \ to the offset, otherwise returns -1. s2 is dropped on completion.
    depth$ 2 < if 9903 (throw) then 
    len$ ($temp1) !    1 (addrOf$) (lenOf$) ($temp0) !
    dup ($temp0) @ > if drop -1 exit then 
    1 (addrOf$) cell+ + ($temp2) !    ($sp@) cell+ ($temp3) !
    ($temp1) @ ($temp0) @ > if drop -1 exit then 
    0  ($temp0) @ 0 do
        ($temp3) @ over + c@ 
        ($temp2) @ i + c@ = if
            1+ dup ($temp1) @ = if 
                drop i ($temp1) @ - 1+   -2 leave then 
        else drop 0 then
    loop 
    dup -2 = if drop else drop -1 then drop$ ;

: .$ ( -- ) ( ss: str -- ) \ "display string"
    \ pop and display string from string stack
    depth$ 0= if 9902 (throw) then 
    ($sp@) cell+ ($sp@) (lenOf$) type  drop$ ;
    
: rev$ ( -- ) ( ss: s1 -- s2 ) \ "reverse string"
    \ reverse top string on string stack.
    depth$ 0= if 9902 (throw) then 
    ($sp@) dup cell+ >r  (lenOf$)  r> swap here swap cmove 
    ($sp@) (lenOf$) here 1- +
    ($sp@) cell+  dup ($sp@) (lenOf$) +   swap do
        dup c@ i c!  1- loop  drop ;

: ltrim$ ( -- ) ( ss: s1 -- s2 ) \ "left trim string"
    \ removes leading spaces from s1, resulting in s2.
    depth$ 0= if 9902 (throw) then  
    ($sp@) dup (lenOf$) >r  here over (sizeOf$)  cmove
    0  r> here cell+ dup >r +  r> do
        i c@ bl = if 1+ else leave then loop 
    dup 0> if 
        >r  ($sp@) (lenOf$)  drop$
        here cell+ r@ +  swap r> -  ($")
    else drop then ;

: rtrim$ ( -- ) ( ss: s1 -- s2 ) \ "right trim string"
    \ removes trailing spaces from s1, resulting in s2.
    depth$ 0= if 9902 (throw) then  rev$ ltrim$ rev$ ;

: trim$ ( -- ) ( ss: s1 -- s2 ) \ "trim string"
    \ remove both leading and trailing spaces from s1, 
    \ resulting in s2.
    rtrim$ ltrim$ ;

: replace$ ( -- pos ) \ "replace string"
    \ ( found: ss: s1 s2 s3 -- s4  not found: s1 s2 -- s1 s2)
    depth$ 3 < if 9906 (throw) then
    len$ >r
    0 find$ dup ($temp0) ! -1 > if
        ($sp@) cell+  here  ($temp0) @ cmove  
        1 (addrOf$) cell+   here ($temp0) @ +  
        1 (addrOf$) (lenof$) cmove
        ($sp@) cell+ ($temp0) @ + r@ +    
        here ($temp0) @ + 1 (addrOf$) (lenof$) +
        len$ r> - ($temp0) @ -  dup >r  cmove
        r> ($temp0) @ + 1 (addrOf$) (lenof$) +
        drop$ drop$ here swap ($")
    else r> drop ($temp0) @ then ;

: ucase$ ( -- ) ( ss: str -- STR) \ "to upper case"
    \ on the topmost string, converts all lower case characters
    \ to upper case.
    depth$ 1 < if 9902 (throw) then
    ($sp@) dup (lenOf$) + cell+  ($sp@) cell+  do
       i c@ dup [ char a ] literal  [ char { ] literal within if 
            32 -  i c! else drop then loop ;

: lcase$ ( -- ) ( ss: STR -- str) \ "to lower case"
    \ on the topmost string, converts all upper case characters
    \ to lower case.
    depth$ 1 < if 9902 (throw) then 
    ($sp@) dup (lenOf$) + cell+  ($sp@) cell+  do
       i c@ dup [ char A ] literal  [ char [ ] literal within if 
            32 +  i c! else drop then loop ;

: ==$? ( -- flag ) ( ss: -- ) \ "are strings equal?"
    \ performs a case-sensitive comparison of the topmost 
    \ two strings on the string stack, returning true if their 
    \ length and contents are identical, otherwise returning 
    \ false.
    depth$ 2 < if 9903 (throw) then 
    len$  1 (addrOf$) (lenOf$) = if
        1 (addrOf$) cell+ \ point to first char of string 1
        ($sp@) cell+  dup len$ + swap  do
            dup c@  i c@  <> if drop false leave then 1+ loop
        dup if drop true then 
    else false then ;
   
: val$ ( -- ud ) ( ss: str -- ) \ "value of string"
    \ interprets the topmost string as an integer number, returning its
    \ value on the data stack as an integer.
    \ Note that a string value can be converted to a double by pre-pending
    \ the number with a period. E.g. $" .9900" VAL$ 
    ($sp@) dup (lenOf$) swap cell+ swap 
    number if 9907 (throw) then drop$ ;

: $.s ( -- ) ( ss: -- ) \ "display string stack"
    cr  depth$ 0> if
        ($sp@)  depth$
        ."  Index|Length|String" cr
        ." ------+------+------" cr 
        0 begin
            depth$ 0> while
                dup 5 .r ." |" len$ 5 .r  ." |" .$  1+ cr
        repeat  drop
        ($depth) !  ($sp) !  cr
    else
        ." String stack is empty." cr
    then
    ." Allocated stack space:"
    ($sEnd) ($sSize) + ($sp@) - 4 .r ."  bytes" cr
    ."     Total stack space:"
    ($sSize) 4 .r ."  bytes" cr
    ." Stack space remaining:" 
    ($sp@) ($sEnd) - 4 .r ."  bytes" cr ;

base ! \ restore system's current number base

$" RED" $" GREEN" $" BLUE" $.S

27th of February 2014
Updated 20th April 2021 - Corrected stack comments in FINDC$ and FIND$.

