WorkPlace Shell Programming In Assembler/2: Difference between revisions
mNo edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
Original Work by [[Micho Durdevich]] | Original Work by [[Micho Durdevich]] | ||
==Introduction=== | |||
In this article we are going to highlight the most important steps in creating WorkPlace | |||
Shell objects using the machine language paradigm. The only tools that are necessary to | Shell objects using the machine language paradigm. The only tools that are necessary to | ||
build the corresponding | build the corresponding dynamic link libraries are: the Watcom assembler (WASM), the linker program (WLINK) and the resource compiler (RC). The IBM Toolkit/2 should be installed, too. | ||
dynamic link libraries are: the Watcom assembler (WASM), the linker program (WLINK) and the resource | |||
compiler (RC). The IBM Toolkit/2 should be installed, too. | |||
At a first sight it might appear silly to try WorkPlace Shell in assembler: As is well known, | |||
of | the OS/2 graphical user interface is built on completely object-oriented grounds of SOM by IBM (System Object Model). The WPS design is truly unique, and establishes a "universe of objects" on its own. Traditionally, it is thought that such object-oriented programming can be formulated only within appropriate so-called high-level languages (like C or C++), which | ||
somehow contain the philosophy of objects implicitly or explicitly built in their syntax and semantic rules. In our opinion, this view is entirely wrong: The object orientation has not much to do with the programming language, but with the general vision of the programming model. | |||
As we already mentioned in the introduction to this series, assembly language is really interpretable as the highest level programming language, if we adopt the viewpoint of the language expressive power. Therefore, in principle it should be possible to express any idea written in a "high-level" language (like C or C++) directly in terms of assembler. Of course, such a conversion might not be nice or easy at all, and in order to get a meaningful assembly-level result it might be necessary to rewrite the entire execution environment ensuring the existence of the objects in question. As an example of such a poorly designed system from the assembly viewpoint, we can mention a charming Qt toolkit by Trolltech. | |||
Fortunately, in the case of WPS and SOM for OS/2, thanks to a high internal simplicity and elegance of the API set, we are able to proceed directly with assembly language programming without having to change the internals of the universe of objects. | |||
==The structure of a typical WPS class library== | |||
entry procedure | ===Initialization Structures=== | ||
At first, let us mention that every WPS class DLL should possess the following ''entry procedure'' declared as public and starting point: | |||
<code> | |||
public _dllentry | |||
_dllentry proc | _dllentry proc | ||
mov eax, 1 | mov eax, 1 | ||
ret | ret | ||
_dllentry endp | _dllentry endp | ||
</ | </code> | ||
Next, there are initialization routines for global (class-level) and local (instance) attributes. Here is the corresponding code, concretized to the case the object in question is derived from WPDataFile class (taken from our first sample, see below). During its startup, WorkPlace shell will execute the {xxx}NewClass (local attributes) routine for every registered class. | |||
(instance) attributes. Here is the corresponding code, concretized to the case the object | This routine, in its turn, calls the global "sister" routine. Global symbols are almost always prefixed with "M_". | ||
in question is derived from WPDataFile class (taken from our first sample, see below). During its | <code> | ||
startup, WorkPlace shell will execute the {xxx}NewClass (local attributes) routine for every registered class. | public myObjectNewClass | ||
This routine, in its turn, calls the global "sister" routine. Global symbols are almost always prefixed | myObjectNewClass proc near | ||
with "M_". < | |||
myObjectNewClass proc near | |||
push ebp | push ebp | ||
mov ebp,esp | mov ebp,esp | ||
push ecx | push ecx | ||
push 0x00000002 | push 0x00000002 | ||
push 0x00000001 | push 0x00000001 | ||
Line 79: | Line 52: | ||
mov ecx, num_parentmethods_obj | mov ecx, num_parentmethods_obj | ||
@obj_numresolve_loop: | @obj_numresolve_loop: | ||
mov eax, obj_parentmethods_table[ecx*4] | mov eax, obj_parentmethods_table[ecx*4] | ||
push ecx | push ecx | ||
Line 89: | Line 62: | ||
call somParentNumResolve | call somParentNumResolve | ||
add esp, 0x0000000C | add esp, 0x0000000C | ||
pop ecx | pop ecx | ||
mov obj_parentmethods_table[ecx*4], eax | mov obj_parentmethods_table[ecx*4], eax | ||
loop @obj_numresolve_loop | loop @obj_numresolve_loop | ||
Line 100: | Line 73: | ||
pop ebp | pop ebp | ||
ret | ret | ||
myObjectNewClass endp | myObjectNewClass endp | ||
</ | </code> | ||
The global sister routine is a little simpler... Let us also observe that in the loops involving somParentNumResolve, we had to push/pop the loop counting ecx register. This was necessary because the register is not preserved across the call to this particular API. In general, return codes of OS/2 APIs are stored in the eax register, but many of the OS/2 APIs do not care much about other registers like ebx, ecx and edx. This property should be carefully | |||
we had to push/pop the loop counting ecx register. This was necessary because the register is not preserved across | taken into account. In future versions of eComStation, we will be systematically replacing the APIs so that all non-return-type registers are preserved across the OS calls (it is worth mentioning here that FreeBSD kernel fully complies with this important property). | ||
the call to this particular API. In general, return codes of OS/2 APIs are stored in the eax register, but many | <code> | ||
of the OS/2 APIs do not care much about other registers like ebx, ecx and edx. This property should be carefully | public M_myObjectNewClass | ||
taken into account. In future versions of eComStation, we will be systematically | M_myObjectNewClass proc | ||
replacing the APIs so that | |||
all non-return-type registers are preserved across the OS calls (it is worth mentioning here that FreeBSD kernel fully | |||
complies with this important property).< | |||
M_myObjectNewClass proc | |||
push ebp | push ebp | ||
mov ebp, esp | mov ebp, esp | ||
push ecx | push ecx | ||
push 0x00000002 | push 0x00000002 | ||
push 0x00000001 | push 0x00000001 | ||
call M_WPDataFileNewClass | call M_WPDataFileNewClass | ||
add esp, 0x8 | add esp, 0x8 | ||
push dword [ebp+0xC] | push dword [ebp+0xC] | ||
push dword [ebp+8] | push dword [ebp+8] | ||
Line 134: | Line 101: | ||
mov ecx, num_parentmethods_cls | mov ecx, num_parentmethods_cls | ||
@cls_numresolve_loop: | @cls_numresolve_loop: | ||
mov eax, cls_parentmethods_table[ecx*4] | mov eax, cls_parentmethods_table[ecx*4] | ||
push ecx | push ecx | ||
push dword M_WPObjectClassData[eax] | push dword M_WPObjectClassData[eax] | ||
push 0x00000001 | push 0x00000001 | ||
Line 147: | Line 114: | ||
pop ecx | pop ecx | ||
mov cls_parentmethods_table[ecx*4], eax | mov cls_parentmethods_table[ecx*4], eax | ||
loop @cls_numresolve_loop | loop @cls_numresolve_loop | ||
mov eax, somclass_id_cls | mov eax, somclass_id_cls | ||
Line 155: | Line 122: | ||
pop ebp | pop ebp | ||
ret | ret | ||
M_myObjectNewClass endp | M_myObjectNewClass endp | ||
</ | </code> | ||
===Object & Class Data=== | |||
Let us have a look at important data structures figuring in the initialization calls of the previous section. At first, we have cls_parentmethods_table and obj_parentmethods_table. | |||
These data structures are linear lists of dwords of the form | |||
<tt>parent_wpsmethod dd token_wpsmethod</tt> | |||
At the beginning of each list, we have a static variable num_parentmethods_cls and | |||
num_parentmethods_obj respectively. During the class initialization, original token_wpsmethod | num_parentmethods_obj respectively. During the class initialization, original token_wpsmethod | ||
values (from the class in which these methods are originally introduced) are replaced by the | values (from the class in which these methods are originally introduced) are replaced by the | ||
flat addresses of the corresponding parent methods. In this way, the selected parent methods | flat addresses of the corresponding parent methods. In this way, the selected parent methods | ||
become available to our objects. | become available to our objects. | ||
Next interesting data structure is 8 bytes long and suffixed by ExtraData string. In the | |||
listings above, we have 2 of them: myObjectExtraData and M_myObjectExtraData. The first dword is reserved | listings above, we have 2 of them: myObjectExtraData and M_myObjectExtraData. The first dword is reserved for the corresponding ''parent method table list'' address. It is filled during the processing of somBuildClass. The second dword is filled out with the address of the | ||
for the corresponding | ''object data retriever'' routine with the help of which we can access global and local | ||
somBuildClass. The second dword is filled out with the address of the | |||
variables the object is using. A typical call to this routine would be: | variables the object is using. A typical call to this routine would be: | ||
<code> | |||
< | |||
push somSelf | push somSelf | ||
call WPSObjExtraData[4] | call WPSObjExtraData[4] | ||
Line 187: | Line 146: | ||
mov somThis, eax | mov somThis, eax | ||
</ | </code> | ||
After this, variable somThis contains the flat offset of the allocated global or local data. Almost always, the somSelf pointer is simply given by | |||
<tt>somSelf = dword [ebp+0x00000008]</tt> | |||
Finally, the only exported data structures are global and local ClassData-dwords. They are filled with the addresses of the associated ''token method tables'' (during somBuildClass | |||
processing). This allows other objects to gain access to the specific methods of our object. | |||
===Object & Class Tables=== | |||
(corresponding to global and local aspects, as always). We shall call them class_information_object and | There are 2 critical complex data structures that "coordinate" all aspects of a given WPS class (corresponding to global and local aspects, as always). We shall call them class_information_object and class_information_global. They are passed as parameters to somBuildClass calls. | ||
class_information_global. They are passed as parameters to somBuildClass calls. | Both are incarnations of a fundamental SOMClassInformation structure, which is defined below. Non-applicable parameters are usually left zero, when instantiating the structure. | ||
Both are incarnations of a fundamental SOMClassInformation structure, which is | <code> | ||
defined below. Non-applicable parameters are usually left zero, when | SOMClassInformation struct | ||
instantiating the structure. | |||
< | |||
somVersion dd 4 | somVersion dd 4 | ||
numStaticMethods dd 0 ; Number of fixed internal methods | numStaticMethods dd 0 ; Number of fixed internal methods | ||
Line 251: | Line 201: | ||
directToSOMClass dd 0 | directToSOMClass dd 0 | ||
dynamicallyComputed dd 0 | dynamicallyComputed dd 0 | ||
SOMClassInformation ends | SOMClassInformation ends | ||
</ | </code> | ||
In the above structure, we can see pointers to several other important data objects. At first, we see of2ClassName and of2ClassMeta, they have the 2-fold pointer form | |||
<code> | |||
and of2ClassMeta, they have the 2-fold pointer form< | class_name_off dd flat:class_name class_meta_off dd flat:class_meta | ||
class_name db "the-name-of-the-class", 0 class_meta db "meta-class-name", 0 | |||
class_name db "the-name-of-the-class", 0 class_meta db "meta-class-name", 0 | </code> | ||
</ | In the case of class_information_global, the field for of2ClassMeta is always zero (no meta^2)! Another interesting entry corresponds to of3ParentName. | ||
It is a 3-fold pointer, realized as follows (global/local): | |||
<code> | |||
zero (no meta^2)! Another interesting entry corresponds to of3ParentName. | parent_name_@ff dd flat:parent_name_off parent_meta_@ff dd flat:parent_meta_off | ||
It is a 3-fold pointer, realized as follows (global/local):< | parent_name_off dd flat:parent_name parent_meta_off dd flat:parent_meta | ||
parent_name db "the-name-of-the-parent", 0 parent_meta db "the-meta-parent_name", 0 | |||
parent_name_off dd flat:parent_name parent_meta_off dd flat:parent_meta | </code> | ||
parent_name db "the-name-of-the-parent", 0 parent_meta db "the-meta-parent_name", 0 | Perhaps the most important SOMClassInformation entry is the pointer to method overrides table. This table has the form of the linear list of the pairs | ||
</ | <code> | ||
dd flat:@ff_parent_method_name | |||
dd flat:new_implementation_proc | |||
table has the form of the linear list of the pairs</ | </code> | ||
< | where we have again nice 3-fold pointers | ||
<code> | |||
@ff_parent_method_name dd flat:off_parent_method_name | |||
off_parent_method_name dd flat:str_parent_method_name | |||
str_parent_method_name db "classname::standard-wps-name", 0 | |||
</code> | |||
and new_implementation_proc is the procedure that overrides the method parent_method_name. | |||
Furthermore, let us examine another very important entry: tblStatic. It is a pointer to the table of static methods introduced by the given class. The static method table is a linear | |||
to the table of static methods introduced by the given class. The static method table is a linear | |||
listing of the following 6-fold entries, one for each method: | listing of the following 6-fold entries, one for each method: | ||
< | <code> | ||
dd flat:myobjectClassData[method_index] | |||
dd flat:@ff_methodNameBase | dd flat:@ff_methodNameBase | ||
dd flat:@ff_methodNameFull | dd flat:@ff_methodNameFull | ||
Line 291: | Line 237: | ||
dd flat:method_redispatch | dd flat:method_redispatch | ||
dd flat:method_applystub | dd flat:method_applystub | ||
</ | </code> | ||
where method_procedure is the procedure that implements the method, method_index is offset to the method entry in the ClassData structure, while @ff_methodNameBase and @ff_methodNameFull are nice 3-fold pointers entangled in the following structure (recommended, to avoid duplications): | |||
entry in the ClassData structure, while @ff_methodNameBase and @ff_methodNameFull are nice 3-fold pointers | <code> | ||
entangled in the following structure (recommended, to avoid duplications):< | dd str_methodNameFull: db "::myObject::" | ||
dd str_methodNameBase: db "method_name", 0 | dd str_methodNameBase: db "method_name", 0 | ||
dd off_methodNameBase dd flat:str_methodNameBase | dd off_methodNameBase dd flat:str_methodNameBase | ||
Line 301: | Line 246: | ||
dd off_methodNameFull dd flat:str_methodNameFull | dd off_methodNameFull dd flat:str_methodNameFull | ||
dd @ff_methodNameFull dd flat:off_methodNameFull | dd @ff_methodNameFull dd flat:off_methodNameFull | ||
</code> | |||
The last 2 dwords in the above 6-fold method table entry point to associated ''redispatch and apply'' stubs procedures. In the simplest scenario, these fields should be 0xFFFFFFF and 0x0000000 respectively (no redispatches/apply stubs). | |||
Related to static methods are also entries {numStaticMethods, numClassDataEntries, numMaxMethods}. | |||
When defining ClassData structures, enough space should be left to accommodate all method tokens, and the class information (the first entry). | |||
Finally, let us observe that the field instanceDataSize determines the amount of memory reserved for object data. It is exactly this memory area which is getting mapped by calling WPSObjExtraData[4] with somSelf as the unique argument. As we already mentioned, the result is the value of the somThis pointer. | |||
==Example A: Quantum Rectangles== | |||
===Object Description=== | |||
In the accompanying sample code, we are presenting a simple yet sufficiently illustrative WPS object, based on the random-rectangles PM program (discussed in detail within the | |||
WPS object, based on the random-rectangles PM program (discussed in | |||
PM-assembling article). We are constructing a child of WPDataFile object, displaying randomly | PM-assembling article). We are constructing a child of WPDataFile object, displaying randomly | ||
fluctuating rectangles as the default view. The random number generator routine is the same | fluctuating rectangles as the default view. The random number generator routine is the same | ||
as the one used in the PM example, based on a powerful multiply-with-carry | as the one used in the PM example, based on a powerful multiply-with-carry algorithm. However here we control the rectangles via a special timer (WinCreateTimer, WinSetTimer) while in the PM example it was a simple cyclic thread created by DosCreateThread. Our object also features | ||
algorithm. However here we control the rectangles via a special | |||
timer (WinCreateTimer, WinSetTimer) while in the PM example it was a simple cyclic thread created | |||
by DosCreateThread. Our object also features | |||
* A special settings page, controlling the state of the rectangles system: | |||
speed, stop/go. The mentioned settings page is introduced by overriding | speed, stop/go. The mentioned settings page is introduced by overriding | ||
wpAddBecomePage method; | wpAddBecomePage method; | ||
* A possibility to save the state via wpSaveDeferred method; | |||
* Modification of the object pop-up menu, so that the quantum rectangles view properly appears. | |||
All the samples are available at our download section. | |||
===How To Compile=== | |||
The creation of the class DLL is very simple, in 3 steps: assembling, linking | |||
and resource-compiling. Explicitly, | and resource-compiling. Explicitly, | ||
<code> | |||
< | wasm qr.asm | ||
wlink @qr | wlink @qr | ||
rc qr qr.dll | rc qr qr.dll | ||
</code> | |||
Here, the linking info is stored in the file qr.lnk and the resource info is within qr.rc | |||
file. Don't forget to check the size of the DLL :) | |||
In order to use the newly created class, it is necessary to register it: | |||
<code> | |||
/* Registering qRectangle class */ | |||
< | |||
call RxFuncAdd 'SysLoadFuncs', 'RexxUtil', SysLoadFuncs' | call RxFuncAdd 'SysLoadFuncs', 'RexxUtil', SysLoadFuncs' | ||
call SysLoadFuncs | call SysLoadFuncs | ||
if SysRegisterObjectClass(qRectangle, "QR") then | if SysRegisterObjectClass(qRectangle, "QR") then | ||
say "Okidoki!" | say "Okidoki!" | ||
else say "Oops, a problem :(" | else say "Oops, a problem :(" | ||
</ | </code> | ||
Object instances are created in the standard way. For example, | |||
<code> | |||
/* Creating an instance of qRectangle class */ | |||
call RxFuncAdd 'SysLoadFuncs', 'RexxUtil', SysLoadFuncs' | call RxFuncAdd 'SysLoadFuncs', 'RexxUtil', SysLoadFuncs' | ||
call SysLoadFuncs | call SysLoadFuncs | ||
Line 368: | Line 301: | ||
say "Object created succesfully!" | say "Object created succesfully!" | ||
else say "Oops, a problem :(" | else say "Oops, a problem :(" | ||
</ | </code> | ||
===Connecting PM and WPS stuff=== | |||
An interesting problem appears when interconnecting PM and WPS code. Since PM is not | |||
inherently aware of WPS object-related entities like somSelf and somThis, how to keep this information available | inherently aware of WPS object-related entities like somSelf and somThis, how to keep this information available | ||
to PM windows? The solution is to | to PM windows? The solution is to ''reserve the memory'' during the window creation, and | ||
then save the pointer to this reserved memory during the window initialization procedure. | then save the pointer to this reserved memory during the window initialization procedure. | ||
More precisely: | |||
* When calling WinRegisterClass, specify the amount of reserved memory for the WPS-related | |||
stuff. | stuff. | ||
* Create frame and client windows using WinCreateWindow function; | |||
* When creating the client window, pass the pointer to a reserved memory location | |||
containing the somSelf pointer, as an appropriate argument to WinCreateWindow; | containing the somSelf pointer, as an appropriate argument to WinCreateWindow; | ||
* During the processing of WM_CREATE message, setup a pointer to the reserved memory | |||
location, with the help of WinSetWindowPtr. | location, with the help of WinSetWindowPtr. | ||
< | Here are relevant pieces of code corresponding to the outlined methodology. At first, window creation. This is a fragment of the initialization procedure specified in the object-specific (overridden) version of wpOpen (the creation of the window in which a quantum-fluctuating rectangles will be displayed). | ||
<code> | |||
push HWND_DESKTOP | push HWND_DESKTOP | ||
call WinQueryAnchorBlock | call WinQueryAnchorBlock | ||
add esp, 0x00000004 | add esp, 0x00000004 | ||
mov hab, eax | mov hab, eax | ||
push 0x00000040 | push 0x00000040 | ||
push (CS_SIZEREDRAW+CS_SYNCPAINT) | push (CS_SIZEREDRAW+CS_SYNCPAINT) | ||
Line 406: | Line 332: | ||
call WinRegisterClass | call WinRegisterClass | ||
add esp, 0x00000014 | add esp, 0x00000014 | ||
push 0x00000000 ; At first, creating the frame window... | push 0x00000000 ; At first, creating the frame window... | ||
Line 424: | Line 350: | ||
add esp, 0x00000034 | add esp, 0x00000034 | ||
mov hwnd_frame, eax | mov hwnd_frame, eax | ||
push 0x00000000 ; Let us allocate some memory, for the use | push 0x00000000 ; Let us allocate some memory, for the use | ||
push 0x40 ; of various object parameters, that | push 0x40 ; of various object parameters, that | ||
Line 443: | Line 369: | ||
mov dword [eax][0x18], 0 ; ViewItem.hwndCnr | mov dword [eax][0x18], 0 ; ViewItem.hwndCnr | ||
mov dword [eax][0x1C], 0 ; ViewItem.pRecord | mov dword [eax][0x1C], 0 ; ViewItem.pRecord | ||
mov client_data, eax | mov client_data, eax | ||
; Now we are creating the client window... | ; Now we are creating the client window... | ||
push 0x00000000 | push 0x00000000 | ||
Line 463: | Line 389: | ||
add esp, 0x00000034 | add esp, 0x00000034 | ||
mov hwnd_client, eax | mov hwnd_client, eax | ||
</ | </code> | ||
Now, the fragment of the window WM_CREATE processing procedure. | |||
<code> | |||
push [ebp+0x00000010] | |||
push 0x00000000 | push 0x00000000 | ||
push hwnd | push hwnd | ||
call WinSetWindowPtr | call WinSetWindowPtr | ||
add esp, 0x0000000C | add esp, 0x0000000C | ||
</ | </code> | ||
Here the pointer to the reserved memory is passed as the third argument to the window procedure with arguments {hwnd, ulmsgid, mp1, mp2}, hence mp1=[ebp+0x00000010]. | |||
procedure with arguments {hwnd, ulmsgid, mp1, mp2}, hence mp1=[ebp+0x00000010]. | From this point on, the data will be easily accessible from any window subroutine, by calling WinQueryWindowPtr. | ||
From this point on, the data will be easily accessible from any window | |||
subroutine, by calling WinQueryWindowPtr. | |||
==Examble B: Very Simple Object== | |||
It is one of the simplest possible WPS objects. A child of WPDataFile, with only 4 simple overrides: {wpclsQueryIconData, wpclsQueryTitle, wpclsQueryInstanceType, wpclsQueryDefaultView}. The library creation involves only assembling and linking, as there are no any resources defined. | |||
==Example C: A Dangerous Folder== | |||
wpDrop method, in order to introduce a couple of new options, besides the standard drop | In this example we construct a derived class qHole from WPFolder class. We override wpDrop method, in order to introduce a couple of new options, besides the standard drop behavior: Based on the value of an instance variable, the drop operation will | ||
behavior: Based on the value of an instance variable, the drop operation will | * Call the parent method (standard folder behavior); | ||
* Erase the dropped object (and its element objects, if the dropped object is a folder type); <li> Erase only subobjects that are not of folder-type (leaving the "skeleton" of an initial folder); <li> Allow entry to objects of qHole only. </ul> | |||
the dropped object is a folder type); <li> Erase only subobjects that are not of folder-type | |||
(leaving the "skeleton" of an initial folder); <li> Allow entry to objects of | |||
qHole only. </ul> | |||
The instance variable is controlled via a special settings page, introduced on top of other | |||
settings pages by overriding wpAddSettingsPages method.</div> </p> | settings pages by overriding wpAddSettingsPages method.</div> </p> | ||
Line 683: | Line 599: | ||
OS/2 Programming Toolkit. </p> | OS/2 Programming Toolkit. </p> | ||
<b>[4]</b> Black Hole WorkPlace Shell Class. By Gregory Czaja. Available at | |||
[http://hobbes.nmsu.edu Hobbes Repository]. | [http://hobbes.nmsu.edu Hobbes Repository]. | ||
[[Category:Languages Articles]] | [[Category:Languages Articles]] | ||
[[Category:SOM Articles]] | [[Category:SOM Articles]] |
Revision as of 03:08, 25 March 2016
Original Work by Micho Durdevich
Introduction=
In this article we are going to highlight the most important steps in creating WorkPlace Shell objects using the machine language paradigm. The only tools that are necessary to build the corresponding dynamic link libraries are: the Watcom assembler (WASM), the linker program (WLINK) and the resource compiler (RC). The IBM Toolkit/2 should be installed, too.
At a first sight it might appear silly to try WorkPlace Shell in assembler: As is well known, the OS/2 graphical user interface is built on completely object-oriented grounds of SOM by IBM (System Object Model). The WPS design is truly unique, and establishes a "universe of objects" on its own. Traditionally, it is thought that such object-oriented programming can be formulated only within appropriate so-called high-level languages (like C or C++), which somehow contain the philosophy of objects implicitly or explicitly built in their syntax and semantic rules. In our opinion, this view is entirely wrong: The object orientation has not much to do with the programming language, but with the general vision of the programming model.
As we already mentioned in the introduction to this series, assembly language is really interpretable as the highest level programming language, if we adopt the viewpoint of the language expressive power. Therefore, in principle it should be possible to express any idea written in a "high-level" language (like C or C++) directly in terms of assembler. Of course, such a conversion might not be nice or easy at all, and in order to get a meaningful assembly-level result it might be necessary to rewrite the entire execution environment ensuring the existence of the objects in question. As an example of such a poorly designed system from the assembly viewpoint, we can mention a charming Qt toolkit by Trolltech.
Fortunately, in the case of WPS and SOM for OS/2, thanks to a high internal simplicity and elegance of the API set, we are able to proceed directly with assembly language programming without having to change the internals of the universe of objects.
The structure of a typical WPS class library
Initialization Structures
At first, let us mention that every WPS class DLL should possess the following entry procedure declared as public and starting point:
public _dllentry
_dllentry proc
mov eax, 1
ret
_dllentry endp
Next, there are initialization routines for global (class-level) and local (instance) attributes. Here is the corresponding code, concretized to the case the object in question is derived from WPDataFile class (taken from our first sample, see below). During its startup, WorkPlace shell will execute the {xxx}NewClass (local attributes) routine for every registered class.
This routine, in its turn, calls the global "sister" routine. Global symbols are almost always prefixed with "M_".
public myObjectNewClass
myObjectNewClass proc near
push ebp
mov ebp,esp
push ecx
push 0x00000002
push 0x00000001
call WPDataFileNewClass
push 0x00000002
push 0x00000001
call M_myObjectNewClass
push dword [ebp+0xC]
push dword [ebp+8]
push offset class_information_object
push 0x00000001
call somBuildClass
add esp, 0x00000020
mov somclass_id_obj, eax
mov ecx, num_parentmethods_obj
@obj_numresolve_loop:
mov eax, obj_parentmethods_table[ecx*4]
push ecx
push dword WPObjectClassData[eax]
push 0x00000001
push dword myObjectExtraData
call somParentNumResolve
add esp, 0x0000000C
pop ecx
mov obj_parentmethods_table[ecx*4], eax
loop @obj_numresolve_loop
mov eax, somclass_id_obj
pop ecx
pop ebp
ret
myObjectNewClass endp
The global sister routine is a little simpler... Let us also observe that in the loops involving somParentNumResolve, we had to push/pop the loop counting ecx register. This was necessary because the register is not preserved across the call to this particular API. In general, return codes of OS/2 APIs are stored in the eax register, but many of the OS/2 APIs do not care much about other registers like ebx, ecx and edx. This property should be carefully
taken into account. In future versions of eComStation, we will be systematically replacing the APIs so that all non-return-type registers are preserved across the OS calls (it is worth mentioning here that FreeBSD kernel fully complies with this important property).
public M_myObjectNewClass
M_myObjectNewClass proc
push ebp
mov ebp, esp
push ecx
push 0x00000002
push 0x00000001
call M_WPDataFileNewClass
add esp, 0x8
push dword [ebp+0xC]
push dword [ebp+8]
push offset class_information_global
push 0x00000001
call somBuildClass
add esp, 0x00000010
mov somclass_id_cls, eax
mov ecx, num_parentmethods_cls
@cls_numresolve_loop:
mov eax, cls_parentmethods_table[ecx*4]
push ecx
push dword M_WPObjectClassData[eax]
push 0x00000001
push dword M_myObjectExtraData
call somParentNumResolve
add esp, 0x0000000C
pop ecx
mov cls_parentmethods_table[ecx*4], eax
loop @cls_numresolve_loop
mov eax, somclass_id_cls
pop ecx
pop ebp
ret
M_myObjectNewClass endp
Object & Class Data
Let us have a look at important data structures figuring in the initialization calls of the previous section. At first, we have cls_parentmethods_table and obj_parentmethods_table. These data structures are linear lists of dwords of the form parent_wpsmethod dd token_wpsmethod
At the beginning of each list, we have a static variable num_parentmethods_cls and num_parentmethods_obj respectively. During the class initialization, original token_wpsmethod values (from the class in which these methods are originally introduced) are replaced by the flat addresses of the corresponding parent methods. In this way, the selected parent methods become available to our objects.
Next interesting data structure is 8 bytes long and suffixed by ExtraData string. In the
listings above, we have 2 of them: myObjectExtraData and M_myObjectExtraData. The first dword is reserved for the corresponding parent method table list address. It is filled during the processing of somBuildClass. The second dword is filled out with the address of the
object data retriever routine with the help of which we can access global and local
variables the object is using. A typical call to this routine would be:
push somSelf
call WPSObjExtraData[4]
add esp, 0x00000004
mov somThis, eax
After this, variable somThis contains the flat offset of the allocated global or local data. Almost always, the somSelf pointer is simply given by somSelf = dword [ebp+0x00000008]
Finally, the only exported data structures are global and local ClassData-dwords. They are filled with the addresses of the associated token method tables (during somBuildClass processing). This allows other objects to gain access to the specific methods of our object.
Object & Class Tables
There are 2 critical complex data structures that "coordinate" all aspects of a given WPS class (corresponding to global and local aspects, as always). We shall call them class_information_object and class_information_global. They are passed as parameters to somBuildClass calls.
Both are incarnations of a fundamental SOMClassInformation structure, which is defined below. Non-applicable parameters are usually left zero, when instantiating the structure.
SOMClassInformation struct
somVersion dd 4
numStaticMethods dd 0 ; Number of fixed internal methods
numStaticOverrides dd 0 ; Number of static overrides
numNonInternalData dd 0
numProcMethods dd 0
numVarArgsFuncs dd 0
majorVersion dd 0
minorVersion dd 0
instanceDataSize dd 0
numMaxMethods dd 0
numParents dd 0
of2ClassName dd 0 ; 2-fold pointer
of2ClassMeta dd 0 ; 2-fold pointer
implicitParentMeta dd 0
of3ParentName dd 0 ; 3-fold pointer
offClassData dd 0 ; Offset to ClassData structure
offExtraData dd 0 ; Offset to ExtraData structure
tblStatic dd 0 ; Offset to static methods table
tblMethodOverrides dd 0 ; Offset to method overrides
nitReferenceBase dd 0
datatokensInstance dd 0 ; Datatokens for instance data
arbitraryMembersCD dd 0 ; Arbitrary ClassData members
stubsVarArgs dd 0 ; Varargs stubs
classInitFunction dd 0 ; Class init function
alignementByte dd 0 ; Desired byte alignement
numDirectInitClass dd 0xFFFFFFFF
tblDirectInitClass dd 0
numGeneralMethods dd 0
methodTokens dd 0
protectedDataOffset dd 0
somSciVersion dd 0
numInheritedMethods dd 0
impInheritedMethods dd 0 ; Inherited methods implementations
numClassDataEntries dd 0 ; Number of method entries in ClassData
tblClassDataEntryNames dd 0
numMigratedMethods dd 0
impMigratedMethods dd 0 ; Migrated methods implementations
numInitializers dd 0
tblInitializers dd 0 ; Pointers to initializers, in release order.
directToSOMClass dd 0
dynamicallyComputed dd 0
SOMClassInformation ends
In the above structure, we can see pointers to several other important data objects. At first, we see of2ClassName and of2ClassMeta, they have the 2-fold pointer form
class_name_off dd flat:class_name class_meta_off dd flat:class_meta
class_name db "the-name-of-the-class", 0 class_meta db "meta-class-name", 0
In the case of class_information_global, the field for of2ClassMeta is always zero (no meta^2)! Another interesting entry corresponds to of3ParentName.
It is a 3-fold pointer, realized as follows (global/local):
parent_name_@ff dd flat:parent_name_off parent_meta_@ff dd flat:parent_meta_off
parent_name_off dd flat:parent_name parent_meta_off dd flat:parent_meta
parent_name db "the-name-of-the-parent", 0 parent_meta db "the-meta-parent_name", 0
Perhaps the most important SOMClassInformation entry is the pointer to method overrides table. This table has the form of the linear list of the pairs
dd flat:@ff_parent_method_name
dd flat:new_implementation_proc
where we have again nice 3-fold pointers
@ff_parent_method_name dd flat:off_parent_method_name
off_parent_method_name dd flat:str_parent_method_name
str_parent_method_name db "classname::standard-wps-name", 0
and new_implementation_proc is the procedure that overrides the method parent_method_name.
Furthermore, let us examine another very important entry: tblStatic. It is a pointer to the table of static methods introduced by the given class. The static method table is a linear
listing of the following 6-fold entries, one for each method:
dd flat:myobjectClassData[method_index]
dd flat:@ff_methodNameBase
dd flat:@ff_methodNameFull
dd flat:method_procedure
dd flat:method_redispatch
dd flat:method_applystub
where method_procedure is the procedure that implements the method, method_index is offset to the method entry in the ClassData structure, while @ff_methodNameBase and @ff_methodNameFull are nice 3-fold pointers entangled in the following structure (recommended, to avoid duplications):
dd str_methodNameFull: db "::myObject::"
dd str_methodNameBase: db "method_name", 0
dd off_methodNameBase dd flat:str_methodNameBase
dd @ff_methodNameBase dd flat:off_methodNameBase
dd off_methodNameFull dd flat:str_methodNameFull
dd @ff_methodNameFull dd flat:off_methodNameFull
The last 2 dwords in the above 6-fold method table entry point to associated redispatch and apply stubs procedures. In the simplest scenario, these fields should be 0xFFFFFFF and 0x0000000 respectively (no redispatches/apply stubs).
Related to static methods are also entries {numStaticMethods, numClassDataEntries, numMaxMethods}. When defining ClassData structures, enough space should be left to accommodate all method tokens, and the class information (the first entry).
Finally, let us observe that the field instanceDataSize determines the amount of memory reserved for object data. It is exactly this memory area which is getting mapped by calling WPSObjExtraData[4] with somSelf as the unique argument. As we already mentioned, the result is the value of the somThis pointer.
Example A: Quantum Rectangles
Object Description
In the accompanying sample code, we are presenting a simple yet sufficiently illustrative WPS object, based on the random-rectangles PM program (discussed in detail within the PM-assembling article). We are constructing a child of WPDataFile object, displaying randomly fluctuating rectangles as the default view. The random number generator routine is the same as the one used in the PM example, based on a powerful multiply-with-carry algorithm. However here we control the rectangles via a special timer (WinCreateTimer, WinSetTimer) while in the PM example it was a simple cyclic thread created by DosCreateThread. Our object also features
- A special settings page, controlling the state of the rectangles system:
speed, stop/go. The mentioned settings page is introduced by overriding wpAddBecomePage method;
- A possibility to save the state via wpSaveDeferred method;
- Modification of the object pop-up menu, so that the quantum rectangles view properly appears.
All the samples are available at our download section.
How To Compile
The creation of the class DLL is very simple, in 3 steps: assembling, linking
and resource-compiling. Explicitly,
wasm qr.asm
wlink @qr
rc qr qr.dll
Here, the linking info is stored in the file qr.lnk and the resource info is within qr.rc
file. Don't forget to check the size of the DLL :)
In order to use the newly created class, it is necessary to register it:
/* Registering qRectangle class */
call RxFuncAdd 'SysLoadFuncs', 'RexxUtil', SysLoadFuncs'
call SysLoadFuncs
if SysRegisterObjectClass(qRectangle, "QR") then
say "Okidoki!"
else say "Oops, a problem :("
Object instances are created in the standard way. For example,
/* Creating an instance of qRectangle class */
call RxFuncAdd 'SysLoadFuncs', 'RexxUtil', SysLoadFuncs'
call SysLoadFuncs
if SysCreateObject("qRectangle", "Quantum Rectangles", "<WP_DESKTOP>", "", "R") then
say "Object created succesfully!"
else say "Oops, a problem :("
Connecting PM and WPS stuff
An interesting problem appears when interconnecting PM and WPS code. Since PM is not inherently aware of WPS object-related entities like somSelf and somThis, how to keep this information available to PM windows? The solution is to reserve the memory during the window creation, and then save the pointer to this reserved memory during the window initialization procedure.
More precisely:
- When calling WinRegisterClass, specify the amount of reserved memory for the WPS-related
stuff.
- Create frame and client windows using WinCreateWindow function;
- When creating the client window, pass the pointer to a reserved memory location
containing the somSelf pointer, as an appropriate argument to WinCreateWindow;
- During the processing of WM_CREATE message, setup a pointer to the reserved memory
location, with the help of WinSetWindowPtr.
Here are relevant pieces of code corresponding to the outlined methodology. At first, window creation. This is a fragment of the initialization procedure specified in the object-specific (overridden) version of wpOpen (the creation of the window in which a quantum-fluctuating rectangles will be displayed).
push HWND_DESKTOP
call WinQueryAnchorBlock
add esp, 0x00000004
mov hab, eax
push 0x00000040
push (CS_SIZEREDRAW+CS_SYNCPAINT)
push offset rectangles_procedure
push offset rectangles_window_class
push hab
call WinRegisterClass
add esp, 0x00000014
push 0x00000000 ; At first, creating the frame window...
push offset frame_ct_data
push 0x00000020
push HWND_TOP
push 0x00000000
push 0
push 0
push 0
push 0
push 0
push offset qwindow_title
push WC_FRAME
push HWND_DESKTOP
call WinCreateWindow
add esp, 0x00000034
mov hwnd_frame, eax
push 0x00000000 ; Let us allocate some memory, for the use
push 0x40 ; of various object parameters, that
push somSelf ; are related to windows rectangles stuff.
call dword os2_wpAllocMem
add esp, 0x0000000C ; {somSelf, UseItem, ViewItem}
mov ebx, somSelf ; Store somSelf at the beginning...
mov [eax], ebx
mov dword [eax][0x4], USAGE_OPENVIEW
mov dword [eax][0x8], 0 ; Eight bytes for the UseItem structure.
mov dword [eax][0xC], Q_OPEN ; ViewItem.view
mov ebx, hwnd_frame ; ViewItem.handle
mov [eax+0x10], ebx
mov dword [eax][0x14], 0 ; ViewItem.ulViewState
mov dword [eax][0x18], 0 ; ViewItem.hwndCnr
mov dword [eax][0x1C], 0 ; ViewItem.pRecord
mov client_data, eax
; Now we are creating the client window...
push 0x00000000
push eax
push FID_CLIENT
push HWND_TOP
push hwnd_frame
push 0
push 0
push 0
push 0
push 0
push 0x00000000
push offset rectangles_window_class
push hwnd_frame
call WinCreateWindow
add esp, 0x00000034
mov hwnd_client, eax
Now, the fragment of the window WM_CREATE processing procedure.
push [ebp+0x00000010]
push 0x00000000
push hwnd
call WinSetWindowPtr
add esp, 0x0000000C
Here the pointer to the reserved memory is passed as the third argument to the window procedure with arguments {hwnd, ulmsgid, mp1, mp2}, hence mp1=[ebp+0x00000010].
From this point on, the data will be easily accessible from any window subroutine, by calling WinQueryWindowPtr.
Examble B: Very Simple Object
It is one of the simplest possible WPS objects. A child of WPDataFile, with only 4 simple overrides: {wpclsQueryIconData, wpclsQueryTitle, wpclsQueryInstanceType, wpclsQueryDefaultView}. The library creation involves only assembling and linking, as there are no any resources defined.
Example C: A Dangerous Folder
In this example we construct a derived class qHole from WPFolder class. We override wpDrop method, in order to introduce a couple of new options, besides the standard drop behavior: Based on the value of an instance variable, the drop operation will
- Call the parent method (standard folder behavior);
- Erase the dropped object (and its element objects, if the dropped object is a folder type);
- Erase only subobjects that are not of folder-type (leaving the "skeleton" of an initial folder);
- Allow entry to objects of qHole only.
The instance variable is controlled via a special settings page, introduced on top of other
settings pages by overriding wpAddSettingsPages method.
Our qHole object features two simple instance methods, qholeSetState and qholeGetState, controlling the above mentioned variable.
In constructing this sample (available at our download section) we were inspired by a well known Black Hole class [4].
Here is the main destroyer procedure. It checks first the dropped object type, and if the dropped object is a folder then it would enter a recursive loop to handle the subobjects. In case of WPFileSystem objects, the procedure would reset the file attributes, before deleting. It would also reset the object flags in general, before invoking the (sub)object-specific version of wpFree (calculated via somResolve).
actual_delete proc ; ebx contains the skeleton/full-destroy push ebp ; choice! The unique argument is the mov ebp, esp ; object we are applying the procedure to. sub esp, 0x00000010 mov [ebp-0x00000008], ebx push WPFolderClassData ; Let us first check to see if we deal push [ebp+0x00000008] ; with folder objects... call SOMObjectClassData[tok_somIsA] add esp, 0x00000008 test eax, eax jz @ad_test4filesystem push 0x00000000 ; If yes, fully populate the folder push 0 ; so that we can examine its contents. push 0 push [ebp+0x00000008] call WPFolderClassData[tok_wpPopulate] add esp, 0x00000010 test eax, eax jz @ad_test4filesystem push QC_FIRST ; Let us see if there is at least one push 0 ; object in the folder. push [ebp+0x00000008] call WPFolderClassData[tok_wpQueryContent] add esp, 0x0000000C mov [ebp-0x00000004], eax test eax, eax jz @ad_folder_done ; Folder empty => proceed further. @ad_folder_loop: push QC_NEXT push [ebp-0x00000004] ; <= We are refering to the current push [ebp+0x00000008] ; object in the folder contents list call WPFolderClassData[tok_wpQueryContent] add esp, 0x0000000C mov [ebp-0x0000000C], eax ; Save the next object before deleting! mov ebx, [ebp-0x00000008] push [ebp-0x00000004] call actual_delete add esp, 0x00000004 mov eax, [ebp-0x0000000C] mov [ebp-0x00000004], eax test eax, eax jnz @ad_folder_loop @ad_folder_done: ; Checking for the skeleton mode... cmp dword [ebp-0x00000008], 2 jz @ad_exit push 0x00000000 ; Once again, fully populate the folder push 0 ; so that we can examine new contents. push 0 push [ebp+0x00000008] call WPFolderClassData[tok_wpPopulate] add esp, 0x00000010 push QC_FIRST ; Let us double-check to see if the push 0 ; folder is really empty, if not it push [ebp+0x00000008] ; means an error occurred, so we quit! call WPFolderClassData[tok_wpQueryContent] add esp, 0x0000000C test eax, eax jnz @ad_exit jmp short @ad_filesystem_ok @ad_test4filesystem: push WPFileSystemClassData push [ebp+0x00000008] call SOMObjectClassData[tok_somIsA] add esp, 0x00000008 cmp eax, 0 jz @ad_nofilesystem @ad_filesystem_ok: ; We are resetting the attributes so ; that the fileobject can be deleted. push [ebp+0x00000008] call WPFileSystemClassData[tok_wpQueryAttr] and eax, 0xFFFFFFFEh push eax push [ebp+0x00000008] call WPFileSystemClassData[tok_wpSetAttr] add esp, 0x0000000C @ad_nofilesystem: ; General style modification, before ; calling wpFree. push 0 push OBJSTYLE_NODELETE push [ebp+0x00000008] call WPObjectClassData[tok_wpModifyStyle] add esp, 0x0000000C push WPObjectClassData[tok_wpFree] push [ebp+0x00000008] call somResolve add esp, 0x00000008 push [ebp+0x00000008] call eax add esp, 0x00000004 @ad_exit: mov esp, ebp pop ebp ret actual_delete endp
wasm qhole.asm wlink @qhole rc qhole qhole.dll
Concluding Remarks
There is a lot of fun in constructing WPS objects in assembler. In the above discussed examples, we tried to emphasize the simple internal structure of objects, and therefore we have not always optimized the code for maximum performance (for example, by holding certain variables in registers instead of using memory). We also used the ebp-frame format for majority of procedures, and stack space for procedure arguments... All 3 examples feature custom icons (standard and animation, in case of qHole). These icons are fixed by overriding class methods wpclsQueryIconData and wpclsQueryIconDataN. We decided to specify icons as resources from the main WPS library PMWP.DLL.
And no doubts, it takes more efforts to code a WPS library in assembler, than using pre-defined macros linked with the Interface Definition Language. However all the difficulties are non-essential, and there are quite non-trivial advantages in using our programming model:
- Complete control of objects behavior;
- The best possible optimization;
- Enhanced creativity;
- Removal of junk code;
- Deeper understanding of the WPS internals, and OS/2 in general.
In forthcoming articles, we shall discuss more complex situations, including sophisticated requester WPS objects from our UAME2 package for diskless remote-booting.
References
[1]The Art of Assembly Language Programming and HLA. By Randall Hyde. An extensive and beautiful assembly language tutorial + related topics.
[2] SOM Programming Reference and Guide. By IBM. Part of OS/2 Programming Toolkit {somguide.inf + somref.inf}.
[3] WPS Programming Reference. By IBM. Files {wps1.inf + wps2.inf + wps3.inf} of OS/2 Programming Toolkit.
[4] Black Hole WorkPlace Shell Class. By Gregory Czaja. Available at Hobbes Repository.