
yapp
User
Oct 1, 2001, 2:48 AM
Post #4 of 4
(349 views)
|
Don't forget these files are very complex. airo's example actually opens word (invible though), using OLE/COM. A word file can be RTF (wich is a bit like HTML, and is readable as ASCII) Word files however, are binary. Thay means individual bytes (or combinations of them) in a file represent data. ex. 4 bytes for size, then 4 bytes telling the size of ht e following text, then ascii characters containing the text (maybe the autor field in word), and so on. If Microsoft has documented this structure, you can maybe use it. Still it's difficult to make use of that in perl (with unpack). Word documents can also contain OLE objects, VBA programming lines, styles, images, settings. You all need to take are of those. Either use plain text files, or make a OLE/COM connection using the sample from airo above. If you program in VBA (in word) you \'d reconize there lines.
|