首页   按字顺浏览 期刊浏览 卷期浏览 Formatting texts accessed randomly
Formatting texts accessed randomly

 

作者: John B. Smith,   Stephen F. Weiss,  

 

期刊: Software: Practice and Experience  (WILEY Available online 1987)
卷期: Volume 17, issue 1  

页码: 5-16

 

ISSN:0038-0644

 

年代: 1987

 

DOI:10.1002/spe.4380170103

 

出版商: John Wiley&Sons, Ltd.

 

关键词: Text formatting;Full‐text retrieval;Format grammar

 

数据来源: WILEY

 

摘要:

AbstractFull‐text systems that access text randomly cannot normally determine the format operations in effect for a given target location. The problem can be solved by viewing the format marks as the non‐terminals in a format grammar. A formatted text can then be parsed using the grammar to build a data structure that serves both as a parse tree and as a search tree. While processing a retrieved segment, a full‐text system can follow the search tree from root to leaf, collecting the format marks encountered at each node to derive the sequence of commands active for that segment. The approach also supports the notion of a ‘well formatted’ document and provides a means for verifying the well‐formedness of a given text. To illustrate the approach, a sample set of format marks and a sample grammar are given suitable for formatting and parsing the article as a

 

点击下载:  PDF (534KB)



返 回