public interface HTMLParser
| Modifier and Type | Method and Description |
|---|---|
DocData |
parse(DocData docData,
String name,
Date date,
Reader reader,
TrecContentSource trecSrc)
Parse the input Reader and return DocData.
|
DocData parse(DocData docData, String name, Date date, Reader reader, TrecContentSource trecSrc) throws IOException
docData - result reusedname - name of the result doc data.date - date of the result doc data. If null, attempt to set by parsed data.reader - reader of html text to parse.trecSrc - the TrecContentSource used to parse dates.IOException - If there is a low-level I/O error.Copyright © 2000-2024 Apache Software Foundation. All Rights Reserved.