| Version: | 0.0-6 |
| Title: | R/KEA Interface |
| Description: | An R interface to KEA (Version 5.0). KEA (for Keyphrase Extraction Algorithm) allows for extracting keyphrases from text documents. It can be used either for free indexing or for indexing with a controlled vocabulary. For more information see http://www.nzdl.org/Kea/. |
| Imports: | RKEAjars (≥ 5.0-1), rJava (≥ 0.6-3), tm |
| SystemRequirements: | Java (>= 5.0) |
| License: | GPL-2 |
| Packaged: | 2015-04-03 15:15:00 UTC; hornik |
| Author: | Ingo Feinerer [aut], Kurt Hornik [aut, cre] |
| Maintainer: | Kurt Hornik <Kurt.Hornik@R-project.org> |
| NeedsCompilation: | no |
| Repository: | CRAN |
| Date/Publication: | 2015-04-03 17:27:56 |
Create a KEA Model
Description
Create a keyphrase extraction model.
Usage
createModel(corpus, keywords, model, voc = "none", vocformat = "")
Arguments
| corpus | A list of character vectors containing the text documents, e.g., a corpus from package tm. |
| keywords | A list of character vectors containing the keywords for each document in corpus. |
| model | A character giving the path where the created model should be stored. |
| voc | A character pointing to a controlled vocabulary. |
| vocformat | A character giving the format of voc. |
Details
A tutorial on keyword extraction is located at http://www.nzdl.org/Kea/Download/Kea-5.0-Readme.txt. There you can find details on the internals of KEA, including various parameter settings (e.g., details on vocabularies and supported formats for these).
When a controlled vocabulary is used (by default, none is), the voc
argument should give the path to the vocabulary files without
their extensions. When vocformat is "skos", the file
extension must be ‘.rdf’; when it is "text", there must be three files
with the extensions ‘.en’, ‘.rel’ and ‘.use’.
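The file-naming rule above can be checked before calling into KEA. The following is a minimal sketch: expected_voc_files() is a hypothetical helper, not part of RKEA, and the path "vocabularies/myvoc" is made up for illustration.

```r
# Hypothetical helper (not part of RKEA): list the vocabulary files that
# should exist for a given `voc` path prefix and `vocformat`.
expected_voc_files <- function(voc, vocformat) {
  # Extensions as described in the Details text above.
  extensions <- list(
    skos = ".rdf",
    text = c(".en", ".rel", ".use")
  )
  if (!vocformat %in% names(extensions)) {
    stop("no file extensions documented for vocformat: ", vocformat)
  }
  paste0(voc, extensions[[vocformat]])
}

# `voc` is given without an extension; this path is an assumption.
voc <- "vocabularies/myvoc"
files <- expected_voc_files(voc, "text")
print(files)

# Report any files that are not where the rule says they should be.
missing_files <- files[!file.exists(files)]
if (length(missing_files) > 0) {
  message("Missing vocabulary files: ", paste(missing_files, collapse = ", "))
}
```

Running such a check first turns a misnamed vocabulary file into a readable message in R rather than a failure on the Java side.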
Value
Invisibly returns model, i.e., the path to the created KEA
model.
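A call might look like the following sketch. The two toy documents, their keywords, and the model location under tempdir() are assumptions made for illustration; a useful model would presumably need far more training data. The call is skipped when RKEA (and with it a working Java installation) is not available.

```r
# Toy training data (made up for illustration).
docs <- list(
  c("Keyphrase extraction assigns short phrases to a document.",
    "KEA learns a model from documents with known keyphrases."),
  c("A controlled vocabulary restricts the phrases that may be assigned.")
)
keywords <- list(
  c("keyphrase extraction", "model"),
  c("controlled vocabulary")
)

# One keyword vector per document.
stopifnot(length(docs) == length(keywords))

# Where the model should be written (an assumed location).
model_path <- file.path(tempdir(), "kea_model")

if (requireNamespace("RKEA", quietly = TRUE)) {
  # Free indexing: voc and vocformat keep their defaults.
  # The path is returned invisibly, so assign it to inspect it.
  created <- RKEA::createModel(docs, keywords, model_path)
  print(created)
} else {
  message("RKEA is not installed; skipping model creation.")
}
```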
Author(s)
Ingo Feinerer
References
See Also
Extract Keywords
Description
Extract keywords from text documents.
Usage
extractKeywords(corpus, model, voc = "none", vocformat = "")
Arguments
| corpus | A list of character vectors containing the text documents, e.g., a corpus from package tm. |
| model | A character giving the path to a KEA model. |
| voc | A character pointing to a controlled vocabulary. |
| vocformat | A character giving the format of voc. |
Details
A tutorial on keyword extraction is located at
http://www.nzdl.org/Kea/Download/Kea-5.0-Readme.txt. There you can
find details on the internals of KEA, including various parameter
settings (e.g., valid arguments for voc and vocformat).
Value
A list of character vectors giving the keywords extracted for the
documents in corpus.
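A usage sketch follows, assuming a model has already been written by createModel() to the path shown. The document text and the model location are made up for illustration. The call is skipped when RKEA is not installed or the model file does not exist.

```r
# A new document to index (made up for illustration).
new_docs <- list(
  c("KEA extracts keyphrases from text documents.")
)

# Assumed location of a model created earlier with createModel().
model_path <- file.path(tempdir(), "kea_model")

if (requireNamespace("RKEA", quietly = TRUE) && file.exists(model_path)) {
  # Free indexing: voc and vocformat keep their defaults.
  found <- RKEA::extractKeywords(new_docs, model_path)
  # Print the keyword vector returned for each document.
  for (i in seq_along(found)) {
    cat("Document", i, ":", paste(found[[i]], collapse = "; "), "\n")
  }
} else {
  message("RKEA or the model file is not available; skipping extraction.")
}
```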
Author(s)
Ingo Feinerer