SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1150, Tue Nov 15 15:37:17 2011 UTC revision 1234, Thu Jul 25 17:45:00 2013 UTC
# Line 1  Line 1 
1    2013-07-25  Ingo Feinerer <feinerer@logic.at>
2    
3            * R/complete.R (stemCompletion): Report NA instead of error when no
4            completion can be found by the prevalent heuristic. Suggested by Hugh
5            Devlin.
6    
7    2013-07-10  Ingo Feinerer <feinerer@logic.at>
8    
9            * R/reader.R (readPDF): Use tm:::pdfinfo() (which needs the pdfinfo
10            command line tool) instead of tools:::pdf_info().
11    
12    2013-04-11  Ingo Feinerer <feinerer@logic.at>
13    
14            * R/transform.R (removeWords): Use PCRE UCP to use Unicode properties
15            to determine character types.
16    
17    2012-12-14  Ingo Feinerer <feinerer@logic.at>
18    
19            * R/matrix.R (TermDocumentMatrix): Ensure dimnames of type character
20            when generating a simple_triplet_matrix. Reported by Arho Suominen.
21    
22    2012-12-10  Ingo Feinerer <feinerer@logic.at>
23    
24            * man/tm_reduce.Rd: Document right to left folding order. Adapt
25            example as well. Suggested by Mark Rosenstein.
26    
27    2012-12-04  Ingo Feinerer <feinerer@logic.at>
28    
29            * R/filter.R (sFilter): Avoid attach() and simplify.
30    
31    2012-11-02  Ingo Feinerer <feinerer@logic.at>
32    
33            * R/doc.R (.TextDocument): Use casts to ensure data types and to avoid
34            removal of attributes.
35    
36    2012-10-03 Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/weight.R (weightTfIdf, weightSMART): Gracefully handle empty
39            columns and rows (avoids blow-up due to NaN values). Suggested by Jaap
40            Frölich.
41    
42    2012-07-27 Ingo Feinerer  <feinerer@logic.at>
43    
44            * R/transform.R (removeWords): Allow longer stopword lists.
45    
46    2012-01-31  Ingo Feinerer  <feinerer@logic.at>
47    
48            * R/reader.R (readXML): Readers can now set the document language
49            themselves.
50    
51    2012-01-14  Ingo Feinerer  <feinerer@logic.at>
52    
53            * R/source.R (XMLSource, getElem.XMLSource): Simplifications as
54            proposed by Milan Bouchet-Valat.
55    
56    2012-01-11  Ingo Feinerer  <feinerer@logic.at>
57    
58            * R/matrix.R (termFreq): Fix processing of user provided
59            stopwords. Reported by Bettina Grün.
60    
61    2011-12-23  Ingo Feinerer  <feinerer@logic.at>
62    
63            * R/matrix.R (termFreq): Fix invalid handling of
64            control$wordLengths[1]. Reported by Steven C. Bagley.
65    
66    2011-12-17  Ingo Feinerer  <feinerer@logic.at>
67    
68            * DESCRIPTION (Version): Prepare for CRAN Christmas release.
69    
70    2011-12-12  Ingo Feinerer  <feinerer@logic.at>
71    
72            * R/utils.R (map_IETF_Snowball): Map empty input to "porter".
73    
74    2011-12-07  Ingo Feinerer  <feinerer@logic.at>
75    
76            * R/transform.R (removePunctuation): Add option to preserve
77            intra-word dashes.
78    
79    2011-12-06  Ingo Feinerer  <feinerer@logic.at>
80    
81            * R/matrix.R (termFreq): Allow reordering of control option
82            processing.
83    
84    2011-11-17  Ingo Feinerer  <feinerer@logic.at>
85    
86            * R/reader.R (readPDF): Use tools:::pdf_info() instead of external
87            pdfinfo tool.
88    
89            * inst/stopwords/SMART.dat: Add SMART information retrieval system
90            stopwords (which are also used by the MC toolkit).
91    
92            * R/matrix (termFreq): Allow local option \code{bounds$local} to
93            restrict how often a term may appear in each document (generalizes
94            \code{minDocFreq}). Similarly the local option \code{wordLenghts}
95            for word length bounds (generalizes \code{minWordLength}).
96    
97            * R/matrix.R (TermDocumentMatrix.VCorpus): New global option
98            \code{bounds$global} for restricting how often a term is allowed
99            to appear in different documents.
100    
101            * R/matrix.R (TermDocumentMatrix.VCorpus): Distinguish between
102            local options delegated internally to termFreq() and global
103            options which are processed by the term-document matrix
104            constructor itself.
105    
106  2011-11-15  Ingo Feinerer  <feinerer@logic.at>  2011-11-15  Ingo Feinerer  <feinerer@logic.at>
107    
108          * man/getTokenizers.Rd: Document getTokenizers().          * man/getTokenizers.Rd: Document getTokenizers().

Legend:
Removed from v.1150  
changed lines
  Added in v.1234

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge