SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1150, Tue Nov 15 15:37:17 2011 UTC revision 1231, Wed Jul 10 06:51:26 2013 UTC
# Line 1  Line 1 
1    2013-07-10  Ingo Feinerer <feinerer@logic.at>
2    
3            * R/reader.R (readPDF): Use tm:::pdfinfo() (which needs the pdfinfo
4            command line tool) instead of tools:::pdf_info().
5    
6    2013-04-11  Ingo Feinerer <feinerer@logic.at>
7    
8            * R/transform.R (removeWords): Use PCRE UCP to use Unicode properties
9            to determine character types.
10    
11    2012-12-14  Ingo Feinerer <feinerer@logic.at>
12    
13            * R/matrix.R (TermDocumentMatrix): Ensure dimnames of type character
14            when generating a simple_triplet_matrix. Reported by Arho Suominen.
15    
16    2012-12-10  Ingo Feinerer <feinerer@logic.at>
17    
18            * man/tm_reduce.Rd: Document right to left folding order. Adapt
19            example as well. Suggested by Mark Rosenstein.
20    
21    2012-12-04  Ingo Feinerer <feinerer@logic.at>
22    
23            * R/filter.R (sFilter): Avoid attach() and simplify.
24    
25    2012-11-02  Ingo Feinerer <feinerer@logic.at>
26    
27            * R/doc.R (.TextDocument): Use casts to ensure data types and to avoid
28            removal of attributes.
29    
30    2012-10-03 Ingo Feinerer  <feinerer@logic.at>
31    
32            * R/weight.R (weightTfIdf, weightSMART): Gracefully handle empty
33            columns and rows (avoids blow-up due to NaN values). Suggested by Jaap
34            Frölich.
35    
36    2012-07-27 Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/transform.R (removeWords): Allow longer stopword lists.
39    
40    2012-01-31  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/reader.R (readXML): Readers can now set the document language
43            themselves.
44    
45    2012-01-14  Ingo Feinerer  <feinerer@logic.at>
46    
47            * R/source.R (XMLSource, getElem.XMLSource): Simplifications as
48            proposed by Milan Bouchet-Valat.
49    
50    2012-01-11  Ingo Feinerer  <feinerer@logic.at>
51    
52            * R/matrix.R (termFreq): Fix processing of user provided
53            stopwords. Reported by Bettina Grün.
54    
55    2011-12-23  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/matrix.R (termFreq): Fix invalid handling of
58            control$wordLengths[1]. Reported by Steven C. Bagley.
59    
60    2011-12-17  Ingo Feinerer  <feinerer@logic.at>
61    
62            * DESCRIPTION (Version): Prepare for CRAN Christmas release.
63    
64    2011-12-12  Ingo Feinerer  <feinerer@logic.at>
65    
66            * R/utils.R (map_IETF_Snowball): Map empty input to "porter".
67    
68    2011-12-07  Ingo Feinerer  <feinerer@logic.at>
69    
70            * R/transform.R (removePunctuation): Add option to preserve
71            intra-word dashes.
72    
73    2011-12-06  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/matrix.R (termFreq): Allow reordering of control option
76            processing.
77    
78    2011-11-17  Ingo Feinerer  <feinerer@logic.at>
79    
80            * R/reader.R (readPDF): Use tools:::pdf_info() instead of external
81            pdfinfo tool.
82    
83            * inst/stopwords/SMART.dat: Add SMART information retrieval system
84            stopwords (which are also used by the MC toolkit).
85    
86            * R/matrix (termFreq): Allow local option \code{bounds$local} to
87            restrict how often a term may appear in each document (generalizes
88            \code{minDocFreq}). Similarly the local option \code{wordLenghts}
89            for word length bounds (generalizes \code{minWordLength}).
90    
91            * R/matrix.R (TermDocumentMatrix.VCorpus): New global option
92            \code{bounds$global} for restricting how often a term is allowed
93            to appear in different documents.
94    
95            * R/matrix.R (TermDocumentMatrix.VCorpus): Distinguish between
96            local options delegated internally to termFreq() and global
97            options which are processed by the term-document matrix
98            constructor itself.
99    
100  2011-11-15  Ingo Feinerer  <feinerer@logic.at>  2011-11-15  Ingo Feinerer  <feinerer@logic.at>
101    
102          * man/getTokenizers.Rd: Document getTokenizers().          * man/getTokenizers.Rd: Document getTokenizers().

Legend:
Removed from v.1150  
changed lines
  Added in v.1231

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge