SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/tm/ChangeLog revision 806, Wed Jan 2 10:29:14 2008 UTC pkg/ChangeLog revision 982, Tue Aug 11 07:48:04 2009 UTC
# Line 1  Line 1 
1    2009-08-11  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/reader.R (readMail): Moved to tm.plugin.mail package.
4    
5    2009-07-04  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/reader.R (readNewsgroup): Rename to readMail as newsgroup
8            postings are basically e-mails with some extra headers.
9    
10    2009-07-03  Ingo Feinerer  <feinerer@logic.at>
11    
12            * R/transform.R: Move convertMboxEml, removeCitation,
13            removeMultipart, and removeSignature to the tm.plugin.mail package
14            since they are mainly utility functions (for handling e-mails) and
15            not very framework specific.
16    
17    2009-06-28  Ingo Feinerer  <feinerer@logic.at>
18    
19            * man/: Fix documentation.
20    
21    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
22    
23            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
24            plain text document instead of an XML document for texts of the
25            Reuters-21578 dataset.
26    
27            * R/sparse.R: Removed since the slam package is now available on
28            CRAN.
29    
30            * DESCRIPTION (Depends): Add slam package.
31    
32    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
33    
34            * R/transform.R (stemDoc): Fix character(0) handling.
35    
36    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/doc.R (show): Pretty print.
39    
40    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
43            gracefully.
44    
45    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
46    
47            * R/corpus.R: Make corpus virtual. Implement corpus with standard
48            and permanent storage semantics.
49    
50            * DESCRIPTION: New major release. A *lot* of improvements.
51    
52    2009-05-04   Ingo Feinerer <feinerer@logic.at>
53    
54            * NAMESPACE: Export some simple_triplet_matrix functions.
55    
56    2009-04-28   Ingo Feinerer <feinerer@logic.at>
57    
58            * R/weight.R: Adapt tf-idf to new matrix format.
59    
60    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
61    
62            * R/matrix.R: Create two distinct classes for term-document and
63            document-term matrices.
64    
65    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
66    
67            * R/termdocmatrix.R: No longer use Matrix package. This reduces
68            package start-up time significantly.
69    
70    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
71    
72            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
73    
74    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
75    
76            * R/transform.R (tmReduce): Combine multiple maps into one
77            transformation.
78    
79    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
80    
81            * R/weight.R: Remove weightLogical since it does not return a
82            dgCMatrix.
83    
84            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
85            or TermDocumentMatrix instead.
86    
87    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
88    
89            * inst/doc/extensions.Rnw: Finished vignette.
90    
91    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
92    
93            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
94            DocumentTermMatrix representations.
95    
96    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
97    
98            * R/reader.R (readXML): New reader for arbitrary XML files.
99    
100    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
101    
102            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
103            (XMLSource): New XMLSource class for arbitrary XML files.
104            (Source): New slot Vectorized.
105    
106    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
107    
108            * R/reader.R (readTabular): Experimental reader for tabular data
109            structures which can be customized via user-defined mappings.
110    
111            * R/reader.R: Always use UTC time zone.
112    
113            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
114    
115    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
116    
117            * R/reader.R (readDOC): Options can be passed over to antiword.
118    
119            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
120            pdftotext.
121    
122    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
123    
124            * R/source.R (DirSource): Add pattern and ignore.case arguments
125            which are internally passed over to list.files().
126    
127    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
128    
129            * inst/doc/tm.Rnw: Suppress pointless loading message.
130    
131    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
132    
133            * DESCRIPTION: Speed up package loading (via moving packages not
134            strictly necessary for normal operation to Suggests instead of
135            Depends).
136    
137    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
138    
139            * R/reader.R (readNewsgroup): The date format is now configurable.
140    
141    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
142    
143            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
144    
145    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
146    
147            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
148    
149    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
150    
151            * R/source.R (DataframeSource): New source class for data frames.
152    
153            * R/source.R: Fixed non-standard call evaluation.
154    
155    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
156    
157            * R/source.R (URISource): New source class for a single document.
158    
159    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
160    
161            * R/source.R: Refactoring.
162    
163    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
164    
165            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
166            Rmpi installations more gracefully.
167    
168    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
169    
170            * R/source.R (Source): Add Length slot.
171    
172    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
173    
174            * R/AAA.R: Unify duplicated .onLoad function.
175    
176    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
177    
178            * DESCRIPTION (Suggests): Added Rmpi.
179    
180    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
181    
182            * R/source.R (getElem): Fix 'no visible binding' warning.
183    
184            * man/WeightFunction.Rd: Fix signature.
185    
186    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
187    
188            * R/weight.R: Introduce name abbreviations for weighting functions.
189    
190    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
191    
192            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
193    
194            * R/cluster.R: Provide convenience functions for using a MPI
195            cluster.
196    
197            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
198            available.
199    
200            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
201            available.
202    
203    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
204    
205            * R/textdoccol.R (lapply): Removed debug print out.
206    
207    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
208    
209            * R/reader.R (readRCV1): Improved meta data extraction from
210            Reuters Corpus Volume 1 documents.
211    
212    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * R/transform.R: Ensure that all mappings preserve multiline
215            structures.
216    
217    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
218    
219            * R/filter.R: Every filter has now an attribute indicating whether
220            it sould be applied to document level (doclevel).
221    
222            * R/textdoccol.R (tmFilter): Set searchFullText as new default
223            filter.
224    
225    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/transform.R (replacePatterns): Replaced removeWords by
228            replacePatterns. Suggested by Christian Buchta.
229    
230            * R/textdoccol.R (inspect): Improved formatting.
231    
232    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
233    
234            * inst/CITATION: Updated JSS article information.
235    
236            * R/textdoccol.R (setAs): Added coerce method from list to
237            corpus.
238    
239            * R/meta.R (meta): Improved meta data handling.
240    
241    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
244            Christian Buchta.
245    
246            * inst/CITATION: Added template to include JSS article reference.
247    
248    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
249    
250            * R/textdoccol.R (tmMap): Introduced lazy mapping.
251    
252            * R/source.R: Added VectorSource.
253    
254    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * man/: Language codes should be in ISO 639-1 format.
257    
258            * R/textdoccol.R (asPlain): Preserve local meta data.
259    
260    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * R/textdoccol.R (writeCorpus): Function for writing a corpus
263            containing plain text documents to disk.
264    
265    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
268            always set correctly.
269    
270            * R/textdoccol.R: Set load = TRUE as default for load on demand
271            since in most cases this is the wanted behaviour.
272    
273    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
276    
277            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
278    
279    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
280    
281            * R/meta.R (meta): New function for consistent access to meta data
282            of document collections, repositories, and texts.
283    
284    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/: Better support for encodings.
287    
288    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
291            selection when no reader argument is given.
292    
293    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/source.R (CSVSource): Now uses read.csv instead of scan
296            internally.
297    
298  2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300          * R/reader.R (getReaders): Returns available reader functions.          * R/reader.R (getReaders): Returns available reader functions.

Legend:
Removed from v.806  
changed lines
  Added in v.982

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge