SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC pkg/ChangeLog revision 946, Wed May 13 18:07:35 2009 UTC
# Line 1  Line 1 
1    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/corpus.R: Make corpus virtual. Implement corpus with standard
4            and permanent storage semantics.
5    
6            * DESCRIPTION: New major release. A *lot* of improvements.
7    
8    2009-05-04   Ingo Feinerer <feinerer@logic.at>
9    
10            * NAMESPACE: Export some simple_triplet_matrix functions.
11    
12    2009-04-28   Ingo Feinerer <feinerer@logic.at>
13    
14            * R/weight.R: Adapt tf-idf to new matrix format.
15    
16    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
17    
18            * R/matrix.R: Create two distinct classes for term-document and
19            document-term matrices.
20    
21    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
22    
23            * R/termdocmatrix.R: No longer use Matrix package. This reduces
24            package start-up time significantly.
25    
26    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
27    
28            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
29    
30    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
31    
32            * R/transform.R (tmReduce): Combine multiple maps into one
33            transformation.
34    
35    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
36    
37            * R/weight.R: Remove weightLogical since it does not return a
38            dgCMatrix.
39    
40            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
41            or TermDocumentMatrix instead.
42    
43    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
44    
45            * inst/doc/extensions.Rnw: Finished vignette.
46    
47    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
48    
49            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
50            DocumentTermMatrix representations.
51    
52    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
53    
54            * R/reader.R (readXML): New reader for arbitrary XML files.
55    
56    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
57    
58            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
59            (XMLSource): New XMLSource class for arbitrary XML files.
60            (Source): New slot Vectorized.
61    
62    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
63    
64            * R/reader.R (readTabular): Experimental reader for tabular data
65            structures which can be customized via user-defined mappings.
66    
67            * R/reader.R: Always use UTC time zone.
68    
69            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
70    
71    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
72    
73            * R/reader.R (readDOC): Options can be passed over to antiword.
74    
75            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
76            pdftotext.
77    
78    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
79    
80            * R/source.R (DirSource): Add pattern and ignore.case arguments
81            which are internally passed over to list.files().
82    
83    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
84    
85            * inst/doc/tm.Rnw: Suppress pointless loading message.
86    
87    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
88    
89            * DESCRIPTION: Speed up package loading (via moving packages not
90            strictly necessary for normal operation to Suggests instead of
91            Depends).
92    
93    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
94    
95            * R/reader.R (readNewsgroup): The date format is now configurable.
96    
97    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
98    
99            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
100    
101    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
102    
103            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
104    
105    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
106    
107            * R/source.R (DataframeSource): New source class for data frames.
108    
109            * R/source.R: Fixed non-standard call evaluation.
110    
111    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
112    
113            * R/source.R (URISource): New source class for a single document.
114    
115    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
116    
117            * R/source.R: Refactoring.
118    
119    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
120    
121            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
122            Rmpi installations more gracefully.
123    
124    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
125    
126            * R/source.R (Source): Add Length slot.
127    
128    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
129    
130            * R/AAA.R: Unify duplicated .onLoad function.
131    
132    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
133    
134            * DESCRIPTION (Suggests): Added Rmpi.
135    
136    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
137    
138            * R/source.R (getElem): Fix 'no visible binding' warning.
139    
140            * man/WeightFunction.Rd: Fix signature.
141    
142    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
143    
144            * R/weight.R: Introduce name abbreviations for weighting functions.
145    
146    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
147    
148            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
149    
150            * R/cluster.R: Provide convenience functions for using a MPI
151            cluster.
152    
153            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
154            available.
155    
156            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
157            available.
158    
159    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
160    
161            * R/textdoccol.R (lapply): Removed debug print out.
162    
163    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
164    
165            * R/reader.R (readRCV1): Improved meta data extraction from
166            Reuters Corpus Volume 1 documents.
167    
168    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
169    
170            * R/transform.R: Ensure that all mappings preserve multiline
171            structures.
172    
173    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
174    
175            * R/filter.R: Every filter has now an attribute indicating whether
176            it sould be applied to document level (doclevel).
177    
178            * R/textdoccol.R (tmFilter): Set searchFullText as new default
179            filter.
180    
181    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * R/transform.R (replacePatterns): Replaced removeWords by
184            replacePatterns. Suggested by Christian Buchta.
185    
186            * R/textdoccol.R (inspect): Improved formatting.
187    
188    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
189    
190            * inst/CITATION: Updated JSS article information.
191    
192            * R/textdoccol.R (setAs): Added coerce method from list to
193            corpus.
194    
195            * R/meta.R (meta): Improved meta data handling.
196    
197    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
198    
199            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
200            Christian Buchta.
201    
202            * inst/CITATION: Added template to include JSS article reference.
203    
204    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
205    
206            * R/textdoccol.R (tmMap): Introduced lazy mapping.
207    
208            * R/source.R: Added VectorSource.
209    
210    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * man/: Language codes should be in ISO 639-1 format.
213    
214            * R/textdoccol.R (asPlain): Preserve local meta data.
215    
216    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
217    
218            * R/textdoccol.R (writeCorpus): Function for writing a corpus
219            containing plain text documents to disk.
220    
221    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
222    
223            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
224            always set correctly.
225    
226            * R/textdoccol.R: Set load = TRUE as default for load on demand
227            since in most cases this is the wanted behaviour.
228    
229    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
230    
231            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
232    
233            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
234    
235    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
236    
237            * R/meta.R (meta): New function for consistent access to meta data
238            of document collections, repositories, and texts.
239    
240    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
241    
242            * R/: Better support for encodings.
243    
244    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
247            selection when no reader argument is given.
248    
249    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
250    
251            * R/source.R (CSVSource): Now uses read.csv instead of scan
252            internally.
253    
254    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * R/reader.R (getReaders): Returns available reader functions.
257    
258            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
259            as default.
260    
261    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * R/stopwords.R (stopwords): Shortened code, removed codetools
264            variable warnings.
265    
266            * man/: Documentation for showMeta, added an example for tmMap.
267    
268            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
269            some minor typos fixed.
270    
271    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/aobjects.R (showMeta): Added method for pretty printing a
274            text document's meta data.
275    
276    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
277    
278            * R/textdoccol.R (TextDocCol): Better handling of empty
279            arguments.
280    
281            * NAMESPACE: Exported readDOC.
282    
283            * man/completeStems.Rd: Added an example.
284    
285    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
286    
287            * R/stopwords.R (stopwords): Look up .dat files at every
288            call. Allows users to modify stopword .dat files interactively.
289    
290    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
291    
292            * R/termdocmatrix.R (termFreq): Correct processing of empty
293            documents.
294    
295    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * man/: Updated documentation.
298    
299    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * R/complete.R (completeStems): Completes (heuristically) word
302            stems.
303    
304            * R/termdocmatrix.R (TermDocMatrix2): New modular
305            constructor.
306    
307            * NAMESPACE: Exported termFreq.
308    
309    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * R/reader.R (readDOC): Added MS Word reader (using antiword).
312    
313    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
314    
315            * R/weight.R: Weighting functions for TermDocMatrix.
316    
317    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
320            functions for accessing dimension, column, and row names.
321    
322            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
323    
324    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
325    
326            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
327    
328    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
329    
330            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
331    
332    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
333    
334            * R/reader.R (readPDF): Removed manual checks for pdftotext and
335            pdfinfo. The system call gives a warning anyway.
336    
337    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * R/textdoccol.R (asPlain): Conversion from
340            StructuredTextDocuments to PlainTextDocuments.
341    
342    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
343    
344            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
345            for accessing term-document matrices.
346    
347            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
348            are installed.
349    
350    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
351    
352            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
353            Christian Buchta.
354    
355    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
358    
359    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
360    
361            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
362    
363            * R/reader.R (readPDF): Added PDF reader.
364    
365    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
368    
369            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
370    
371            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
372    
373            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
374    
375    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
376    
377            * R/distmeasure.R (dissimilarity): Replaced dists call from
378            package cba by new dist call from package proxy.
379    
380    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
381    
382            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
383    
384    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
385    
386            * R/termdocmatrix.R: require() uses the quietly option to suppress
387            loading messages.
388    
389    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
390    
391            * R/dictionary.R: Added dictionary support.
392    
393    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
394    
395            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
396            documents. This simplifies some functions, e.g., asPlain.
397    
398    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
399    
400            * inst/doc/tm.Rnw: Fixed some typos in vignette.
401    
402    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
403    
404            * R/textdoccol.R (replaceWords): Added method to replace a set of
405            words by a single word. Useful for synonyms.
406    
407    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
410    
411    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
412    
413            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
414            vectors. Thanks to Ariel Maguyon for his error report.
415            (removeSparseTerms): New function to remove columns from a
416            term-document matrix exceeding a sparse factor.
417    
418    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
419    
420            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
421    
422    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * man/sFilter.Rd: Corrected documentation on statement format (use
425            '==' instead of '=').
426    
427    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
428    
429            * R/aobjects.R (StructuredTextDocument): Inherits from
430            TextDocument.
431    
432    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
433    
434            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
435            on sparse matrices as proposed by Martin Maechler.
436    
437    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
440            \pkg{filehash} version makes them deprecated.
441    
442    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
443    
444            * R/termdocmatrix.R (textvector): Stemming is now performed before
445            erasing stopwords.
446            (weightMatrix): Adapted to handle sparse matrices.
447            (TermDocMatrix): Sparse matrix is now efficiently built by
448            direct stepwise insertion of row values into it.
449    
450    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
451    
452            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
453            due to ongoing problems. For our purposes the latter is as useful
454            as the replaced package.
455    
456    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
457    
458            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
459    
460            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
461    
462    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
463    
464            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
465            languages with available stopwords.
466    
467    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
468    
469            * inst/doc/tm.Rnw: Minor corrections in the vignette.
470    
471    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
472    
473            * DESCRIPTION: Update to version 0.2, since a lot of new features
474            have been integrated.
475    
476            * inst/stopwords: Updated existing stopwords and added stopwords
477            for various other languages.
478    
479    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
480    
481            * man/: Updated documentation.
482    
483            * Work/testDb.R: Script to test database stuff.
484    
485            * R/: Fixed various database related bugs. Seems to be rather
486            useable now, i.e., consider as alpha status for now.
487    
488    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
489    
490            * R/: Fixed some bugs related to database support.
491    
492    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
493    
494            * man/: Added a lot of examples to the manuals.
495    
496    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
497    
498            * man/: Updated parts of the documentation.
499    
500            * R/textdoccol.R (asPlain): Added conversion from newsgroup
501            documents to plain text documents.
502    
503    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
504    
505            * R/textdoccol.R: Finished experimental database support. Not yet
506            intensively tested.
507    
508            * R/source.R: Now each source has a default reader.
509    
510            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
511            class anymore.
512    
513            * R/plaintextdoc.R: Custom show method for plain text documents.
514    
515            * R/aobjects.R: Added a class for structured text documents.
516    
517            * R/reader.R: Replaced remaining \code{parser} occurrences with
518            \code{reader}.
519    
520            * R/textdoccol.R (summary): Indent tags.
521    
522            * R/textdoccol.R (removePunctuation): Transform method to remove
523            punctuation marks.
524    
525    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
526    
527            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
528            using prescindMeta().
529    
530    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * R/textdoccol.R: Improved database support.
533    
534    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
535    
536            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
537    
538            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
539            language code.
540    
541            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
542            into parserControl argument.
543    
544            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
545    
546    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
547    
548            * Work/tmDataSetup.R: The datasets acq and crude can now be
549            created on the fly.
550    
551            * R/stopwords.R: Introduced a function returning the stopwords for
552            a given language (English, German and French at the moment)
553    
554            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
555            otherwise falls back to Snowball package.
556    
557    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
558    
559            * man/dissimilarity-methods.Rd: Make clear that any method offered
560            by "dists" from package "cba" can be used.
561    
562    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
563    
564            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
565            to Kurt's latex suggestion. Removed points and underscores in
566            variable names for consistent naming.
567    
568            * DESCRIPTION: Update to version 0.1-2.
569    
570            * man/TextRepository.Rd: Fixed bug in documentation.
571    
572    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
573    
574            * DESCRIPTION: Update to version 0.1-1.
575    
576    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
577    
578            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
579            wordStem.
580    
581    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
582    
583            * R/: Changes due to Kurt's review.
584    
585    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
586    
587            * R/: Implemented improvements based upon comments by David
588            Meyer.
589    
590    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
591    
592            * inst/doc/: Rewrote vignette.
593    
594            * man/: Improved documentation.
595    
596    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
597    
598            * man/: Updated documentation.
599    
600            * DESCRIPTION: Changed package name to "tm". Updated version to
601            0.1 for first CRAN release.
602    
603            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
604            list archive example.
605    
606            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
607            archive example.
608    
609            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
610            from (several mails per box) mbox format to (single mail per file)
611            eml format.
612    
613    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
614    
615            * data/crude.rda: Rebuilt.
616    
617            * data/acq.rda: Rebuilt.
618    
619            * R/reader.R: Factored out reader and parser methods from
620            textdoccol.R.
621    
622            * R/source.R: Factored out Source methods from aobjects.R and
623            textdoccol.R.
624            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
625            feeds.
626    
627            * R/textdoccol.R (DirSource): Added support for recursive
628            traversal of directories.
629    
630    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
631    
632            * R/textdoccol.R ([[): Loads the document corpus automatically
633            into memory upon access.
634            (tm_transform, tm_filter): Removed several checks whether the
635            document is already loaded ([[ ensures this now).
636            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
637            mailing list archive.
638    
639    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
640    
641            * R/aobjects.R (TextDocument): Is now a virtual class.
642            (Source): Is now a virtual class.
643    
644    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
645    
646            * R/textdoccol.R (c): Support for an arbitrary number of document
647            collections.
648    
649    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
650    
651            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
652            append_meta and remove_meta.
653    
654            * R/textdoccol.R: Removed modify_metadata method.
655    
656            * R/textrepo.R: Removed modify_metadata method.
657    
658            * R/textdoccol.R (remove_meta): Supports removal of document
659            collection metadata and document (= in data frame) metadata.
660    
661    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
662    
663            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
664    
665            * data/crude.rda: Rebuilt.
666    
667            * data/acq.rda: Rebuilt.
668    
669            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
670    
671            * R/textdoccol.R ([): Bug fix for subsetting a document
672            collection's data frame.
673    
674    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
675    
676            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
677            to s_filter.
678    
679            * R/textdoccol.R: Local text documents' metadata can now be copied
680            to a document collection's data frame with prescind_meta.
681    
682    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
683    
684            * R/: Text documents' slot metadata is now accessible in s_filter.
685    
686            * R/: Rewrote s_filter function (has still some restrictions).
687    
688    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
689    
690            * R/: Various fixes in handling metadata.
691    
692            * R/: Added update mechanism for text document collections.
693    
694    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
695    
696            * R/: Merging of document collections now creates a binary tree
697            for reconstructing merged document collections.
698    
699            * R/: Redesign of metadata for document collections.
700    
701    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
702    
703            * R/: Messages now use \code{ngettext}.
704    
705    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
706    
707            * R/: Added functions for modifying and removing metadata.
708    
709    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * man/: Updated some documentation.
712    
713            * R/: Corrected some connection issues.
714    
715            * inst/doc: Worked on the vignette.
716    
717    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
718    
719            * inst/: Added texts and started vignette.
720    
721            * R/: Final changes based upon David's comments.
722    
723    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
724    
725            * NAMESPACE: Corrected exports (generic methods need exportMethods
726            directives!).
727    
728    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
729    
730            * R/: Modified the TextDocCol constructur and various parsers. It
731            is now modular and supports various file formats via plugins (see
732            the new "Source" class).
733    
734    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
735    
736            * man/: Revised documentation after previous code changes.
737    
738    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
739    
740            * R/: Remaining changes as discussed with David.
741    
742    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
743    
744            * R/: Some changes as suggested by David. The rest will follow
745            within the next days.
746    
747    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
748    
749            * man/: Finished documentation.
750    
751    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * man/: Wrote some documentation.
754    
755    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
756    
757            * R/: Further syntactic sugar in form of additional assignment and
758            accessor methods.
759    
760    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
761    
762            * R/: Syntactic sugar in form of "length", "show" and "summary"
763            operators.
764    
765    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
766    
767            * R/: Diverse updates. Mainly on default operators ("[" or "c")
768            and dissimilarities.
769    
770    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
771    
772            * R/: Added similarity functions.
773    
774            * data/: Added english stopwords.
775    
776    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
777    
778            * data/: Examples compiled for new features
779    
780            * R/: Changes due to new structure.
781    
782            * NAMESPACE: Corrected namespace to reflect new structure.
783    
784            * R/termdocmatrix.R: Adapted for new naming scheme.
785    
786    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
787    
788            * R/textdoccol.R: Adapted code for new class structure. Wrote
789            several transform and filter functions operating on text document
790            collections (alias text document databases).
791    
792            * R/aobjects.R: Adapted class structure with inheritance,
793            repositories and additional meta data. Loading files on demand is
794            now possible.
795    
796    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
797    
798            * R/: Some cosmetic cleanups.
799    
800            * inst/: Removed vignette on clustering. That and much more is now
801            described in the JSS paper on text mining. Based upon that
802            article an elaborated vignette will be incorporated in the future.
803    
804    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
805    
806            * R/: Updated generic S4 methods to comply with signature changes
807            in newer versions of R (> 2.3)
808    
809    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
810    
811            * ext/R/importRIS.R: Automatic RIS import is now possible.
812    
813    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
814    
815            * R/textdoccol.R: Added RIS HTML input format.
816    
817    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
818    
819            * R/textdoccol.R: Removed bug that caused invalid text document
820            collections when handling many input files.
821    
822  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
823    
824          * R/textdoccol.R: Restructured and extended file import          * R/textdoccol.R: Restructured and extended file import

Legend:
Removed from v.37  
changed lines
  Added in v.946

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge