SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 28, Tue Dec 6 13:46:33 2005 UTC pkg/ChangeLog revision 945, Mon May 4 10:57:01 2009 UTC
# Line 1  Line 1 
1    2009-05-04   Ingo Feinerer <feinerer@logic.at>
2    
3            * NAMESPACE: Export some simple_triplet_matrix functions.
4    
5    2009-04-28   Ingo Feinerer <feinerer@logic.at>
6    
7            * R/weight.R: Adapt tf-idf to new matrix format.
8    
9    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
10    
11            * R/matrix.R: Create two distinct classes for term-document and
12            document-term matrices.
13    
14    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
15    
16            * R/termdocmatrix.R: No longer use Matrix package. This reduces
17            package start-up time significantly.
18    
19    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
20    
21            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
22    
23    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
24    
25            * R/transform.R (tmReduce): Combine multiple maps into one
26            transformation.
27    
28    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
29    
30            * R/weight.R: Remove weightLogical since it does not return a
31            dgCMatrix.
32    
33            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
34            or TermDocumentMatrix instead.
35    
36    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
37    
38            * inst/doc/extensions.Rnw: Finished vignette.
39    
40    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
43            DocumentTermMatrix representations.
44    
45    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
46    
47            * R/reader.R (readXML): New reader for arbitrary XML files.
48    
49    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
50    
51            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
52            (XMLSource): New XMLSource class for arbitrary XML files.
53            (Source): New slot Vectorized.
54    
55    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/reader.R (readTabular): Experimental reader for tabular data
58            structures which can be customized via user-defined mappings.
59    
60            * R/reader.R: Always use UTC time zone.
61    
62            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
63    
64    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
65    
66            * R/reader.R (readDOC): Options can be passed over to antiword.
67    
68            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
69            pdftotext.
70    
71    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
72    
73            * R/source.R (DirSource): Add pattern and ignore.case arguments
74            which are internally passed over to list.files().
75    
76    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
77    
78            * inst/doc/tm.Rnw: Suppress pointless loading message.
79    
80    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
81    
82            * DESCRIPTION: Speed up package loading (via moving packages not
83            strictly necessary for normal operation to Suggests instead of
84            Depends).
85    
86    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
87    
88            * R/reader.R (readNewsgroup): The date format is now configurable.
89    
90    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
91    
92            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
93    
94    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
95    
96            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
97    
98    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
99    
100            * R/source.R (DataframeSource): New source class for data frames.
101    
102            * R/source.R: Fixed non-standard call evaluation.
103    
104    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
105    
106            * R/source.R (URISource): New source class for a single document.
107    
108    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
109    
110            * R/source.R: Refactoring.
111    
112    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
113    
114            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
115            Rmpi installations more gracefully.
116    
117    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
118    
119            * R/source.R (Source): Add Length slot.
120    
121    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
122    
123            * R/AAA.R: Unify duplicated .onLoad function.
124    
125    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
126    
127            * DESCRIPTION (Suggests): Added Rmpi.
128    
129    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
130    
131            * R/source.R (getElem): Fix 'no visible binding' warning.
132    
133            * man/WeightFunction.Rd: Fix signature.
134    
135    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
136    
137            * R/weight.R: Introduce name abbreviations for weighting functions.
138    
139    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
140    
141            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
142    
143            * R/cluster.R: Provide convenience functions for using a MPI
144            cluster.
145    
146            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
147            available.
148    
149            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
150            available.
151    
152    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
153    
154            * R/textdoccol.R (lapply): Removed debug print out.
155    
156    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
157    
158            * R/reader.R (readRCV1): Improved meta data extraction from
159            Reuters Corpus Volume 1 documents.
160    
161    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
162    
163            * R/transform.R: Ensure that all mappings preserve multiline
164            structures.
165    
166    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
167    
168            * R/filter.R: Every filter has now an attribute indicating whether
169            it sould be applied to document level (doclevel).
170    
171            * R/textdoccol.R (tmFilter): Set searchFullText as new default
172            filter.
173    
174    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
175    
176            * R/transform.R (replacePatterns): Replaced removeWords by
177            replacePatterns. Suggested by Christian Buchta.
178    
179            * R/textdoccol.R (inspect): Improved formatting.
180    
181    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * inst/CITATION: Updated JSS article information.
184    
185            * R/textdoccol.R (setAs): Added coerce method from list to
186            corpus.
187    
188            * R/meta.R (meta): Improved meta data handling.
189    
190    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
191    
192            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
193            Christian Buchta.
194    
195            * inst/CITATION: Added template to include JSS article reference.
196    
197    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
198    
199            * R/textdoccol.R (tmMap): Introduced lazy mapping.
200    
201            * R/source.R: Added VectorSource.
202    
203    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
204    
205            * man/: Language codes should be in ISO 639-1 format.
206    
207            * R/textdoccol.R (asPlain): Preserve local meta data.
208    
209    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/textdoccol.R (writeCorpus): Function for writing a corpus
212            containing plain text documents to disk.
213    
214    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
217            always set correctly.
218    
219            * R/textdoccol.R: Set load = TRUE as default for load on demand
220            since in most cases this is the wanted behaviour.
221    
222    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
225    
226            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
227    
228    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * R/meta.R (meta): New function for consistent access to meta data
231            of document collections, repositories, and texts.
232    
233    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/: Better support for encodings.
236    
237    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
238    
239            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
240            selection when no reader argument is given.
241    
242    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
243    
244            * R/source.R (CSVSource): Now uses read.csv instead of scan
245            internally.
246    
247    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
248    
249            * R/reader.R (getReaders): Returns available reader functions.
250    
251            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
252            as default.
253    
254    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * R/stopwords.R (stopwords): Shortened code, removed codetools
257            variable warnings.
258    
259            * man/: Documentation for showMeta, added an example for tmMap.
260    
261            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
262            some minor typos fixed.
263    
264    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
265    
266            * R/aobjects.R (showMeta): Added method for pretty printing a
267            text document's meta data.
268    
269    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
270    
271            * R/textdoccol.R (TextDocCol): Better handling of empty
272            arguments.
273    
274            * NAMESPACE: Exported readDOC.
275    
276            * man/completeStems.Rd: Added an example.
277    
278    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
279    
280            * R/stopwords.R (stopwords): Look up .dat files at every
281            call. Allows users to modify stopword .dat files interactively.
282    
283    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
284    
285            * R/termdocmatrix.R (termFreq): Correct processing of empty
286            documents.
287    
288    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * man/: Updated documentation.
291    
292    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
293    
294            * R/complete.R (completeStems): Completes (heuristically) word
295            stems.
296    
297            * R/termdocmatrix.R (TermDocMatrix2): New modular
298            constructor.
299    
300            * NAMESPACE: Exported termFreq.
301    
302    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * R/reader.R (readDOC): Added MS Word reader (using antiword).
305    
306    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
307    
308            * R/weight.R: Weighting functions for TermDocMatrix.
309    
310    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
311    
312            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
313            functions for accessing dimension, column, and row names.
314    
315            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
316    
317    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
320    
321    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
322    
323            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
324    
325    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
326    
327            * R/reader.R (readPDF): Removed manual checks for pdftotext and
328            pdfinfo. The system call gives a warning anyway.
329    
330    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * R/textdoccol.R (asPlain): Conversion from
333            StructuredTextDocuments to PlainTextDocuments.
334    
335    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
336    
337            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
338            for accessing term-document matrices.
339    
340            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
341            are installed.
342    
343    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
346            Christian Buchta.
347    
348    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
349    
350            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
351    
352    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
355    
356            * R/reader.R (readPDF): Added PDF reader.
357    
358    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
359    
360            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
361    
362            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
363    
364            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
365    
366            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
367    
368    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
369    
370            * R/distmeasure.R (dissimilarity): Replaced dists call from
371            package cba by new dist call from package proxy.
372    
373    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
374    
375            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
376    
377    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
378    
379            * R/termdocmatrix.R: require() uses the quietly option to suppress
380            loading messages.
381    
382    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
383    
384            * R/dictionary.R: Added dictionary support.
385    
386    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
387    
388            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
389            documents. This simplifies some functions, e.g., asPlain.
390    
391    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
392    
393            * inst/doc/tm.Rnw: Fixed some typos in vignette.
394    
395    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
396    
397            * R/textdoccol.R (replaceWords): Added method to replace a set of
398            words by a single word. Useful for synonyms.
399    
400    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
401    
402            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
403    
404    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
405    
406            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
407            vectors. Thanks to Ariel Maguyon for his error report.
408            (removeSparseTerms): New function to remove columns from a
409            term-document matrix exceeding a sparse factor.
410    
411    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
412    
413            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
414    
415    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
416    
417            * man/sFilter.Rd: Corrected documentation on statement format (use
418            '==' instead of '=').
419    
420    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
421    
422            * R/aobjects.R (StructuredTextDocument): Inherits from
423            TextDocument.
424    
425    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
426    
427            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
428            on sparse matrices as proposed by Martin Maechler.
429    
430    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
431    
432            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
433            \pkg{filehash} version makes them deprecated.
434    
435    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
436    
437            * R/termdocmatrix.R (textvector): Stemming is now performed before
438            erasing stopwords.
439            (weightMatrix): Adapted to handle sparse matrices.
440            (TermDocMatrix): Sparse matrix is now efficiently built by
441            direct stepwise insertion of row values into it.
442    
443    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
444    
445            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
446            due to ongoing problems. For our purposes the latter is as useful
447            as the replaced package.
448    
449    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
450    
451            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
452    
453            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
454    
455    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
456    
457            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
458            languages with available stopwords.
459    
460    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
461    
462            * inst/doc/tm.Rnw: Minor corrections in the vignette.
463    
464    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
465    
466            * DESCRIPTION: Update to version 0.2, since a lot of new features
467            have been integrated.
468    
469            * inst/stopwords: Updated existing stopwords and added stopwords
470            for various other languages.
471    
472    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
473    
474            * man/: Updated documentation.
475    
476            * Work/testDb.R: Script to test database stuff.
477    
478            * R/: Fixed various database related bugs. Seems to be rather
479            useable now, i.e., consider as alpha status for now.
480    
481    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
482    
483            * R/: Fixed some bugs related to database support.
484    
485    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
486    
487            * man/: Added a lot of examples to the manuals.
488    
489    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
490    
491            * man/: Updated parts of the documentation.
492    
493            * R/textdoccol.R (asPlain): Added conversion from newsgroup
494            documents to plain text documents.
495    
496    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
497    
498            * R/textdoccol.R: Finished experimental database support. Not yet
499            intensively tested.
500    
501            * R/source.R: Now each source has a default reader.
502    
503            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
504            class anymore.
505    
506            * R/plaintextdoc.R: Custom show method for plain text documents.
507    
508            * R/aobjects.R: Added a class for structured text documents.
509    
510            * R/reader.R: Replaced remaining \code{parser} occurrences with
511            \code{reader}.
512    
513            * R/textdoccol.R (summary): Indent tags.
514    
515            * R/textdoccol.R (removePunctuation): Transform method to remove
516            punctuation marks.
517    
518    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
519    
520            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
521            using prescindMeta().
522    
523    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
524    
525            * R/textdoccol.R: Improved database support.
526    
527    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
528    
529            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
530    
531            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
532            language code.
533    
534            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
535            into parserControl argument.
536    
537            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
538    
539    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
540    
541            * Work/tmDataSetup.R: The datasets acq and crude can now be
542            created on the fly.
543    
544            * R/stopwords.R: Introduced a function returning the stopwords for
545            a given language (English, German and French at the moment)
546    
547            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
548            otherwise falls back to Snowball package.
549    
550    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
551    
552            * man/dissimilarity-methods.Rd: Make clear that any method offered
553            by "dists" from package "cba" can be used.
554    
555    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
556    
557            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
558            to Kurt's latex suggestion. Removed points and underscores in
559            variable names for consistent naming.
560    
561            * DESCRIPTION: Update to version 0.1-2.
562    
563            * man/TextRepository.Rd: Fixed bug in documentation.
564    
565    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
566    
567            * DESCRIPTION: Update to version 0.1-1.
568    
569    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
570    
571            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
572            wordStem.
573    
574    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
575    
576            * R/: Changes due to Kurt's review.
577    
578    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * R/: Implemented improvements based upon comments by David
581            Meyer.
582    
583    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
584    
585            * inst/doc/: Rewrote vignette.
586    
587            * man/: Improved documentation.
588    
589    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
590    
591            * man/: Updated documentation.
592    
593            * DESCRIPTION: Changed package name to "tm". Updated version to
594            0.1 for first CRAN release.
595    
596            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
597            list archive example.
598    
599            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
600            archive example.
601    
602            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
603            from (several mails per box) mbox format to (single mail per file)
604            eml format.
605    
606    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
607    
608            * data/crude.rda: Rebuilt.
609    
610            * data/acq.rda: Rebuilt.
611    
612            * R/reader.R: Factored out reader and parser methods from
613            textdoccol.R.
614    
615            * R/source.R: Factored out Source methods from aobjects.R and
616            textdoccol.R.
617            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
618            feeds.
619    
620            * R/textdoccol.R (DirSource): Added support for recursive
621            traversal of directories.
622    
623    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
624    
625            * R/textdoccol.R ([[): Loads the document corpus automatically
626            into memory upon access.
627            (tm_transform, tm_filter): Removed several checks whether the
628            document is already loaded ([[ ensures this now).
629            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
630            mailing list archive.
631    
632    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
633    
634            * R/aobjects.R (TextDocument): Is now a virtual class.
635            (Source): Is now a virtual class.
636    
637    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
638    
639            * R/textdoccol.R (c): Support for an arbitrary number of document
640            collections.
641    
642    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
643    
644            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
645            append_meta and remove_meta.
646    
647            * R/textdoccol.R: Removed modify_metadata method.
648    
649            * R/textrepo.R: Removed modify_metadata method.
650    
651            * R/textdoccol.R (remove_meta): Supports removal of document
652            collection metadata and document (= in data frame) metadata.
653    
654    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
657    
658            * data/crude.rda: Rebuilt.
659    
660            * data/acq.rda: Rebuilt.
661    
662            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
663    
664            * R/textdoccol.R ([): Bug fix for subsetting a document
665            collection's data frame.
666    
667    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
668    
669            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
670            to s_filter.
671    
672            * R/textdoccol.R: Local text documents' metadata can now be copied
673            to a document collection's data frame with prescind_meta.
674    
675    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
676    
677            * R/: Text documents' slot metadata is now accessible in s_filter.
678    
679            * R/: Rewrote s_filter function (has still some restrictions).
680    
681    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
682    
683            * R/: Various fixes in handling metadata.
684    
685            * R/: Added update mechanism for text document collections.
686    
687    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
688    
689            * R/: Merging of document collections now creates a binary tree
690            for reconstructing merged document collections.
691    
692            * R/: Redesign of metadata for document collections.
693    
694    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
695    
696            * R/: Messages now use \code{ngettext}.
697    
698    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
699    
700            * R/: Added functions for modifying and removing metadata.
701    
702    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
703    
704            * man/: Updated some documentation.
705    
706            * R/: Corrected some connection issues.
707    
708            * inst/doc: Worked on the vignette.
709    
710    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
711    
712            * inst/: Added texts and started vignette.
713    
714            * R/: Final changes based upon David's comments.
715    
716    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
717    
718            * NAMESPACE: Corrected exports (generic methods need exportMethods
719            directives!).
720    
721    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
722    
723            * R/: Modified the TextDocCol constructur and various parsers. It
724            is now modular and supports various file formats via plugins (see
725            the new "Source" class).
726    
727    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
728    
729            * man/: Revised documentation after previous code changes.
730    
731    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
732    
733            * R/: Remaining changes as discussed with David.
734    
735    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
736    
737            * R/: Some changes as suggested by David. The rest will follow
738            within the next days.
739    
740    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
741    
742            * man/: Finished documentation.
743    
744    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
745    
746            * man/: Wrote some documentation.
747    
748    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
749    
750            * R/: Further syntactic sugar in form of additional assignment and
751            accessor methods.
752    
753    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
754    
755            * R/: Syntactic sugar in form of "length", "show" and "summary"
756            operators.
757    
758    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
759    
760            * R/: Diverse updates. Mainly on default operators ("[" or "c")
761            and dissimilarities.
762    
763    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
764    
765            * R/: Added similarity functions.
766    
767            * data/: Added english stopwords.
768    
769    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
770    
771            * data/: Examples compiled for new features
772    
773            * R/: Changes due to new structure.
774    
775            * NAMESPACE: Corrected namespace to reflect new structure.
776    
777            * R/termdocmatrix.R: Adapted for new naming scheme.
778    
779    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
780    
781            * R/textdoccol.R: Adapted code for new class structure. Wrote
782            several transform and filter functions operating on text document
783            collections (alias text document databases).
784    
785            * R/aobjects.R: Adapted class structure with inheritance,
786            repositories and additional meta data. Loading files on demand is
787            now possible.
788    
789    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
790    
791            * R/: Some cosmetic cleanups.
792    
793            * inst/: Removed vignette on clustering. That and much more is now
794            described in the JSS paper on text mining. Based upon that
795            article an elaborated vignette will be incorporated in the future.
796    
797    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
798    
799            * R/: Updated generic S4 methods to comply with signature changes
800            in newer versions of R (> 2.3)
801    
802    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
803    
804            * ext/R/importRIS.R: Automatic RIS import is now possible.
805    
806    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
807    
808            * R/textdoccol.R: Added RIS HTML input format.
809    
810    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
811    
812            * R/textdoccol.R: Removed bug that caused invalid text document
813            collections when handling many input files.
814    
815    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
816    
817            * R/textdoccol.R: Restructured and extended file import
818            mechanism.
819    
820            * inst/doc/clustering.Rnw: Adapted vignette for use with
821            ReutNews.rda
822    
823            * man/ReutNews.Rd: Documentation for ReutNews.rda
824    
825            * data/ReutNews.rda: A tiny Reuters21578 example data set.
826    
827    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
828    
829            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
830            clustering facilities of this package.
831    
832    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
833    
834            * R/aobjects.R: Changed package document structure to avoid class
835            dependency problems.
836    
837  2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
838    
839            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
840            data set.
841    
842          * Finished documentation and reordered directory structure. Now "R          * Finished documentation and reordered directory structure. Now "R
843          CMD check textmin" works without errors.          CMD check textmin" works without errors.
844    

Legend:
Removed from v.28  
changed lines
  Added in v.945

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge