[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC pkg/ChangeLog revision 938, Sat Apr 25 19:05:50 2009 UTC
# Line 1  Line 1 
1    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/termdocmatrix.R: No longer use Matrix package. This improves
4            package loading significantly.
5    
6    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
7    
8            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
9    
10    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
11    
12            * R/transform.R (tmReduce): Combine multiple maps into one
13            transformation.
14    
15    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
16    
17            * R/weight.R: Remove weightLogical since it does not return a
18            dgCMatrix.
19    
20            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
21            or TermDocumentMatrix instead.
22    
23    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
24    
25            * inst/doc/extensions.Rnw: Finished vignette.
26    
27    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
28    
29            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
30            DocumentTermMatrix representations.
31    
32    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
33    
34            * R/reader.R (readXML): New reader for arbitrary XML files.
35    
36    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
39            (XMLSource): New XMLSource class for arbitrary XML files.
40            (Source): New slot Vectorized.
41    
42    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
43    
44            * R/reader.R (readCustom): Experimental reader which can be
45            customized via user-defined mappings.
46    
47            * R/reader.R: Always use UTC time zone.
48    
49            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
50    
51    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
52    
53            * R/reader.R (readDOC): Options can be passed over to antiword.
54    
55            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
56            pdftotext.
57    
58    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
59    
60            * R/source.R (DirSource): Add pattern and ignore.case arguments
61            which are internally passed over to list.files().
62    
63    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
64    
65            * inst/doc/tm.Rnw: Suppress pointless loading message.
66    
67    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
68    
69            * DESCRIPTION: Speed up package loading (via moving packages not
70            strictly necessary for normal operation to Suggests instead of
71            Depends).
72    
73    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/reader.R (readNewsgroup): The date format is now configurable.
76    
77    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
78    
79            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
80    
81    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
82    
83            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
84    
85    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
86    
87            * R/source.R (DataframeSource): New source class for data frames.
88    
89            * R/source.R: Fixed non-standard call evaluation.
90    
91    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
92    
93            * R/source.R (URISource): New source class for a single document.
94    
95    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
96    
97            * R/source.R: Refactoring.
98    
99    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
100    
101            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
102            Rmpi installations more gracefully.
103    
104    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
105    
106            * R/source.R (Source): Add Length slot.
107    
108    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
109    
110            * R/AAA.R: Unify duplicated .onLoad function.
111    
112    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
113    
114            * DESCRIPTION (Suggests): Added Rmpi.
115    
116    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
117    
118            * R/source.R (getElem): Fix 'no visible binding' warning.
119    
120            * man/WeightFunction.Rd: Fix signature.
121    
122    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
123    
124            * R/weight.R: Introduce name abbreviations for weighting functions.
125    
126    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
127    
128            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
129    
130            * R/cluster.R: Provide convenience functions for using an MPI
131            cluster.
132    
133            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
134            available.
135    
136            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
137            available.
138    
139    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
140    
141            * R/textdoccol.R (lapply): Removed debug print out.
142    
143    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
144    
145            * R/reader.R (readRCV1): Improved meta data extraction from
146            Reuters Corpus Volume 1 documents.
147    
148    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
149    
150            * R/transform.R: Ensure that all mappings preserve multiline
151            structures.
152    
153    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * R/filter.R: Every filter now has an attribute indicating whether
156            it should be applied at document level (doclevel).
157    
158            * R/textdoccol.R (tmFilter): Set searchFullText as new default
159            filter.
160    
161    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
162    
163            * R/transform.R (replacePatterns): Replaced removeWords by
164            replacePatterns. Suggested by Christian Buchta.
165    
166            * R/textdoccol.R (inspect): Improved formatting.
167    
168    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
169    
170            * inst/CITATION: Updated JSS article information.
171    
172            * R/textdoccol.R (setAs): Added coerce method from list to
173            corpus.
174    
175            * R/meta.R (meta): Improved meta data handling.
176    
177    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
178    
179            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
180            Christian Buchta.
181    
182            * inst/CITATION: Added template to include JSS article reference.
183    
184    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
185    
186            * R/textdoccol.R (tmMap): Introduced lazy mapping.
187    
188            * R/source.R: Added VectorSource.
189    
190    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
191    
192            * man/: Language codes should be in ISO 639-1 format.
193    
194            * R/textdoccol.R (asPlain): Preserve local meta data.
195    
196    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
197    
198            * R/textdoccol.R (writeCorpus): Function for writing a corpus
199            containing plain text documents to disk.
200    
201    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
202    
203            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
204            always set correctly.
205    
206            * R/textdoccol.R: Set load = TRUE as default for load on demand
207            since in most cases this is the wanted behaviour.
208    
209    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
212    
213            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
214    
215    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
216    
217            * R/meta.R (meta): New function for consistent access to meta data
218            of document collections, repositories, and texts.
219    
220    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * R/: Better support for encodings.
223    
224    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
225    
226            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
227            selection when no reader argument is given.
228    
229    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
230    
231            * R/source.R (CSVSource): Now uses read.csv instead of scan
232            internally.
233    
234    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
235    
236            * R/reader.R (getReaders): Returns available reader functions.
237    
238            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
239            as default.
240    
241    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * R/stopwords.R (stopwords): Shortened code, removed codetools
244            variable warnings.
245    
246            * man/: Documentation for showMeta, added an example for tmMap.
247    
248            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
249            some minor typos fixed.
250    
251    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
252    
253            * R/aobjects.R (showMeta): Added method for pretty printing a
254            text document's meta data.
255    
256    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
257    
258            * R/textdoccol.R (TextDocCol): Better handling of empty
259            arguments.
260    
261            * NAMESPACE: Exported readDOC.
262    
263            * man/completeStems.Rd: Added an example.
264    
265    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/stopwords.R (stopwords): Look up .dat files at every
268            call. Allows users to modify stopword .dat files interactively.
269    
270    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
271    
272            * R/termdocmatrix.R (termFreq): Correct processing of empty
273            documents.
274    
275    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * man/: Updated documentation.
278    
279    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
280    
281            * R/complete.R (completeStems): Completes (heuristically) word
282            stems.
283    
284            * R/termdocmatrix.R (TermDocMatrix2): New modular
285            constructor.
286    
287            * NAMESPACE: Exported termFreq.
288    
289    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/reader.R (readDOC): Added MS Word reader (using antiword).
292    
293    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/weight.R: Weighting functions for TermDocMatrix.
296    
297    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
298    
299            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
300            functions for accessing dimension, column, and row names.
301    
302            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
303    
304    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
307    
308    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
309    
310            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
311    
312    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
313    
314            * R/reader.R (readPDF): Removed manual checks for pdftotext and
315            pdfinfo. The system call gives a warning anyway.
316    
317    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/textdoccol.R (asPlain): Conversion from
320            StructuredTextDocuments to PlainTextDocuments.
321    
322    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
325            for accessing term-document matrices.
326    
327            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
328            are installed.
329    
330    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
333            Christian Buchta.
334    
335    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
336    
337            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
338    
339    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
340    
341            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
342    
343            * R/reader.R (readPDF): Added PDF reader.
344    
345    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
346    
347            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
348    
349            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
350    
351            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
352    
353            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
354    
355    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * R/distmeasure.R (dissimilarity): Replaced dists call from
358            package cba by new dist call from package proxy.
359    
360    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
363    
364    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
365    
366            * R/termdocmatrix.R: require() uses the quietly option to suppress
367            loading messages.
368    
369    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
370    
371            * R/dictionary.R: Added dictionary support.
372    
373    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
374    
375            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
376            documents. This simplifies some functions, e.g., asPlain.
377    
378    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
379    
380            * inst/doc/tm.Rnw: Fixed some typos in vignette.
381    
382    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
383    
384            * R/textdoccol.R (replaceWords): Added method to replace a set of
385            words by a single word. Useful for synonyms.
386    
387    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
388    
389            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
390    
391    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
392    
393            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
394            vectors. Thanks to Ariel Maguyon for his error report.
395            (removeSparseTerms): New function to remove columns from a
396            term-document matrix exceeding a sparse factor.
397    
398    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
399    
400            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
401    
402    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
403    
404            * man/sFilter.Rd: Corrected documentation on statement format (use
405            '==' instead of '=').
406    
407    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/aobjects.R (StructuredTextDocument): Inherits from
410            TextDocument.
411    
412    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
415            on sparse matrices as proposed by Martin Maechler.
416    
417    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
418    
419            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the
420            latest \pkg{filehash} version deprecates them.
421    
422    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * R/termdocmatrix.R (textvector): Stemming is now performed before
425            erasing stopwords.
426            (weightMatrix): Adapted to handle sparse matrices.
427            (TermDocMatrix): Sparse matrix is now efficiently built by
428            direct stepwise insertion of row values into it.
429    
430    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
431    
432            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
433            due to ongoing problems. For our purposes the latter is as useful
434            as the replaced package.
435    
436    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
437    
438            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
439    
440            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
441    
442    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
443    
444            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
445            languages with available stopwords.
446    
447    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
448    
449            * inst/doc/tm.Rnw: Minor corrections in the vignette.
450    
451    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
452    
453            * DESCRIPTION: Update to version 0.2, since a lot of new features
454            have been integrated.
455    
456            * inst/stopwords: Updated existing stopwords and added stopwords
457            for various other languages.
458    
459    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
460    
461            * man/: Updated documentation.
462    
463            * Work/testDb.R: Script to test database stuff.
464    
465            * R/: Fixed various database related bugs. Seems to be rather
466            useable now, i.e., consider as alpha status for now.
467    
468    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
469    
470            * R/: Fixed some bugs related to database support.
471    
472    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
473    
474            * man/: Added a lot of examples to the manuals.
475    
476    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
477    
478            * man/: Updated parts of the documentation.
479    
480            * R/textdoccol.R (asPlain): Added conversion from newsgroup
481            documents to plain text documents.
482    
483    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
484    
485            * R/textdoccol.R: Finished experimental database support. Not yet
486            intensively tested.
487    
488            * R/source.R: Now each source has a default reader.
489    
490            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
491            class anymore.
492    
493            * R/plaintextdoc.R: Custom show method for plain text documents.
494    
495            * R/aobjects.R: Added a class for structured text documents.
496    
497            * R/reader.R: Replaced remaining \code{parser} occurrences with
498            \code{reader}.
499    
500            * R/textdoccol.R (summary): Indent tags.
501    
502            * R/textdoccol.R (removePunctuation): Transform method to remove
503            punctuation marks.
504    
505    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
506    
507            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
508            using prescindMeta().
509    
510    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
511    
512            * R/textdoccol.R: Improved database support.
513    
514    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
515    
516            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
517    
518            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
519            language code.
520    
521            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
522            into parserControl argument.
523    
524            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
525    
526    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
527    
528            * Work/tmDataSetup.R: The datasets acq and crude can now be
529            created on the fly.
530    
531            * R/stopwords.R: Introduced a function returning the stopwords for
532            a given language (English, German and French at the moment).
533    
534            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
535            otherwise falls back to Snowball package.
536    
537    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
538    
539            * man/dissimilarity-methods.Rd: Make clear that any method offered
540            by "dists" from package "cba" can be used.
541    
542    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
543    
544            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
545            to Kurt's LaTeX suggestion. Removed points and underscores in
546            variable names for consistent naming.
547    
548            * DESCRIPTION: Update to version 0.1-2.
549    
550            * man/TextRepository.Rd: Fixed bug in documentation.
551    
552    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
553    
554            * DESCRIPTION: Update to version 0.1-1.
555    
556    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
557    
558            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
559            wordStem.
560    
561    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
562    
563            * R/: Changes due to Kurt's review.
564    
565    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
566    
567            * R/: Implemented improvements based upon comments by David
568            Meyer.
569    
570    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
571    
572            * inst/doc/: Rewrote vignette.
573    
574            * man/: Improved documentation.
575    
576    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
577    
578            * man/: Updated documentation.
579    
580            * DESCRIPTION: Changed package name to "tm". Updated version to
581            0.1 for first CRAN release.
582    
583            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
584            list archive example.
585    
586            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
587            archive example.
588    
589            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
590            from (several mails per box) mbox format to (single mail per file)
591            eml format.
592    
593    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
594    
595            * data/crude.rda: Rebuilt.
596    
597            * data/acq.rda: Rebuilt.
598    
599            * R/reader.R: Factored out reader and parser methods from
600            textdoccol.R.
601    
602            * R/source.R: Factored out Source methods from aobjects.R and
603            textdoccol.R.
604            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
605            feeds.
606    
607            * R/textdoccol.R (DirSource): Added support for recursive
608            traversal of directories.
609    
610    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
611    
612            * R/textdoccol.R ([[): Loads the document corpus automatically
613            into memory upon access.
614            (tm_transform, tm_filter): Removed several checks whether the
615            document is already loaded ([[ ensures this now).
616            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
617            mailing list archive.
618    
619    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
620    
621            * R/aobjects.R (TextDocument): Is now a virtual class.
622            (Source): Is now a virtual class.
623    
624    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
625    
626            * R/textdoccol.R (c): Support for an arbitrary number of document
627            collections.
628    
629    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
630    
631            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
632            append_meta and remove_meta.
633    
634            * R/textdoccol.R: Removed modify_metadata method.
635    
636            * R/textrepo.R: Removed modify_metadata method.
637    
638            * R/textdoccol.R (remove_meta): Supports removal of document
639            collection metadata and document (= in data frame) metadata.
640    
641    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
642    
643            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
644    
645            * data/crude.rda: Rebuilt.
646    
647            * data/acq.rda: Rebuilt.
648    
649            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
650    
651            * R/textdoccol.R ([): Bug fix for subsetting a document
652            collection's data frame.
653    
654    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
657            to s_filter.
658    
659            * R/textdoccol.R: Local text documents' metadata can now be copied
660            to a document collection's data frame with prescind_meta.
661    
662    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
663    
664            * R/: Text documents' slot metadata is now accessible in s_filter.
665    
666            * R/: Rewrote s_filter function (has still some restrictions).
667    
668    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
669    
670            * R/: Various fixes in handling metadata.
671    
672            * R/: Added update mechanism for text document collections.
673    
674    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
675    
676            * R/: Merging of document collections now creates a binary tree
677            for reconstructing merged document collections.
678    
679            * R/: Redesign of metadata for document collections.
680    
681    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
682    
683            * R/: Messages now use \code{ngettext}.
684    
685    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
686    
687            * R/: Added functions for modifying and removing metadata.
688    
689    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
690    
691            * man/: Updated some documentation.
692    
693            * R/: Corrected some connection issues.
694    
695            * inst/doc: Worked on the vignette.
696    
697    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
698    
699            * inst/: Added texts and started vignette.
700    
701            * R/: Final changes based upon David's comments.
702    
703    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
704    
705            * NAMESPACE: Corrected exports (generic methods need exportMethods
706            directives!).
707    
708    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
709    
710            * R/: Modified the TextDocCol constructor and various parsers. It
711            is now modular and supports various file formats via plugins (see
712            the new "Source" class).
713    
714    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
715    
716            * man/: Revised documentation after previous code changes.
717    
718    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
719    
720            * R/: Remaining changes as discussed with David.
721    
722    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
723    
724            * R/: Some changes as suggested by David. The rest will follow
725            within the next days.
726    
727    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
728    
729            * man/: Finished documentation.
730    
731    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
732    
733            * man/: Wrote some documentation.
734    
735    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
736    
737            * R/: Further syntactic sugar in form of additional assignment and
738            accessor methods.
739    
740    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
741    
742            * R/: Syntactic sugar in form of "length", "show" and "summary"
743            operators.
744    
745    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
746    
747            * R/: Diverse updates. Mainly on default operators ("[" or "c")
748            and dissimilarities.
749    
750    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
751    
752            * R/: Added similarity functions.
753    
754            * data/: Added English stopwords.
755    
756    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
757    
758            * data/: Examples compiled for new features
759    
760            * R/: Changes due to new structure.
761    
762            * NAMESPACE: Corrected namespace to reflect new structure.
763    
764            * R/termdocmatrix.R: Adapted for new naming scheme.
765    
766    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
767    
768            * R/textdoccol.R: Adapted code for new class structure. Wrote
769            several transform and filter functions operating on text document
770            collections (alias text document databases).
771    
772            * R/aobjects.R: Adapted class structure with inheritance,
773            repositories and additional meta data. Loading files on demand is
774            now possible.
775    
776    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
777    
778            * R/: Some cosmetic cleanups.
779    
780            * inst/: Removed vignette on clustering. That and much more is now
781            described in the JSS paper on text mining. Based upon that
782            article an elaborated vignette will be incorporated in the future.
783    
784    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
785    
786            * R/: Updated generic S4 methods to comply with signature changes
787            in newer versions of R (> 2.3)
788    
789    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
790    
791            * ext/R/importRIS.R: Automatic RIS import is now possible.
792    
793    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
794    
795            * R/textdoccol.R: Added RIS HTML input format.
796    
797    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
798    
799            * R/textdoccol.R: Removed bug that caused invalid text document
800            collections when handling many input files.
801    
802    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
803    
804            * R/textdoccol.R: Restructured and extended file import
805            mechanism.
806    
807            * inst/doc/clustering.Rnw: Adapted vignette for use with
808            ReutNews.rda
809    
810            * man/ReutNews.Rd: Documentation for ReutNews.rda
811    
812            * data/ReutNews.rda: A tiny Reuters21578 example data set.
813    
814    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
815    
816            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
817            clustering facilities of this package.
818    
819    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
820    
821            * R/aobjects.R: Changed package document structure to avoid class
822            dependency problems.
823    
824    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
825    
826            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
827            data set.
828    
829            *  Finished documentation and reordered directory structure. Now "R
830            CMD check textmin" works without errors.
831    
832    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
833    
834            * src/: Various splits can now be easily created for the
835            Reuters21578 data set.
836    
837    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
838    
839            *  Updated documentation
840    
841    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
842    
843            *  Wrote R documentation for some classes and methods.
844    
845    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
846    
847            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
848            files. See the questionnaire data/Umfrage.csv for such an example.
849            We are now able to import files in Reuters-21578 XML format.
850    
851            *  Changed class interfaces in various files. Weighting of the text
852            matrix is now possible.
853    
854    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
855    
856            * R/textdoccol.R: One can build term-document matrices if
857            necessary (with buildTDM(...)) and fill the field tdm from a text
858            document collection with it.
859    
860            * R/textmatrix.R: Wrote S4 class for term-document matrices.
861    
862    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
863    
864            * R/textdoccol.R: We now can read in a whole XML file with several
865            news items.
866    
867    2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
868    
869            * R/textdoccol.R: Set up an S4 class for a collection of text
