[tm] Diff of /pkg/ChangeLog

From: trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC
To: pkg/ChangeLog revision 930, Sat Apr 11 08:49:37 2009 UTC
# Line 1  Line 1 
1    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
2    
3            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
4    
5    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/transform.R (tmReduce): Combine multiple maps into one
8            transformation.
9    
10    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
11    
12            * R/weight.R: Remove weightLogical since it does not return a
13            dgCMatrix.
14    
15            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
16            or TermDocumentMatrix instead.
17    
18    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
19    
20            * inst/doc/extensions.Rnw: Finished vignette.
21    
22    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
23    
24            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
25            DocumentTermMatrix representations.
26    
27    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
28    
29            * R/reader.R (readXML): New reader for arbitrary XML files.
30    
31    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
32    
33            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
34            (XMLSource): New XMLSource class for arbitrary XML files.
35            (Source): New slot Vectorized.
36    
37    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
38    
39            * R/reader.R (readCustom): Experimental reader which can be
40            customized via user-defined mappings.
41    
42            * R/reader.R: Always use UTC time zone.
43    
44            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
45    
46    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
47    
48            * R/reader.R (readDOC): Options can be passed over to antiword.
49    
50            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
51            pdftotext.
52    
53    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
54    
55            * R/source.R (DirSource): Add pattern and ignore.case arguments
56            which are internally passed over to list.files().
57    
58    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
59    
60            * inst/doc/tm.Rnw: Suppress pointless loading message.
61    
62    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
63    
64            * DESCRIPTION: Speed up package loading (via moving packages not
65            strictly necessary for normal operation to Suggests instead of
66            Depends).
67    
68    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
69    
70            * R/reader.R (readNewsgroup): The date format is now configurable.
71    
72    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
73    
74            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
75    
76    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
77    
78            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
79    
80    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
81    
82            * R/source.R (DataframeSource): New source class for data frames.
83    
84            * R/source.R: Fixed non-standard call evaluation.
85    
86    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
87    
88            * R/source.R (URISource): New source class for a single document.
89    
90    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
91    
92            * R/source.R: Refactoring.
93    
94    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
95    
96            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
97            Rmpi installations more gracefully.
98    
99    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
100    
101            * R/source.R (Source): Add Length slot.
102    
103    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
104    
105            * R/AAA.R: Unify duplicated .onLoad function.
106    
107    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
108    
109            * DESCRIPTION (Suggests): Added Rmpi.
110    
111    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
112    
113            * R/source.R (getElem): Fix 'no visible binding' warning.
114    
115            * man/WeightFunction.Rd: Fix signature.
116    
117    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
118    
119            * R/weight.R: Introduce name abbreviations for weighting functions.
120    
121    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
122    
123            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
124    
125            * R/cluster.R: Provide convenience functions for using an MPI
126            cluster.
127    
128            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
129            available.
130    
131            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
132            available.
133    
134    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
135    
136            * R/textdoccol.R (lapply): Removed debug print out.
137    
138    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
139    
140            * R/reader.R (readRCV1): Improved meta data extraction from
141            Reuters Corpus Volume 1 documents.
142    
143    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
144    
145            * R/transform.R: Ensure that all mappings preserve multiline
146            structures.
147    
148    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
149    
150            * R/filter.R: Every filter now has an attribute (doclevel)
151            indicating whether it should be applied at the document level.
152    
153            * R/textdoccol.R (tmFilter): Set searchFullText as new default
154            filter.
155    
156    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
157    
158            * R/transform.R (replacePatterns): Replaced removeWords by
159            replacePatterns. Suggested by Christian Buchta.
160    
161            * R/textdoccol.R (inspect): Improved formatting.
162    
163    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
164    
165            * inst/CITATION: Updated JSS article information.
166    
167            * R/textdoccol.R (setAs): Added coerce method from list to
168            corpus.
169    
170            * R/meta.R (meta): Improved meta data handling.
171    
172    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
175            Christian Buchta.
176    
177            * inst/CITATION: Added template to include JSS article reference.
178    
179    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * R/textdoccol.R (tmMap): Introduced lazy mapping.
182    
183            * R/source.R: Added VectorSource.
184    
185    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
186    
187            * man/: Language codes should be in ISO 639-1 format.
188    
189            * R/textdoccol.R (asPlain): Preserve local meta data.
190    
191    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
192    
193            * R/textdoccol.R (writeCorpus): Function for writing a corpus
194            containing plain text documents to disk.
195    
196    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
197    
198            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
199            always set correctly.
200    
201            * R/textdoccol.R: Set load = TRUE as default for load on demand
202            since in most cases this is the wanted behaviour.
203    
204    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
205    
206            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
207    
208            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
209    
210    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * R/meta.R (meta): New function for consistent access to meta data
213            of document collections, repositories, and texts.
214    
215    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
216    
217            * R/: Better support for encodings.
218    
219    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
220    
221            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
222            selection when no reader argument is given.
223    
224    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
225    
226            * R/source.R (CSVSource): Now uses read.csv instead of scan
227            internally.
228    
229    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
230    
231            * R/reader.R (getReaders): Returns available reader functions.
232    
233            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
234            as default.
235    
236    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
237    
238            * R/stopwords.R (stopwords): Shortened code, removed codetools
239            variable warnings.
240    
241            * man/: Documentation for showMeta, added an example for tmMap.
242    
243            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
244            some minor typos fixed.
245    
246    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
247    
248            * R/aobjects.R (showMeta): Added method for pretty printing a
249            text document's meta data.
250    
251    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
252    
253            * R/textdoccol.R (TextDocCol): Better handling of empty
254            arguments.
255    
256            * NAMESPACE: Exported readDOC.
257    
258            * man/completeStems.Rd: Added an example.
259    
260    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * R/stopwords.R (stopwords): Look up .dat files at every
263            call. Allows users to modify stopword .dat files interactively.
264    
265    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/termdocmatrix.R (termFreq): Correct processing of empty
268            documents.
269    
270    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
271    
272            * man/: Updated documentation.
273    
274    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
275    
276            * R/complete.R (completeStems): Completes (heuristically) word
277            stems.
278    
279            * R/termdocmatrix.R (TermDocMatrix2): New modular
280            constructor.
281    
282            * NAMESPACE: Exported termFreq.
283    
284    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/reader.R (readDOC): Added MS Word reader (using antiword).
287    
288    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * R/weight.R: Weighting functions for TermDocMatrix.
291    
292    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
293    
294            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
295            functions for accessing dimension, column, and row names.
296    
297            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
298    
299    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
302    
303    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
304    
305            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
306    
307    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
308    
309            * R/reader.R (readPDF): Removed manual checks for pdftotext and
310            pdfinfo. The system call gives a warning anyway.
311    
312    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
313    
314            * R/textdoccol.R (asPlain): Conversion from
315            StructuredTextDocuments to PlainTextDocuments.
316    
317    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
320            for accessing term-document matrices.
321    
322            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
323            are installed.
324    
325    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
326    
327            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
328            Christian Buchta.
329    
330    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
333    
334    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
337    
338            * R/reader.R (readPDF): Added PDF reader.
339    
340    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
341    
342            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
343    
344            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
345    
346            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
347    
348            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
349    
350    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
351    
352            * R/distmeasure.R (dissimilarity): Replaced dists call from
353            package cba by new dist call from package proxy.
354    
355    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
358    
359    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
360    
361            * R/termdocmatrix.R: require() uses the quietly option to suppress
362            loading messages.
363    
364    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
365    
366            * R/dictionary.R: Added dictionary support.
367    
368    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
369    
370            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
371            documents. This simplifies some functions, e.g., asPlain.
372    
373    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
374    
375            * inst/doc/tm.Rnw: Fixed some typos in vignette.
376    
377    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
378    
379            * R/textdoccol.R (replaceWords): Added method to replace a set of
380            words by a single word. Useful for synonyms.
381    
382    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
383    
384            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
385    
386    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
387    
388            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
389            vectors. Thanks to Ariel Maguyon for his error report.
390            (removeSparseTerms): New function to remove columns from a
391            term-document matrix exceeding a sparse factor.
392    
393    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
394    
395            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
396    
397    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
398    
399            * man/sFilter.Rd: Corrected documentation on statement format (use
400            '==' instead of '=').
401    
402    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
403    
404            * R/aobjects.R (StructuredTextDocument): Inherits from
405            TextDocument.
406    
407    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
410            on sparse matrices as proposed by Martin Maechler.
411    
412    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the
415            latest \pkg{filehash} version deprecates them.
416    
417    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
418    
419            * R/termdocmatrix.R (textvector): Stemming is now performed before
420            erasing stopwords.
421            (weightMatrix): Adapted to handle sparse matrices.
422            (TermDocMatrix): Sparse matrix is now efficiently built by
423            direct stepwise insertion of row values into it.
424    
425    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
426    
427            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
428            due to ongoing problems. For our purposes the latter is as useful
429            as the replaced package.
430    
431    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
432    
433            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
434    
435            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
436    
437    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
440            languages with available stopwords.
441    
442    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
443    
444            * inst/doc/tm.Rnw: Minor corrections in the vignette.
445    
446    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
447    
448            * DESCRIPTION: Update to version 0.2, since a lot of new features
449            have been integrated.
450    
451            * inst/stopwords: Updated existing stopwords and added stopwords
452            for various other languages.
453    
454    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
455    
456            * man/: Updated documentation.
457    
458            * Work/testDb.R: Script to test database stuff.
459    
460            * R/: Fixed various database related bugs. Seems to be rather
461            usable now, i.e., consider it alpha status for now.
462    
463    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
464    
465            * R/: Fixed some bugs related to database support.
466    
467    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
468    
469            * man/: Added a lot of examples to the manuals.
470    
471    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
472    
473            * man/: Updated parts of the documentation.
474    
475            * R/textdoccol.R (asPlain): Added conversion from newsgroup
476            documents to plain text documents.
477    
478    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
479    
480            * R/textdoccol.R: Finished experimental database support. Not yet
481            intensively tested.
482    
483            * R/source.R: Now each source has a default reader.
484    
485            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
486            class anymore.
487    
488            * R/plaintextdoc.R: Custom show method for plain text documents.
489    
490            * R/aobjects.R: Added a class for structured text documents.
491    
492            * R/reader.R: Replaced remaining \code{parser} occurrences with
493            \code{reader}.
494    
495            * R/textdoccol.R (summary): Indent tags.
496    
497            * R/textdoccol.R (removePunctuation): Transform method to remove
498            punctuation marks.
499    
500    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
501    
502            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
503            using prescindMeta().
504    
505    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
506    
507            * R/textdoccol.R: Improved database support.
508    
509    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
510    
511            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
512    
513            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
514            language code.
515    
516            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
517            into parserControl argument.
518    
519            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
520    
521    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
522    
523            * Work/tmDataSetup.R: The datasets acq and crude can now be
524            created on the fly.
525    
526            * R/stopwords.R: Introduced a function returning the stopwords for
527            a given language (English, German and French at the moment)
528    
529            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
530            otherwise falls back to Snowball package.
531    
532    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
533    
534            * man/dissimilarity-methods.Rd: Make clear that any method offered
535            by "dists" from package "cba" can be used.
536    
537    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
538    
539            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
540            to Kurt's latex suggestion. Removed points and underscores in
541            variable names for consistent naming.
542    
543            * DESCRIPTION: Update to version 0.1-2.
544    
545            * man/TextRepository.Rd: Fixed bug in documentation.
546    
547    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * DESCRIPTION: Update to version 0.1-1.
550    
551    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
552    
553            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
554            wordStem.
555    
556    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
557    
558            * R/: Changes due to Kurt's review.
559    
560    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
561    
562            * R/: Implemented improvements based upon comments by David
563            Meyer.
564    
565    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
566    
567            * inst/doc/: Rewrote vignette.
568    
569            * man/: Improved documentation.
570    
571    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
572    
573            * man/: Updated documentation.
574    
575            * DESCRIPTION: Changed package name to "tm". Updated version to
576            0.1 for first CRAN release.
577    
578            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
579            list archive example.
580    
581            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
582            archive example.
583    
584            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
585            from (several mails per box) mbox format to (single mail per file)
586            eml format.
587    
588    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
589    
590            * data/crude.rda: Rebuilt.
591    
592            * data/acq.rda: Rebuilt.
593    
594            * R/reader.R: Factored out reader and parser methods from
595            textdoccol.R.
596    
597            * R/source.R: Factored out Source methods from aobjects.R and
598            textdoccol.R.
599            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
600            feeds.
601    
602            * R/textdoccol.R (DirSource): Added support for recursive
603            traversal of directories.
604    
605    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
606    
607            * R/textdoccol.R ([[): Loads the document corpus automatically
608            into memory upon access.
609            (tm_transform, tm_filter): Removed several checks whether the
610            document is already loaded ([[ ensures this now).
611            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
612            mailing list archive.
613    
614    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
615    
616            * R/aobjects.R (TextDocument): Is now a virtual class.
617            (Source): Is now a virtual class.
618    
619    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
620    
621            * R/textdoccol.R (c): Support for an arbitrary number of document
622            collections.
623    
624    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
625    
626            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
627            append_meta and remove_meta.
628    
629            * R/textdoccol.R: Removed modify_metadata method.
630    
631            * R/textrepo.R: Removed modify_metadata method.
632    
633            * R/textdoccol.R (remove_meta): Supports removal of document
634            collection metadata and document (= in data frame) metadata.
635    
636    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
637    
638            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
639    
640            * data/crude.rda: Rebuilt.
641    
642            * data/acq.rda: Rebuilt.
643    
644            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
645    
646            * R/textdoccol.R ([): Bug fix for subsetting a document
647            collection's data frame.
648    
649    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
650    
651            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
652            to s_filter.
653    
654            * R/textdoccol.R: Local text documents' metadata can now be copied
655            to a document collection's data frame with prescind_meta.
656    
657    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
658    
659            * R/: Text documents' slot metadata is now accessible in s_filter.
660    
661            * R/: Rewrote s_filter function (still has some restrictions).
662    
663    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
664    
665            * R/: Various fixes in handling metadata.
666    
667            * R/: Added update mechanism for text document collections.
668    
669    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
670    
671            * R/: Merging of document collections now creates a binary tree
672            for reconstructing merged document collections.
673    
674            * R/: Redesign of metadata for document collections.
675    
676    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
677    
678            * R/: Messages now use \code{ngettext}.
679    
680    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
681    
682            * R/: Added functions for modifying and removing metadata.
683    
684    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
685    
686            * man/: Updated some documentation.
687    
688            * R/: Corrected some connection issues.
689    
690            * inst/doc: Worked on the vignette.
691    
692    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
693    
694            * inst/: Added texts and started vignette.
695    
696            * R/: Final changes based upon David's comments.
697    
698    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
699    
700            * NAMESPACE: Corrected exports (generic methods need exportMethods
701            directives!).
702    
703    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
704    
705            * R/: Modified the TextDocCol constructor and various parsers. It
706            is now modular and supports various file formats via plugins (see
707            the new "Source" class).
708    
709    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * man/: Revised documentation after previous code changes.
712    
713    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * R/: Remaining changes as discussed with David.
716    
717    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
718    
719            * R/: Some changes as suggested by David. The rest will follow
720            within the next days.
721    
722    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
723    
724            * man/: Finished documentation.
725    
726    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
727    
728            * man/: Wrote some documentation.
729    
730    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
731    
732            * R/: Further syntactic sugar in form of additional assignment and
733            accessor methods.
734    
735    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
736    
737            * R/: Syntactic sugar in form of "length", "show" and "summary"
738            operators.
739    
740    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
741    
742            * R/: Diverse updates. Mainly on default operators ("[" or "c")
743            and dissimilarities.
744    
745    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
746    
747            * R/: Added similarity functions.
748    
749            * data/: Added English stopwords.
750    
751    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * data/: Examples compiled for new features
754    
755            * R/: Changes due to new structure.
756    
757            * NAMESPACE: Corrected namespace to reflect new structure.
758    
759            * R/termdocmatrix.R: Adapted for new naming scheme.
760    
761    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
762    
763            * R/textdoccol.R: Adapted code for new class structure. Wrote
764            several transform and filter functions operating on text document
765            collections (alias text document databases).
766    
767            * R/aobjects.R: Adapted class structure with inheritance,
768            repositories and additional meta data. Loading files on demand is
769            now possible.
770    
771    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
772    
773            * R/: Some cosmetic cleanups.
774    
775            * inst/: Removed vignette on clustering. That and much more is now
776            described in the JSS paper on text mining. Based upon that
777            article an elaborated vignette will be incorporated in the future.
778    
779    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
780    
781            * R/: Updated generic S4 methods to comply with signature changes
782            in newer versions of R (> 2.3)
783    
784    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
785    
786            * ext/R/importRIS.R: Automatic RIS import is now possible.
787    
788    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
789    
790            * R/textdoccol.R: Added RIS HTML input format.
791    
792    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
793    
794            * R/textdoccol.R: Removed bug that caused invalid text document
795            collections when handling many input files.
796    
797    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
798    
799            * R/textdoccol.R: Restructured and extended file import
