SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC pkg/ChangeLog revision 904, Sat Mar 21 08:15:11 2009 UTC
# Line 1  Line 1 
1    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
4    
5    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/reader.R (readDOC): Options can be passed over to antiword.
8    
9            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
10            pdftotext.
11    
12    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
13    
14            * R/source.R (DirSource): Add pattern and ignore.case arguments
15            which are internally passed over to list.files().
16    
17    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
18    
19            * inst/doc/tm.Rnw: Suppress pointless loading message.
20    
21    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
22    
23            * DESCRIPTION: Speed up package loading (via moving packages not
24            strictly necessary for normal operation to Suggests instead of
25            Depends).
26    
27    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
28    
29            * R/reader.R (readNewsgroup): The date format is now configurable.
30    
31    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
32    
33            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
34    
35    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
36    
37            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
38    
39    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
40    
41            * R/source.R (DataframeSource): New source class for data frames.
42    
43            * R/source.R: Fixed non-standard call evaluation.
44    
45    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
46    
47            * R/source.R (URISource): New source class for a single document.
48    
49    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
50    
51            * R/source.R: Refactoring.
52    
53    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
54    
55            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
56            Rmpi installations more gracefully.
57    
58    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
59    
60            * R/source.R (Source): Add Length slot.
61    
62    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
63    
64            * R/AAA.R: Unify duplicated .onLoad function.
65    
66    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
67    
68            * DESCRIPTION (Suggests): Added Rmpi.
69    
70    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
71    
72            * R/source.R (getElem): Fix 'no visible binding' warning.
73    
74            * man/WeightFunction.Rd: Fix signature.
75    
76    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
77    
78            * R/weight.R: Introduce name abbreviations for weighting functions.
79    
80    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
81    
82            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
83    
84            * R/cluster.R: Provide convenience functions for using a MPI
85            cluster.
86    
87            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
88            available.
89    
90            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
91            available.
92    
93    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
94    
95            * R/textdoccol.R (lapply): Removed debug print out.
96    
97    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
98    
99            * R/reader.R (readRCV1): Improved meta data extraction from
100            Reuters Corpus Volume 1 documents.
101    
102    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
103    
104            * R/transform.R: Ensure that all mappings preserve multiline
105            structures.
106    
107    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
108    
109            * R/filter.R: Every filter has now an attribute indicating whether
110            it sould be applied to document level (doclevel).
111    
112            * R/textdoccol.R (tmFilter): Set searchFullText as new default
113            filter.
114    
115    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
116    
117            * R/transform.R (replacePatterns): Replaced removeWords by
118            replacePatterns. Suggested by Christian Buchta.
119    
120            * R/textdoccol.R (inspect): Improved formatting.
121    
122    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
123    
124            * inst/CITATION: Updated JSS article information.
125    
126            * R/textdoccol.R (setAs): Added coerce method from list to
127            corpus.
128    
129            * R/meta.R (meta): Improved meta data handling.
130    
131    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
132    
133            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
134            Christian Buchta.
135    
136            * inst/CITATION: Added template to include JSS article reference.
137    
138    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
139    
140            * R/textdoccol.R (tmMap): Introduced lazy mapping.
141    
142            * R/source.R: Added VectorSource.
143    
144    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
145    
146            * man/: Language codes should be in ISO 639-1 format.
147    
148            * R/textdoccol.R (asPlain): Preserve local meta data.
149    
150    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
151    
152            * R/textdoccol.R (writeCorpus): Function for writing a corpus
153            containing plain text documents to disk.
154    
155    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
156    
157            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
158            always set correctly.
159    
160            * R/textdoccol.R: Set load = TRUE as default for load on demand
161            since in most cases this is the wanted behaviour.
162    
163    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
164    
165            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
166    
167            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
168    
169    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
170    
171            * R/meta.R (meta): New function for consistent access to meta data
172            of document collections, repositories, and texts.
173    
174    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
175    
176            * R/: Better support for encodings.
177    
178    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
179    
180            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
181            selection when no reader argument is given.
182    
183    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
184    
185            * R/source.R (CSVSource): Now uses read.csv instead of scan
186            internally.
187    
188    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
189    
190            * R/reader.R (getReaders): Returns available reader functions.
191    
192            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
193            as default.
194    
195    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
196    
197            * R/stopwords.R (stopwords): Shortened code, removed codetools
198            variable warnings.
199    
200            * man/: Documentation for showMeta, added an example for tmMap.
201    
202            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
203            some minor typos fixed.
204    
205    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
206    
207            * R/aobjects.R (showMeta): Added method for pretty printing a
208            text document's meta data.
209    
210    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * R/textdoccol.R (TextDocCol): Better handling of empty
213            arguments.
214    
215            * NAMESPACE: Exported readDOC.
216    
217            * man/completeStems.Rd: Added an example.
218    
219    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
220    
221            * R/stopwords.R (stopwords): Look up .dat files at every
222            call. Allows users to modify stopword .dat files interactively.
223    
224    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
225    
226            * R/termdocmatrix.R (termFreq): Correct processing of empty
227            documents.
228    
229    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
230    
231            * man/: Updated documentation.
232    
233    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/complete.R (completeStems): Completes (heuristically) word
236            stems.
237    
238            * R/termdocmatrix.R (TermDocMatrix2): New modular
239            constructor.
240    
241            * NAMESPACE: Exported termFreq.
242    
243    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
244    
245            * R/reader.R (readDOC): Added MS Word reader (using antiword).
246    
247    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
248    
249            * R/weight.R: Weighting functions for TermDocMatrix.
250    
251    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
252    
253            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
254            functions for accessing dimension, column, and row names.
255    
256            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
257    
258    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
259    
260            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
261    
262    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
263    
264            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
265    
266    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
267    
268            * R/reader.R (readPDF): Removed manual checks for pdftotext and
269            pdfinfo. The system call gives a warning anyway.
270    
271    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/textdoccol.R (asPlain): Conversion from
274            StructuredTextDocuments to PlainTextDocuments.
275    
276    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
277    
278            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
279            for accessing term-document matrices.
280    
281            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
282            are installed.
283    
284    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
287            Christian Buchta.
288    
289    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
292    
293    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
296    
297            * R/reader.R (readPDF): Added PDF reader.
298    
299    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
302    
303            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
304    
305            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
306    
307            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
308    
309    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * R/distmeasure.R (dissimilarity): Replaced dists call from
312            package cba by new dist call from package proxy.
313    
314    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
315    
316            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
317    
318    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
319    
320            * R/termdocmatrix.R: require() uses the quietly option to suppress
321            loading messages.
322    
323    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
324    
325            * R/dictionary.R: Added dictionary support.
326    
327    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
328    
329            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
330            documents. This simplifies some functions, e.g., asPlain.
331    
332    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
333    
334            * inst/doc/tm.Rnw: Fixed some typos in vignette.
335    
336    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
337    
338            * R/textdoccol.R (replaceWords): Added method to replace a set of
339            words by a single word. Useful for synonyms.
340    
341    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
344    
345    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
346    
347            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
348            vectors. Thanks to Ariel Maguyon for his error report.
349            (removeSparseTerms): New function to remove columns from a
350            term-document matrix exceeding a sparse factor.
351    
352    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
355    
356    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * man/sFilter.Rd: Corrected documentation on statement format (use
359            '==' instead of '=').
360    
361    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
362    
363            * R/aobjects.R (StructuredTextDocument): Inherits from
364            TextDocument.
365    
366    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
367    
368            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
369            on sparse matrices as proposed by Martin Maechler.
370    
371    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
372    
373            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
374            \pkg{filehash} version makes them deprecated.
375    
376    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
377    
378            * R/termdocmatrix.R (textvector): Stemming is now performed before
379            erasing stopwords.
380            (weightMatrix): Adapted to handle sparse matrices.
381            (TermDocMatrix): Sparse matrix is now efficiently built by
382            direct stepwise insertion of row values into it.
383    
384    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
385    
386            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
387            due to ongoing problems. For our purposes the latter is as useful
388            as the replaced package.
389    
390    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
391    
392            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
393    
394            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
395    
396    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
397    
398            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
399            languages with available stopwords.
400    
401    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
402    
403            * inst/doc/tm.Rnw: Minor corrections in the vignette.
404    
405    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
406    
407            * DESCRIPTION: Update to version 0.2, since a lot of new features
408            have been integrated.
409    
410            * inst/stopwords: Updated existing stopwords and added stopwords
411            for various other languages.
412    
413    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
414    
415            * man/: Updated documentation.
416    
417            * Work/testDb.R: Script to test database stuff.
418    
419            * R/: Fixed various database related bugs. Seems to be rather
420            useable now, i.e., consider as alpha status for now.
421    
422    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * R/: Fixed some bugs related to database support.
425    
426    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
427    
428            * man/: Added a lot of examples to the manuals.
429    
430    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
431    
432            * man/: Updated parts of the documentation.
433    
434            * R/textdoccol.R (asPlain): Added conversion from newsgroup
435            documents to plain text documents.
436    
437    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * R/textdoccol.R: Finished experimental database support. Not yet
440            intensively tested.
441    
442            * R/source.R: Now each source has a default reader.
443    
444            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
445            class anymore.
446    
447            * R/plaintextdoc.R: Custom show method for plain text documents.
448    
449            * R/aobjects.R: Added a class for structured text documents.
450    
451            * R/reader.R: Replaced remaining \code{parser} occurrences with
452            \code{reader}.
453    
454            * R/textdoccol.R (summary): Indent tags.
455    
456            * R/textdoccol.R (removePunctuation): Transform method to remove
457            punctuation marks.
458    
459    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
460    
461            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
462            using prescindMeta().
463    
464    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
465    
466            * R/textdoccol.R: Improved database support.
467    
468    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
469    
470            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
471    
472            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
473            language code.
474    
475            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
476            into parserControl argument.
477    
478            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
479    
480    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
481    
482            * Work/tmDataSetup.R: The datasets acq and crude can now be
483            created on the fly.
484    
485            * R/stopwords.R: Introduced a function returning the stopwords for
486            a given language (English, German and French at the moment)
487    
488            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
489            otherwise falls back to Snowball package.
490    
491    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
492    
493            * man/dissimilarity-methods.Rd: Make clear that any method offered
494            by "dists" from package "cba" can be used.
495    
496    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
497    
498            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
499            to Kurt's latex suggestion. Removed points and underscores in
500            variable names for consistent naming.
501    
502            * DESCRIPTION: Update to version 0.1-2.
503    
504            * man/TextRepository.Rd: Fixed bug in documentation.
505    
506    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
507    
508            * DESCRIPTION: Update to version 0.1-1.
509    
510    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
511    
512            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
513            wordStem.
514    
515    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
516    
517            * R/: Changes due to Kurt's review.
518    
519    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
520    
521            * R/: Implemented improvements based upon comments by David
522            Meyer.
523    
524    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
525    
526            * inst/doc/: Rewrote vignette.
527    
528            * man/: Improved documentation.
529    
530    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * man/: Updated documentation.
533    
534            * DESCRIPTION: Changed package name to "tm". Updated version to
535            0.1 for first CRAN release.
536    
537            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
538            list archive example.
539    
540            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
541            archive example.
542    
543            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
544            from (several mails per box) mbox format to (single mail per file)
545            eml format.
546    
547    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * data/crude.rda: Rebuilt.
550    
551            * data/acq.rda: Rebuilt.
552    
553            * R/reader.R: Factored out reader and parser methods from
554            textdoccol.R.
555    
556            * R/source.R: Factored out Source methods from aobjects.R and
557            textdoccol.R.
558            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
559            feeds.
560    
561            * R/textdoccol.R (DirSource): Added support for recursive
562            traversal of directories.
563    
564    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
565    
566            * R/textdoccol.R ([[): Loads the document corpus automatically
567            into memory upon access.
568            (tm_transform, tm_filter): Removed several checks whether the
569            document is already loaded ([[ ensures this now).
570            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
571            mailing list archive.
572    
573    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
574    
575            * R/aobjects.R (TextDocument): Is now a virtual class.
576            (Source): Is now a virtual class.
577    
578    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * R/textdoccol.R (c): Support for an arbitrary number of document
581            collections.
582    
583    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
584    
585            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
586            append_meta and remove_meta.
587    
588            * R/textdoccol.R: Removed modify_metadata method.
589    
590            * R/textrepo.R: Removed modify_metadata method.
591    
592            * R/textdoccol.R (remove_meta): Supports removal of document
593            collection metadata and document (= in data frame) metadata.
594    
595    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
596    
597            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
598    
599            * data/crude.rda: Rebuilt.
600    
601            * data/acq.rda: Rebuilt.
602    
603            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
604    
605            * R/textdoccol.R ([): Bug fix for subsetting a document
606            collection's data frame.
607    
608    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
609    
610            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
611            to s_filter.
612    
613            * R/textdoccol.R: Local text documents' metadata can now be copied
614            to a document collection's data frame with prescind_meta.
615    
616    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
617    
618            * R/: Text documents' slot metadata is now accessible in s_filter.
619    
620            * R/: Rewrote s_filter function (has still some restrictions).
621    
622    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
623    
624            * R/: Various fixes in handling metadata.
625    
626            * R/: Added update mechanism for text document collections.
627    
628    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
629    
630            * R/: Merging of document collections now creates a binary tree
631            for reconstructing merged document collections.
632    
633            * R/: Redesign of metadata for document collections.
634    
635    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
636    
637            * R/: Messages now use \code{ngettext}.
638    
639    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
640    
641            * R/: Added functions for modifying and removing metadata.
642    
643    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
644    
645            * man/: Updated some documentation.
646    
647            * R/: Corrected some connection issues.
648    
649            * inst/doc: Worked on the vignette.
650    
651    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
652    
653            * inst/: Added texts and started vignette.
654    
655            * R/: Final changes based upon David's comments.
656    
657    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
658    
659            * NAMESPACE: Corrected exports (generic methods need exportMethods
660            directives!).
661    
662    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
663    
664            * R/: Modified the TextDocCol constructur and various parsers. It
665            is now modular and supports various file formats via plugins (see
666            the new "Source" class).
667    
668    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
669    
670            * man/: Revised documentation after previous code changes.
671    
672    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
673    
674            * R/: Remaining changes as discussed with David.
675    
676    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
677    
678            * R/: Some changes as suggested by David. The rest will follow
679            within the next days.
680    
681    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
682    
683            * man/: Finished documentation.
684    
685    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
686    
687            * man/: Wrote some documentation.
688    
689    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
690    
691            * R/: Further syntactic sugar in form of additional assignment and
692            accessor methods.
693    
694    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
695    
696            * R/: Syntactic sugar in form of "length", "show" and "summary"
697            operators.
698    
699    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
700    
701            * R/: Diverse updates. Mainly on default operators ("[" or "c")
702            and dissimilarities.
703    
704    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
705    
706            * R/: Added similarity functions.
707    
708            * data/: Added english stopwords.
709    
710    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
711    
712            * data/: Examples compiled for new features
713    
714            * R/: Changes due to new structure.
715    
716            * NAMESPACE: Corrected namespace to reflect new structure.
717    
718            * R/termdocmatrix.R: Adapted for new naming scheme.
719    
720    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
721    
722            * R/textdoccol.R: Adapted code for new class structure. Wrote
723            several transform and filter functions operating on text document
724            collections (alias text document databases).
725    
726            * R/aobjects.R: Adapted class structure with inheritance,
727            repositories and additional meta data. Loading files on demand is
728            now possible.
729    
730    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
731    
732            * R/: Some cosmetic cleanups.
733    
734            * inst/: Removed vignette on clustering. That and much more is now
735            described in the JSS paper on text mining. Based upon that
736            article an elaborated vignette will be incorporated in the future.
737    
738    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
739    
740            * R/: Updated generic S4 methods to comply with signature changes
741            in newer versions of R (> 2.3)
742    
743    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
744    
745            * ext/R/importRIS.R: Automatic RIS import is now possible.
746    
747    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
748    
749            * R/textdoccol.R: Added RIS HTML input format.
750    
751    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * R/textdoccol.R: Removed bug that caused invalid text document
754            collections when handling many input files.
755    
756    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
757    
758            * R/textdoccol.R: Restructured and extended file import
759            mechanism.
760    
761            * inst/doc/clustering.Rnw: Adapted vignette for use with
762            ReutNews.rda
763    
764            * man/ReutNews.Rd: Documentation for ReutNews.rda
765    
766            * data/ReutNews.rda: A tiny Reuters21578 example data set.
767    
768    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
769    
770            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
771            clustering facilities of this package.
772    
773    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
774    
775            * R/aobjects.R: Changed package document structure to avoid class
776            dependency problems.
777    
778    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
779    
780            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
781            data set.
782    
783            *  Finished documentation and reordered directory structure. Now "R
784            CMD check textmin" works without errors.
785    
786    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
787    
788            * src/: Various splits can now be easily created for the
789            Reuters21578 data set.
790    
791    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
792    
793            *  Updated documentation
794    
795    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
796    
797            *  Wrote R documentation for some classes and methods.
798    
799    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
800    
801            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
802            files. See the questionnaire data/Umfrage.csv for such an example.
803            We are now able to import files in Reuters-21578 XML format.
804    
805            *  Changed class interfaces in various files. Weighting of the text
806            matrix is now possible.
807    
808    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
809    
810            * R/textdoccol.R: One can build term-document matrices if
811            nessecary (with buildTDM(...)) and fill the field tdm from a text
812            document collection with it.
813    
814            * R/textmatrix.R: Wrote S4 class for term-document matrices.
815    
816    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
817    
818            * R/textdoccol.R: We now can read in a whole XML file with several
819            news items.
820    
821  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
822    
823          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.904

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge