[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC
pkg/ChangeLog revision 909, Sun Mar 22 12:45:59 2009 UTC
# Line 1  Line 1 
1    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/source.R (Source): New slot Vectorized.
4    
5    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/reader.R (readCustom): Experimental reader which can be
8            customized via user-defined mappings.
9    
10            * R/reader.R: Always use UTC time zone.
11    
12            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
13    
14    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
15    
16            * R/reader.R (readDOC): Options can be passed over to antiword.
17    
18            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
19            pdftotext.
20    
21    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
22    
23            * R/source.R (DirSource): Add pattern and ignore.case arguments
24            which are internally passed over to list.files().
25    
26    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
27    
28            * inst/doc/tm.Rnw: Suppress pointless loading message.
29    
30    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
31    
32            * DESCRIPTION: Speed up package loading (via moving packages not
33            strictly necessary for normal operation to Suggests instead of
34            Depends).
35    
36    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/reader.R (readNewsgroup): The date format is now configurable.
39    
40    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
43    
44    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
45    
46            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
47    
48    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
49    
50            * R/source.R (DataframeSource): New source class for data frames.
51    
52            * R/source.R: Fixed non-standard call evaluation.
53    
54    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
55    
56            * R/source.R (URISource): New source class for a single document.
57    
58    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
59    
60            * R/source.R: Refactoring.
61    
62    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
63    
64            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
65            Rmpi installations more gracefully.
66    
67    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
68    
69            * R/source.R (Source): Add Length slot.
70    
71    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
72    
73            * R/AAA.R: Unify duplicated .onLoad function.
74    
75    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
76    
77            * DESCRIPTION (Suggests): Added Rmpi.
78    
79    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
80    
81            * R/source.R (getElem): Fix 'no visible binding' warning.
82    
83            * man/WeightFunction.Rd: Fix signature.
84    
85    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
86    
87            * R/weight.R: Introduce name abbreviations for weighting functions.
88    
89    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
90    
91            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
92    
93            * R/cluster.R: Provide convenience functions for using an MPI
94            cluster.
95    
96            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
97            available.
98    
99            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
100            available.
101    
102    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
103    
104            * R/textdoccol.R (lapply): Removed debug print out.
105    
106    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
107    
108            * R/reader.R (readRCV1): Improved meta data extraction from
109            Reuters Corpus Volume 1 documents.
110    
111    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
112    
113            * R/transform.R: Ensure that all mappings preserve multiline
114            structures.
115    
116    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
117    
118            * R/filter.R: Every filter now has an attribute indicating whether
119            it should be applied at the document level (doclevel).
120    
121            * R/textdoccol.R (tmFilter): Set searchFullText as new default
122            filter.
123    
124    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
125    
126            * R/transform.R (replacePatterns): Replaced removeWords by
127            replacePatterns. Suggested by Christian Buchta.
128    
129            * R/textdoccol.R (inspect): Improved formatting.
130    
131    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
132    
133            * inst/CITATION: Updated JSS article information.
134    
135            * R/textdoccol.R (setAs): Added coerce method from list to
136            corpus.
137    
138            * R/meta.R (meta): Improved meta data handling.
139    
140    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
141    
142            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
143            Christian Buchta.
144    
145            * inst/CITATION: Added template to include JSS article reference.
146    
147    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
148    
149            * R/textdoccol.R (tmMap): Introduced lazy mapping.
150    
151            * R/source.R: Added VectorSource.
152    
153    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * man/: Language codes should be in ISO 639-1 format.
156    
157            * R/textdoccol.R (asPlain): Preserve local meta data.
158    
159    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
160    
161            * R/textdoccol.R (writeCorpus): Function for writing a corpus
162            containing plain text documents to disk.
163    
164    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
165    
166            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
167            always set correctly.
168    
169            * R/textdoccol.R: Set load = TRUE as default for load on demand
170            since in most cases this is the wanted behaviour.
171    
172    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
175    
176            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
177    
178    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
179    
180            * R/meta.R (meta): New function for consistent access to meta data
181            of document collections, repositories, and texts.
182    
183    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
184    
185            * R/: Better support for encodings.
186    
187    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
188    
189            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
190            selection when no reader argument is given.
191    
192    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
193    
194            * R/source.R (CSVSource): Now uses read.csv instead of scan
195            internally.
196    
197    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
198    
199            * R/reader.R (getReaders): Returns available reader functions.
200    
201            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
202            as default.
203    
204    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
205    
206            * R/stopwords.R (stopwords): Shortened code, removed codetools
207            variable warnings.
208    
209            * man/: Documentation for showMeta, added an example for tmMap.
210    
211            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
212            some minor typos fixed.
213    
214    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * R/aobjects.R (showMeta): Added method for pretty printing a
217            text document's meta data.
218    
219    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
220    
221            * R/textdoccol.R (TextDocCol): Better handling of empty
222            arguments.
223    
224            * NAMESPACE: Exported readDOC.
225    
226            * man/completeStems.Rd: Added an example.
227    
228    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * R/stopwords.R (stopwords): Look up .dat files at every
231            call. Allows users to modify stopword .dat files interactively.
232    
233    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/termdocmatrix.R (termFreq): Correct processing of empty
236            documents.
237    
238    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
239    
240            * man/: Updated documentation.
241    
242    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
243    
244            * R/complete.R (completeStems): Completes (heuristically) word
245            stems.
246    
247            * R/termdocmatrix.R (TermDocMatrix2): New modular
248            constructor.
249    
250            * NAMESPACE: Exported termFreq.
251    
252    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
253    
254            * R/reader.R (readDOC): Added MS Word reader (using antiword).
255    
256    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
257    
258            * R/weight.R: Weighting functions for TermDocMatrix.
259    
260    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
263            functions for accessing dimension, column, and row names.
264    
265            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
266    
267    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
268    
269            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
270    
271    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
274    
275    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * R/reader.R (readPDF): Removed manual checks for pdftotext and
278            pdfinfo. The system call gives a warning anyway.
279    
280    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
281    
282            * R/textdoccol.R (asPlain): Conversion from
283            StructuredTextDocuments to PlainTextDocuments.
284    
285    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
286    
287            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
288            for accessing term-document matrices.
289    
290            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
291            are installed.
292    
293    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
296            Christian Buchta.
297    
298    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
301    
302    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
305    
306            * R/reader.R (readPDF): Added PDF reader.
307    
308    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
309    
310            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
311    
312            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
313    
314            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
315    
316            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
317    
318    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
319    
320            * R/distmeasure.R (dissimilarity): Replaced dists call from
321            package cba by new dist call from package proxy.
322    
323    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
324    
325            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
326    
327    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
328    
329            * R/termdocmatrix.R: require() uses the quietly option to suppress
330            loading messages.
331    
332    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
333    
334            * R/dictionary.R: Added dictionary support.
335    
336    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
337    
338            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
339            documents. This simplifies some functions, e.g., asPlain.
340    
341    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * inst/doc/tm.Rnw: Fixed some typos in vignette.
344    
345    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
346    
347            * R/textdoccol.R (replaceWords): Added method to replace a set of
348            words by a single word. Useful for synonyms.
349    
350    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
351    
352            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
353    
354    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
355    
356            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
357            vectors. Thanks to Ariel Maguyon for his error report.
358            (removeSparseTerms): New function to remove columns from a
359            term-document matrix exceeding a sparse factor.
360    
361    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
362    
363            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
364    
365    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * man/sFilter.Rd: Corrected documentation on statement format (use
368            '==' instead of '=').
369    
370    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
371    
372            * R/aobjects.R (StructuredTextDocument): Inherits from
373            TextDocument.
374    
375    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
376    
377            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
378            on sparse matrices as proposed by Martin Maechler.
379    
380    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
381    
382            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
383            \pkg{filehash} version makes them deprecated.
384    
385    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
386    
387            * R/termdocmatrix.R (textvector): Stemming is now performed before
388            erasing stopwords.
389            (weightMatrix): Adapted to handle sparse matrices.
390            (TermDocMatrix): Sparse matrix is now efficiently built by
391            direct stepwise insertion of row values into it.
392    
393    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
394    
395            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
396            due to ongoing problems. For our purposes the latter is as useful
397            as the replaced package.
398    
399    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
400    
401            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
402    
403            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
404    
405    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
406    
407            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
408            languages with available stopwords.
409    
410    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
411    
412            * inst/doc/tm.Rnw: Minor corrections in the vignette.
413    
414    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
415    
416            * DESCRIPTION: Update to version 0.2, since a lot of new features
417            have been integrated.
418    
419            * inst/stopwords: Updated existing stopwords and added stopwords
420            for various other languages.
421    
422    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * man/: Updated documentation.
425    
426            * Work/testDb.R: Script to test database stuff.
427    
428            * R/: Fixed various database-related bugs. Seems to be rather
429            usable now, i.e., consider it alpha status for now.
430    
431    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
432    
433            * R/: Fixed some bugs related to database support.
434    
435    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
436    
437            * man/: Added a lot of examples to the manuals.
438    
439    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
440    
441            * man/: Updated parts of the documentation.
442    
443            * R/textdoccol.R (asPlain): Added conversion from newsgroup
444            documents to plain text documents.
445    
446    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
447    
448            * R/textdoccol.R: Finished experimental database support. Not yet
449            intensively tested.
450    
451            * R/source.R: Now each source has a default reader.
452    
453            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
454            class anymore.
455    
456            * R/plaintextdoc.R: Custom show method for plain text documents.
457    
458            * R/aobjects.R: Added a class for structured text documents.
459    
460            * R/reader.R: Replaced remaining \code{parser} occurrences with
461            \code{reader}.
462    
463            * R/textdoccol.R (summary): Indent tags.
464    
465            * R/textdoccol.R (removePunctuation): Transform method to remove
466            punctuation marks.
467    
468    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
469    
470            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
471            using prescindMeta().
472    
473    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
474    
475            * R/textdoccol.R: Improved database support.
476    
477    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
478    
479            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
480    
481            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
482            language code.
483    
484            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
485            into parserControl argument.
486    
487            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
488    
489    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
490    
491            * Work/tmDataSetup.R: The datasets acq and crude can now be
492            created on the fly.
493    
494            * R/stopwords.R: Introduced a function returning the stopwords for
495            a given language (English, German and French at the moment).
496    
497            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
498            otherwise falls back to Snowball package.
499    
500    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
501    
502            * man/dissimilarity-methods.Rd: Make clear that any method offered
503            by "dists" from package "cba" can be used.
504    
505    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
506    
507            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
508            to Kurt's latex suggestion. Removed points and underscores in
509            variable names for consistent naming.
510    
511            * DESCRIPTION: Update to version 0.1-2.
512    
513            * man/TextRepository.Rd: Fixed bug in documentation.
514    
515    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
516    
517            * DESCRIPTION: Update to version 0.1-1.
518    
519    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
520    
521            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
522            wordStem.
523    
524    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
525    
526            * R/: Changes due to Kurt's review.
527    
528    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
529    
530            * R/: Implemented improvements based upon comments by David
531            Meyer.
532    
533    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
534    
535            * inst/doc/: Rewrote vignette.
536    
537            * man/: Improved documentation.
538    
539    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
540    
541            * man/: Updated documentation.
542    
543            * DESCRIPTION: Changed package name to "tm". Updated version to
544            0.1 for first CRAN release.
545    
546            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
547            list archive example.
548    
549            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
550            archive example.
551    
552            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
553            from (several mails per box) mbox format to (single mail per file)
554            eml format.
555    
556    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
557    
558            * data/crude.rda: Rebuilt.
559    
560            * data/acq.rda: Rebuilt.
561    
562            * R/reader.R: Factored out reader and parser methods from
563            textdoccol.R.
564    
565            * R/source.R: Factored out Source methods from aobjects.R and
566            textdoccol.R.
567            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
568            feeds.
569    
570            * R/textdoccol.R (DirSource): Added support for recursive
571            traversal of directories.
572    
573    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
574    
575            * R/textdoccol.R ([[): Loads the document corpus automatically
576            into memory upon access.
577            (tm_transform, tm_filter): Removed several checks whether the
578            document is already loaded ([[ ensures this now).
579            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
580            mailing list archive.
581    
582    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
583    
584            * R/aobjects.R (TextDocument): Is now a virtual class.
585            (Source): Is now a virtual class.
586    
587    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
588    
589            * R/textdoccol.R (c): Support for an arbitrary number of document
590            collections.
591    
592    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
593    
594            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
595            append_meta and remove_meta.
596    
597            * R/textdoccol.R: Removed modify_metadata method.
598    
599            * R/textrepo.R: Removed modify_metadata method.
600    
601            * R/textdoccol.R (remove_meta): Supports removal of document
602            collection metadata and document (= in data frame) metadata.
603    
604    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
605    
606            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
607    
608            * data/crude.rda: Rebuilt.
609    
610            * data/acq.rda: Rebuilt.
611    
612            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
613    
614            * R/textdoccol.R ([): Bug fix for subsetting a document
615            collection's data frame.
616    
617    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
618    
619            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
620            to s_filter.
621    
622            * R/textdoccol.R: Local text documents' metadata can now be copied
623            to a document collection's data frame with prescind_meta.
624    
625    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
626    
627            * R/: Text documents' slot metadata is now accessible in s_filter.
628    
629            * R/: Rewrote s_filter function (has still some restrictions).
630    
631    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
632    
633            * R/: Various fixes in handling metadata.
634    
635            * R/: Added update mechanism for text document collections.
636    
637    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
638    
639            * R/: Merging of document collections now creates a binary tree
640            for reconstructing merged document collections.
641    
642            * R/: Redesign of metadata for document collections.
643    
644    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
645    
646            * R/: Messages now use \code{ngettext}.
647    
648    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
649    
650            * R/: Added functions for modifying and removing metadata.
651    
652    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
653    
654            * man/: Updated some documentation.
655    
656            * R/: Corrected some connection issues.
657    
658            * inst/doc: Worked on the vignette.
659    
660    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
661    
662            * inst/: Added texts and started vignette.
663    
664            * R/: Final changes based upon David's comments.
665    
666    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
667    
668            * NAMESPACE: Corrected exports (generic methods need exportMethods
669            directives!).
670    
671    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
672    
673            * R/: Modified the TextDocCol constructor and various parsers. It
674            is now modular and supports various file formats via plugins (see
675            the new "Source" class).
676    
677    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
678    
679            * man/: Revised documentation after previous code changes.
680    
681    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
682    
683            * R/: Remaining changes as discussed with David.
684    
685    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
686    
687            * R/: Some changes as suggested by David. The rest will follow
688            within the next days.
689    
690    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
691    
692            * man/: Finished documentation.
693    
694    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
695    
696            * man/: Wrote some documentation.
697    
698    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
699    
700            * R/: Further syntactic sugar in form of additional assignment and
701            accessor methods.
702    
703    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
704    
705            * R/: Syntactic sugar in form of "length", "show" and "summary"
706            operators.
707    
708    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
709    
710            * R/: Diverse updates. Mainly on default operators ("[" or "c")
711            and dissimilarities.
712    
713    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * R/: Added similarity functions.
716    
717            * data/: Added English stopwords.
718    
719    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
720    
721            * data/: Examples compiled for new features
722    
723            * R/: Changes due to new structure.
724    
725            * NAMESPACE: Corrected namespace to reflect new structure.
726    
727            * R/termdocmatrix.R: Adapted for new naming scheme.
728    
729    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
730    
731            * R/textdoccol.R: Adapted code for new class structure. Wrote
732            several transform and filter functions operating on text document
733            collections (alias text document databases).
734    
735            * R/aobjects.R: Adapted class structure with inheritance,
736            repositories and additional meta data. Loading files on demand is
737            now possible.
738    
739    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
740    
741            * R/: Some cosmetic cleanups.
742    
743            * inst/: Removed vignette on clustering. That and much more is now
744            described in the JSS paper on text mining. Based upon that
745            article an elaborated vignette will be incorporated in the future.
746    
747    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
748    
749            * R/: Updated generic S4 methods to comply with signature changes
750            in newer versions of R (> 2.3)
751    
752    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
753    
754            * ext/R/importRIS.R: Automatic RIS import is now possible.
755    
756    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
757    
758            * R/textdoccol.R: Added RIS HTML input format.
759    
760    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
761    
762            * R/textdoccol.R: Removed bug that caused invalid text document
763            collections when handling many input files.
764    
765    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
766    
767            * R/textdoccol.R: Restructured and extended file import
768            mechanism.
769    
770            * inst/doc/clustering.Rnw: Adapted vignette for use with
771            ReutNews.rda
772    
773            * man/ReutNews.Rd: Documentation for ReutNews.rda
774    
775            * data/ReutNews.rda: A tiny Reuters21578 example data set.
776    
777    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
778    
779            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
780            clustering facilities of this package.
781    
782    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
783    
784            * R/aobjects.R: Changed package document structure to avoid class
785            dependency problems.
786    
787    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
788    
789            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
790            data set.
791    
792            *  Finished documentation and reordered directory structure. Now "R
793            CMD check textmin" works without errors.
794    
795    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
796    
797            * src/: Various splits can now be easily created for the
798            Reuters21578 data set.
799    
800    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
801    
802            *  Updated documentation
803    
804    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
805    
806            *  Wrote R documentation for some classes and methods.
807    
808    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
809    
810            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
811            files. See the questionnaire data/Umfrage.csv for such an example.
812            We are now able to import files in Reuters-21578 XML format.
813    
814            *  Changed class interfaces in various files. Weighting of the text
815            matrix is now possible.
816    
817    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
818    
819            * R/textdoccol.R: One can build term-document matrices if
820            necessary (with buildTDM(...)) and fill the field tdm from a text
821            document collection with it.
822    
823            * R/textmatrix.R: Wrote S4 class for term-document matrices.
824    
825    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
826    
827            * R/textdoccol.R: We now can read in a whole XML file with several
828            news items.
829    
830    2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
831    
832            * R/textdoccol.R: Set up an S4 class for a collection of text
