Merge branch 'language' into next
* language: Some documentation fixes and enhancments. Improve INSTALL documentation. README: Update copyright notice. README: improve phrasing.
This commit is contained in:
commit
25b25735dd
15
doc/INSTALL
15
doc/INSTALL
|
@ -1,7 +1,7 @@
|
||||||
NewsStats 0.1 (c) 2010 Thomas Hochstein <thh@inter.net>
|
NewsStats 0.1 (c) 2010-2012 Thomas Hochstein <thh@inter.net>
|
||||||
|
|
||||||
NewsStats is a software package for gathering statistical data live
|
NewsStats is a software package used to gather statistical information
|
||||||
from a Usenet feed and subsequent examination.
|
from a live Usenet feed and for its subsequent examination.
|
||||||
|
|
||||||
This script package is free software; you can redistribute it and/or
|
This script package is free software; you can redistribute it and/or
|
||||||
modify it under the terms of the GNU Public License as published by
|
modify it under the terms of the GNU Public License as published by
|
||||||
|
@ -20,7 +20,8 @@ INSTALLATION INSTRUCTIONS
|
||||||
|
|
||||||
# tar -xzf newsstats-nn.tar.gz
|
# tar -xzf newsstats-nn.tar.gz
|
||||||
|
|
||||||
Scripts in this path should be executable by the news user.
|
Scripts in this path - at least feedlog.pl - should be executable by the
|
||||||
|
news user.
|
||||||
|
|
||||||
2) Configuration
|
2) Configuration
|
||||||
|
|
||||||
|
@ -80,8 +81,8 @@ INSTALLATION INSTRUCTIONS
|
||||||
* Edit your 'newsfeeds' file and insert something like
|
* Edit your 'newsfeeds' file and insert something like
|
||||||
|
|
||||||
## gather statistics for NewsStats
|
## gather statistics for NewsStats
|
||||||
newsstats!
|
newsstats!\
|
||||||
:!*,de.*
|
:!*,de.*\
|
||||||
:Tc,WmtfbsPNH,Ac:/path/to/feedlog.pl
|
:Tc,WmtfbsPNH,Ac:/path/to/feedlog.pl
|
||||||
|
|
||||||
* You should only feed that hierarchy (those hierarchies ...) to
|
* You should only feed that hierarchy (those hierarchies ...) to
|
||||||
|
@ -109,7 +110,7 @@ INSTALLATION INSTRUCTIONS
|
||||||
Everything should be going smoothly now.
|
Everything should be going smoothly now.
|
||||||
|
|
||||||
* If INN is spewing error messages to 'errlog' or reporting
|
* If INN is spewing error messages to 'errlog' or reporting
|
||||||
continous respaws of feedlog.pl to 'news.notice', stop your feed:
|
continous respawns of feedlog.pl to 'news.notice', stop your feed:
|
||||||
|
|
||||||
# ctlinnd drop 'newsstats!'
|
# ctlinnd drop 'newsstats!'
|
||||||
|
|
||||||
|
|
15
doc/README
15
doc/README
|
@ -1,4 +1,4 @@
|
||||||
NewsStats 0.1 (c) 2010 Thomas Hochstein <thh@inter.net>
|
NewsStats 0.1 (c) 2010-2012 Thomas Hochstein <thh@inter.net>
|
||||||
|
|
||||||
NewsStats is a software package for gathering statistical data live
|
NewsStats is a software package for gathering statistical data live
|
||||||
from a Usenet feed and subsequent examination.
|
from a Usenet feed and subsequent examination.
|
||||||
|
@ -12,7 +12,7 @@ the Free Software Foundation.
|
||||||
What's that?
|
What's that?
|
||||||
|
|
||||||
There's a multitude of tools for the statistical examination of
|
There's a multitude of tools for the statistical examination of
|
||||||
newsgroups: number of postings month or per person, longest
|
newsgroups: number of postings per month or per person, longest
|
||||||
threads, and so on (see <http://th-h.de/infos/usenet/stats.php>
|
threads, and so on (see <http://th-h.de/infos/usenet/stats.php>
|
||||||
[German language] for an incomplete list). Most of them use a per-
|
[German language] for an incomplete list). Most of them use a per-
|
||||||
newsgroup approach while NewsStats is hierarchy oriented.
|
newsgroup approach while NewsStats is hierarchy oriented.
|
||||||
|
@ -27,7 +27,7 @@ Workflow
|
||||||
|
|
||||||
That raw data will be regularly - e.g. monthly - processed to a
|
That raw data will be regularly - e.g. monthly - processed to a
|
||||||
second set of database tables each dedicated to a certain
|
second set of database tables each dedicated to a certain
|
||||||
statistical aspect, e.g. number of postings per group per month.
|
statistical aspect, e.g. number of postings per group and month.
|
||||||
|
|
||||||
Several kinds of reports can then be generated from those result
|
Several kinds of reports can then be generated from those result
|
||||||
tables.
|
tables.
|
||||||
|
@ -35,8 +35,8 @@ Workflow
|
||||||
Prerequisites
|
Prerequisites
|
||||||
|
|
||||||
NewsStats is written in Perl (5.8.x and above) and makes use of a
|
NewsStats is written in Perl (5.8.x and above) and makes use of a
|
||||||
MySQL database, so you'll need Perl, some modules, mysql and, of
|
MySQL database, so you will need Perl, some modules, mysql and, of
|
||||||
course, an INN.
|
course, INN.
|
||||||
|
|
||||||
* Perl 5.8.x with standard modules
|
* Perl 5.8.x with standard modules
|
||||||
- Cwd
|
- Cwd
|
||||||
|
@ -62,12 +62,12 @@ Getting Started
|
||||||
table. See the feedlog.pl man page for more information.
|
table. See the feedlog.pl man page for more information.
|
||||||
|
|
||||||
You can process that data via 'gatherstats.pl'; currently only the
|
You can process that data via 'gatherstats.pl'; currently only the
|
||||||
tabulation of postings per group per month is supported. More to
|
tabulation of postings per group and month is supported. More to
|
||||||
come. See the gatherstats.pl man page for more information.
|
come. See the gatherstats.pl man page for more information.
|
||||||
|
|
||||||
Report generation is handled by specialised scripts for each
|
Report generation is handled by specialised scripts for each
|
||||||
report type. Currently only reports on the number of postings per
|
report type. Currently only reports on the number of postings per
|
||||||
group per month are supported; you can use 'groupstats.pl' for
|
group and month are supported; you can use 'groupstats.pl' for
|
||||||
this. See the groupstats.pl man page for more information.
|
this. See the groupstats.pl man page for more information.
|
||||||
|
|
||||||
Reporting Bugs
|
Reporting Bugs
|
||||||
|
@ -95,3 +95,4 @@ Author
|
||||||
|
|
||||||
Thomas Hochstein <thh@inter.net>
|
Thomas Hochstein <thh@inter.net>
|
||||||
<http://th-h.de/>
|
<http://th-h.de/>
|
||||||
|
|
||||||
|
|
|
@ -201,7 +201,7 @@ Suppress logging to syslog.
|
||||||
|
|
||||||
=head1 INSTALLATION
|
=head1 INSTALLATION
|
||||||
|
|
||||||
See L<doc/INSTALL.>
|
See L<doc/INSTALL>.
|
||||||
|
|
||||||
=head1 EXAMPLES
|
=head1 EXAMPLES
|
||||||
|
|
||||||
|
|
|
@ -206,7 +206,7 @@ gatherstats - process statistical data from a raw source
|
||||||
|
|
||||||
=head1 SYNOPSIS
|
=head1 SYNOPSIS
|
||||||
|
|
||||||
B<gatherstats> [B<-Vhdt>] [B<-m> I<YYYY-MM> | I<YYYY-MM:YYYY-MM>] [B<-s> I<stats] [B<-c> I<filename template>]] [B<--hierarchy> I<TLH>] [B<--rawdb> I<database table>] [B<-groupsdb> I<database table>] [B<--clientsdb> I<database table>] [B<--hostsdb> I<database table>]
|
B<gatherstats> [B<-Vhdt>] [B<-m> I<YYYY-MM> | I<YYYY-MM:YYYY-MM>] [B<-s> I<stats>] [B<-c> I<filename template>]] [B<--hierarchy> I<TLH>] [B<--rawdb> I<database table>] [B<-groupsdb> I<database table>] [B<--clientsdb> I<database table>] [B<--hostsdb> I<database table>]
|
||||||
|
|
||||||
=head1 REQUIREMENTS
|
=head1 REQUIREMENTS
|
||||||
|
|
||||||
|
@ -293,7 +293,6 @@ Set processing period to a single month in YYYY-MM format or to a time
|
||||||
period between two month in YYYY-MM:YYYY-MM format (two month, separated
|
period between two month in YYYY-MM:YYYY-MM format (two month, separated
|
||||||
by a colon).
|
by a colon).
|
||||||
|
|
||||||
|
|
||||||
=item B<-s>, B<--stats> I<type>
|
=item B<-s>, B<--stats> I<type>
|
||||||
|
|
||||||
Set processing type to one of I<all> and I<groups>. Defaults to all
|
Set processing type to one of I<all> and I<groups>. Defaults to all
|
||||||
|
@ -307,8 +306,9 @@ one group on each line and ignoring everything after the first
|
||||||
whitespace (so you can use a file in checkgroups format or (part of)
|
whitespace (so you can use a file in checkgroups format or (part of)
|
||||||
your INN active file).
|
your INN active file).
|
||||||
|
|
||||||
The filename is taken from I<filename template>, amended by each B<--
|
The filename is taken from I<filename template>, amended by each
|
||||||
month> B<gatherstats> is processing, so that
|
B<--month> B<gatherstats> is processing in the form of I<template-YYYY-MM>,
|
||||||
|
so that
|
||||||
|
|
||||||
gatherstats -m 2010-01:2010-12 -c checkgroups
|
gatherstats -m 2010-01:2010-12 -c checkgroups
|
||||||
|
|
||||||
|
|
|
@ -379,6 +379,9 @@ Restrict output to those newgroups present in a file in checkgroups format
|
||||||
(one newgroup name per line; everything after the first whitespace on each
|
(one newgroup name per line; everything after the first whitespace on each
|
||||||
line is ignored). All other newsgroups will be removed from output.
|
line is ignored). All other newsgroups will be removed from output.
|
||||||
|
|
||||||
|
Contrary to B<gatherstats>, I<filename> is not a template, but refers to
|
||||||
|
a single file in checkgroups format.
|
||||||
|
|
||||||
=item B<-r>, B<--report> I<default|average|sums>
|
=item B<-r>, B<--report> I<default|average|sums>
|
||||||
|
|
||||||
Choose the report type: I<default>, I<average> or I<sums>
|
Choose the report type: I<default>, I<average> or I<sums>
|
||||||
|
@ -436,8 +439,8 @@ you'll get the following result:
|
||||||
|
|
||||||
de.comp.datenbanken.misc has not been considered even though it has 38
|
de.comp.datenbanken.misc has not been considered even though it has 38
|
||||||
postings in total, because it has less than 25 postings in every single
|
postings in total, because it has less than 25 postings in every single
|
||||||
month. If you want to list all newsgroups with more than 25 postings U<in
|
month. If you want to list all newsgroups with more than 25 postings
|
||||||
total>, you'll have to set the boundary type to I<sum>, see below.
|
I<in total>, you'll have to set the boundary type to I<sum>, see below.
|
||||||
|
|
||||||
A boundary type of I<level> will show only those newsgroups - at all -
|
A boundary type of I<level> will show only those newsgroups - at all -
|
||||||
that satisfy the boundaries in each and every single month. With the above
|
that satisfy the boundaries in each and every single month. With the above
|
||||||
|
@ -449,10 +452,10 @@ you'll get this result:
|
||||||
de.comp.datenbanken.ms-access 293
|
de.comp.datenbanken.ms-access 293
|
||||||
|
|
||||||
de.comp.datenbanken.mysql has not been considered because it had less than
|
de.comp.datenbanken.mysql has not been considered because it had less than
|
||||||
25 postings in 2012-02.
|
25 postings in 2012-02 (only).
|
||||||
|
|
||||||
You can use that to get a list of newsgroups that have more (or less) then
|
You can use that to get a list of newsgroups that have more (or less) then
|
||||||
x postings during the whole reporting period.
|
x postings in every month during the whole reporting period.
|
||||||
|
|
||||||
A boundary type of I<average> will show only those newsgroups - at all -that
|
A boundary type of I<average> will show only those newsgroups - at all -that
|
||||||
satisfy the boundaries on average. With the above list of newsgroups and
|
satisfy the boundaries on average. With the above list of newsgroups and
|
||||||
|
|
Loading…
Reference in a new issue