2025-05-11 03:01:06 +02:00
|
|
|
NewsStats (c) 2010-2013, 2025 Thomas Hochstein <thh@thh.name>
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
NewsStats is a software package for gathering statistical data live
|
2025-06-01 16:39:25 +02:00
|
|
|
from a Usenet feed and subsequent analysis.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
2025-06-01 16:39:25 +02:00
|
|
|
This package is free software; you can redistribute it and/or modify
|
|
|
|
it under the terms of the GNU Public License as published by the Free
|
|
|
|
Software Foundation.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
---------------------------------------------------------------------
|
|
|
|
|
|
|
|
What's that?
|
|
|
|
|
2025-06-01 16:39:25 +02:00
|
|
|
There's a multitude of tools to create statistics about newsgroup
|
|
|
|
usage: number of postings per month or per person, longest threads,
|
|
|
|
and so on (see <https://th-h.de/net/usenet/stats/> [German language]
|
|
|
|
for an incomplete list). Most of them use a per-newsgroup approach
|
|
|
|
while NewsStats is hierarchy oriented.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
NewsStats will accumulate data from a live INN feed, allowing you
|
|
|
|
to process the saved information later on.
|
|
|
|
|
|
|
|
Workflow
|
|
|
|
|
|
|
|
NewsStats saves overview data and complete headers of (all)
|
|
|
|
incoming postings to a (MySQL) database in real time.
|
|
|
|
|
|
|
|
That raw data will be regularly - e.g. monthly - processed to a
|
|
|
|
second set of database tables each dedicated to a certain
|
2010-09-28 22:15:08 +02:00
|
|
|
statistical aspect, e.g. number of postings per group and month.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
Several kinds of reports can then be generated from those result
|
|
|
|
tables.
|
|
|
|
|
|
|
|
Prerequisites
|
|
|
|
|
|
|
|
NewsStats is written in Perl (5.8.x and above) and makes use of a
|
2010-09-28 22:15:08 +02:00
|
|
|
MySQL database, so you will need Perl, some modules, mysql and, of
|
|
|
|
course, INN.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
* Perl 5.8.x with standard modules
|
|
|
|
- Cwd
|
2025-06-01 16:39:25 +02:00
|
|
|
- Encode
|
2010-09-17 21:16:51 +02:00
|
|
|
- File::Basename
|
2025-06-01 16:39:25 +02:00
|
|
|
- Getopt::Long
|
2010-09-17 21:16:51 +02:00
|
|
|
- Sys::Syslog
|
|
|
|
|
2025-05-18 13:27:38 +02:00
|
|
|
* Perl modules from CPAN
|
2010-09-19 13:34:27 +02:00
|
|
|
- Config::Auto
|
2010-09-17 21:16:51 +02:00
|
|
|
- Date::Format
|
|
|
|
- DBI
|
|
|
|
|
|
|
|
* mysql 5.0.x
|
|
|
|
|
2025-06-01 16:39:25 +02:00
|
|
|
* a working installation of INN
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
Installation instructions
|
|
|
|
|
2025-05-18 13:54:57 +02:00
|
|
|
See INSTALL.
|
|
|
|
|
|
|
|
Documentation is in /doc, configuration in /etc, the NewsStats
|
|
|
|
module in /lib and most scripts in /bin, while /contrib has some
|
|
|
|
sample scripts that may have to be adjusted to work in your
|
|
|
|
configuration.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
Getting Started
|
|
|
|
|
|
|
|
'feedlog.pl' will continuously feed raw data to your raw data
|
|
|
|
table. See the feedlog.pl man page for more information.
|
|
|
|
|
2025-05-11 03:01:06 +02:00
|
|
|
You can process that data via 'gatherstats.pl'; currently the
|
2025-06-01 16:39:25 +02:00
|
|
|
tabulation of postings per group, injection server and posting
|
|
|
|
agent (newsreader) per month is supported. See the gatherstats.pl
|
|
|
|
man page for more information.
|
|
|
|
|
|
|
|
Example:
|
|
|
|
|
|
|
|
bin/gatherstats.pl
|
|
|
|
|
|
|
|
will parse raw data from the last month and save the results in
|
|
|
|
tables for postings per group, server and client, respectively.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
Report generation is handled by specialised scripts for each
|
2025-06-01 16:39:25 +02:00
|
|
|
report type: 'groupstats.pl' for postings per group
|
|
|
|
(s), 'hoststats.pl' for postings per injection server
|
|
|
|
(s) and 'clientstats.pl' for postings per posting agent. See the
|
|
|
|
groupstats.pl, hoststats.pl and clientstats.pl man pages for more
|
|
|
|
information.
|
|
|
|
|
|
|
|
Example:
|
|
|
|
|
|
|
|
bin/groupstats.pl -o postings-desc
|
|
|
|
bin/hoststats.pl -o postings-desc
|
|
|
|
bin/clientstats.pl -o postings-desc -v
|
|
|
|
|
|
|
|
will show reports for postings per group, per injection server and
|
|
|
|
per client (with detailed client versions) for the last month,
|
|
|
|
using the result tables filled by gatherstats.
|
|
|
|
|
|
|
|
To post those reports to Usenet, change postingstats.pl according
|
|
|
|
to your needs (sender, newsgroups and other headers, translation
|
|
|
|
of table headers and text templates) and display a test posting
|
|
|
|
by piping report data into postingstats.pl:
|
|
|
|
|
|
|
|
bin/groupstats.pl --nocomments -s -f dump | bin/postingstats.pl
|
|
|
|
|
|
|
|
If the result is to your liking, add a pipe to a inews
|
|
|
|
implementation.
|
|
|
|
|
|
|
|
Example:
|
|
|
|
|
|
|
|
bin/groupstats.pl --nocomments -s -f dump | bin/postingstats.pl | contrib/tinews.pl -X
|
|
|
|
|
|
|
|
More information
|
|
|
|
|
|
|
|
See the man pages for 'gatherstats' and the report generating
|
|
|
|
scripts.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
Reporting Bugs
|
|
|
|
|
2010-09-19 13:34:27 +02:00
|
|
|
You can report bugs or feature requests to the author using the
|
2024-05-27 00:55:55 +02:00
|
|
|
issue tracker at <https://code.virtcomm.de/thh/newsstats/issues>.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
2010-11-01 13:16:41 +01:00
|
|
|
Please have a look at the TODO list before suggesting
|
|
|
|
improvements.
|
|
|
|
|
2024-05-27 00:55:55 +02:00
|
|
|
Development
|
2010-09-17 21:16:51 +02:00
|
|
|
|
2025-06-01 16:39:25 +02:00
|
|
|
This package is maintained using the Git version control system at
|
2024-05-27 00:55:55 +02:00
|
|
|
<https://code.virtcomm.de/thh/newsstats/>.
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
Related projects
|
|
|
|
|
|
|
|
<http://usenet.dex.de/> is a site were data gathered via NewsStats
|
|
|
|
is used for a graphical presentation of activity in the de.*
|
2010-09-19 13:34:27 +02:00
|
|
|
Usenet hierarchy over the years (since 1992).
|
2010-09-17 21:16:51 +02:00
|
|
|
|
|
|
|
Author
|
|
|
|
|
2024-05-27 00:55:55 +02:00
|
|
|
Thomas Hochstein <thh@thh.name>
|
|
|
|
<https://th-h.de/>
|