geminispace.info

Unnamed repository; edit this file 'description' to name the repository.
git clone git://code.clttr.info/geminispace.info.git
Log | Files | Refs | README | LICENSE

commit 14c39977247cd992c0a49efc1bd8416a9979a942
parent e0fba80405ff7ea29d1680098031eaee3e165628
Author: René Wagner <rwa@clttr.info>
Date:   Tue, 25 May 2021 21:13:28 +0200

news 2021-05-25

Diffstat:
Mgus/crawl.py | 1+
Mserve/templates/news.gmi | 4++++
2 files changed, 5 insertions(+), 0 deletions(-)

diff --git a/gus/crawl.py b/gus/crawl.py @@ -200,6 +200,7 @@ EXCLUDED_URL_PREFIXES = [ # list of ~30000 stations, crawling takes too long "gemini://gemini.tunerapp.org/stations/", + "gemini://tunerapp.org/stations/", # this page inexplicably breaks both build_index, as well as elpher # when I browse to it... I think it might have some weird encoding diff --git a/serve/templates/news.gmi b/serve/templates/news.gmi @@ -2,6 +2,10 @@ ## News +### 2021-05-25 +geminispace.info is now aware of more than 1000 capsules. Unfortunately this data is somewhat misleading: some of the capsules may already be gone, but GUS lacks a mechanism for invaliding old data. +I'll probably start with some manual cleanup the next days, so don't worry if numbers go down. + ### 2021-05-12 We are back on track with crawl and index, everything is up-to-date again. I had to add another news and a wikipedia mirror to the exclude list. The current implementation can't handle such a huge amount of information well.