-
-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generating a static archive of bugs.python.org and bugs.jython.org #370
Comments
@JacobCoffee, @rouilj and I had a conversation on IRC a while back and John was tracking a request from another user regarding static archives of roundup instances. |
@ewdurbin, I have not had any luck tracking down the user from the prior discussion. However, some things to consider: How much of the msgs/files/urls do you want to keep? Do user### urls matter? Do any other class urls matter? As a first pass consider scraping all the issue### urls. The issue url's include the sender, date and If not this becomes easier. If you do need to keep the msgs (e.g. for recipient list, or exact For files, how much of the metadata do you need? Scraping the files#### url and then placing the actual One issue might be setting the mime type for the attached files. If you can live To preserve internal links (e.g. issue123 references issue456) , we would need to make the url If this isn't possible, we would need to automate munging the html in the scraped files changing Also I don't see a reasonable way to generate an index page. How useful would a series of This ties in with searching the site. Roundup provides faceted searching (status, message text, title, That's a few things to consider off the top of my head. |
Currently bugs.python.org and bugs.jython.org have both been deprecated in favor of Github Issues.
The messages/files/urls need to remain online in perpetuity as a reference (we don't want to break old links!).
We should investigate methods of creating a static archive of the sites for this purpose, to avoid the need to maintain the installations forever.
cc @rouilj
The text was updated successfully, but these errors were encountered: