[Product-Developers] Best approach for archiving a Plone website

David Bain david.bain at alteroo.com
Wed Jul 30 21:50:57 UTC 2008


Webstemmer looks useful.

On Wed, Jul 30, 2008 at 4:17 PM, Tim Knapp <duffyd at kokorice.org> wrote:
> Hi David,
>
> Have you thought of using webstemmer[1]? I'm actually in the process of
> designing a solution for this at the moment. Interested in sharing
> ideas?
>
> Thanks,
> Tim
>
> [1] http://www.unixuser.org/~euske/python/webstemmer/
>
> On Wed, 2008-07-30 at 14:20 -0500, David Bain wrote:
>> Martin,
>> That looks a bit more sane, but when I attempt to pull the site down
>> with wget I end up having a <base href=mysite> issue. Where everty
>> html files points back to the original site as a result.
>>
>> On Wed, Jul 30, 2008 at 5:59 AM, Martin Stadler <martin at siarp.de> wrote:
>> >
>> > Am 28.07.2008 um 18:38 schrieb David Bain:
>> >
>> >> I need to save an all html version of a plone site, so that it can be
>> >> stored on a thumbdrive or CD for archiving purposes. What's the best
>> >> approach?
>> >> In the past I've used httrack, but it does some strange things with images
>> >> creating a directory named after the image for example image.jpg would
>> >> become image.jpg/thumb, image.jpg/preview etc...
>> >>
>> >
>> > You can also try wget (find a hint how to grab a site here:
>> > http://en.wikipedia.org/wiki/Wget#Using_Wget). Don't know how good it works
>> > with a Plone site.
>> >
>> > Martin
>> >
>> >
>>
>> _______________________________________________
>> Product-Developers mailing list
>> Product-Developers at lists.plone.org
>> http://lists.plone.org/mailman/listinfo/product-developers
>
>
> _______________________________________________
> Product-Developers mailing list
> Product-Developers at lists.plone.org
> http://lists.plone.org/mailman/listinfo/product-developers
>




More information about the Product-Developers mailing list