[Product-Developers] [Fwd: Re: [Plone-Users] Bulk import of data]

Dylan Jay gmane at dylanjay.com
Wed Apr 9 05:49:23 UTC 2008


I'm moving this thread to this group as it's more appropriate.

Anyone here interested in this project?

-------- Original Message --------
Subject: Re: [Plone-Users] Bulk import of data
Date: Wed, 09 Apr 2008 13:01:04 +1000


Martijn Pieters wrote:
> On Mon, Apr 7, 2008 at 8:53 PM, Dylan Jay <gmane at dylanjay.com> wrote:
>>> like tools in a toolbox; I imagine a MySQL blueprint to do imports
>>  > from a myriad of different PHP CMSes, where you can easily abstract
>>
>>  but if we have something that can crawl any live site with no knowledge
>>  of which CMS it's using, or even if it's using a CMS... and still get out
>>  all the content. Doesn't that mean we don't need any blueprints or
>>  scraping or specific code at all?
> 
> A generic live-site CMS-agnostic transmogrifier source would be a
> blueprint in itself, so you could slot extra (standard) sections after
> such a source to clean and tweak whatever you get out of that source.
> The intention is certainly for there to be an ecosystem of standard
> blueprints to build such a pipeline easily, without additional coding.
> 
> If you can get access to the CMS-specific data and create a better
> import with a CMS-specific source section (or set of sections), that
> would be an option too of course.

Sounds great. I think this would be a great platform to work with. I
tend to convert static sites or proprietary CMSes, so a generic live-site
source is of most interest to me.

I just need to find time to modify the webstemmer code
http://pypi.python.org/simple/webstemmer/
to get it to retain minimal formatting and work with transmogrifier.
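
To make the idea concrete, here is a rough sketch of the pipeline shape
discussed above: a generic crawling source that yields one item per page,
followed by small standard sections that clean up whatever comes out. It
is plain Python generators only and does not use the real transmogrifier
or webstemmer APIs. The names crawl_source, extract_title, strip_markup,
run_pipeline and the PAGES data are all made up for illustration, and the
regex clean-up is a crude stand-in for proper layout analysis.

```python
import re

# Hypothetical stand-in for pages fetched from a live site.
# A real source section would crawl and download these.
PAGES = {
    "/index.html": "<html><body><h1>Home</h1><p>Welcome <b>all</b></p></body></html>",
    "/about.html": "<html><body><h1>About</h1><p>Who we are</p></body></html>",
}


def crawl_source(pages):
    """Source section: yield one item (a dict) per crawled page."""
    for path, html in sorted(pages.items()):
        yield {"_path": path, "html": html}


def extract_title(previous):
    """Standard section: copy the first <h1> into a 'title' key."""
    for item in previous:
        match = re.search(r"<h1>(.*?)</h1>", item["html"], re.S)
        if match:
            item["title"] = match.group(1).strip()
        yield item


def strip_markup(previous):
    """Standard section: drop the heading and all tags, keep plain text."""
    for item in previous:
        body = re.sub(r"<h1>.*?</h1>", "", item["html"], flags=re.S)
        item["text"] = " ".join(re.sub(r"<[^>]+>", " ", body).split())
        yield item


def run_pipeline(pages):
    """Chain the sections: each one consumes the output of the previous."""
    pipeline = strip_markup(extract_title(crawl_source(pages)))
    return list(pipeline)
```

Calling run_pipeline(PAGES) returns a list of dicts, each carrying the
page path plus the extracted title and text. Swapping crawl_source for a
CMS-specific source would leave the later sections untouched, which is
the point of keeping the source and the clean-up steps separate.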

Anyone else see this as a useful use-case and want to help?

Dylan.