XBMC Community Forum  

Go Back   XBMC Community Forum > Development > Scraper Development

Scraper Development Developers forum for meta data scrapers. Scraper developers only!
Not for posting feature requests, bugs, or end-user support requests!

Reply
 
Thread Tools Search this Thread Display Modes
Old 2007-03-22, 18:30   #1
HectorziN
Member
 
HectorziN's Avatar
 
Join Date: Mar 2007
Location: Barcelona (Spain)
Posts: 90
HectorziN is on a distinguished road
Default FilmAffinity (Spanish) scraper

Hello guys.

In last build, is included a new Spanish scraper by Jurrabi. This scraper uses www.culturalianet.com
I tried this, and I found this web page hasn’t enough information. I decided then making a new scraper using www.filmaffinity.com web page. This scrapper supports all of this:

- Title
- Plot
- Outline (when available for the movie)
- Year
- Director
- Country and original version title (used mpaa tagline for this)
- Credits
- Genre
- Nominations, Oscars, and other prices (used mpaa field for this)
- Rating
- Votes
- Runtime
- Actor name

I think that it could be useful adding it to next releases.
Where can I post the xml file and the gif file?


Best Regards.

HectorziN (From Spain)
HectorziN is offline   Reply With Quote
Old 2007-03-22, 19:25   #2
Nuka1195
Team Arcade
 
Join Date: Dec 2004
Posts: 3,369
Nuka1195 is on a distinguished road
Default

submit a patch on SF.

https://sourceforge.net/tracker/?gro...54&atid=581840
__________________
For python coding questions first see http://xbmc.sourceforge.net/python-docs/
Nuka1195 is offline   Reply With Quote
Old 2007-03-22, 19:30   #3
HectorziN
Member
 
HectorziN's Avatar
 
Join Date: Mar 2007
Location: Barcelona (Spain)
Posts: 90
HectorziN is on a distinguished road
Default FilmAffinity Scraper Download

http://hectorzin.dynalias.com/FilmAffinity.zip
Regards

HectorziN
HectorziN is offline   Reply With Quote
Old 2007-03-23, 11:41   #4
HectorziN
Member
 
HectorziN's Avatar
 
Join Date: Mar 2007
Location: Barcelona (Spain)
Posts: 90
HectorziN is on a distinguished road
Default

I don't know how submit a patch. I have posted here a link to download. Guys, try to test it and evaluate. I think that it is very good. I haven't found a movie that dont' work with this scraper.

The only problem I have found, and I don't know how to solve it, is that don't work when the search string contains ñ, á, é, í, ó and ú. you mast search using n, a, e, i, o and u.

If anybody knows how to fix, please, tell me.

I am waiting your comments.
__________________
HectorziN
HectorziN is offline   Reply With Quote
Old 2007-03-26, 16:44   #5
spiff
Grumpy Bastard Developer
 
spiff's Avatar
 
Join Date: Nov 2003
Posts: 7,715
spiff is on a distinguished road
Default

sounds like encoding issues to me. make sure you properly specify utf vs not in your returned xml's.
__________________
Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
spiff is offline   Reply With Quote
Old 2007-03-27, 20:03   #6
spiff
Grumpy Bastard Developer
 
spiff's Avatar
 
Join Date: Nov 2003
Posts: 7,715
spiff is on a distinguished road
Default

seems to be working fine. i'll hold off a couple of days to see if you nail that issue, then i'll commit to svn.

cheers

spiff
__________________
Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
spiff is offline   Reply With Quote
Old 2007-03-29, 10:03   #7
HectorziN
Member
 
HectorziN's Avatar
 
Join Date: Mar 2007
Location: Barcelona (Spain)
Posts: 90
HectorziN is on a distinguished road
Default problems with ñ

the problem is that xbmc transforms ñ to ñ
In culturalianet.com, a search to ñ works, but filmaffinity.com needs to receive %F1.

The problem is that is xbmc who send ñ to the parer, then the parser cannot do anything with this.... This is what I think is happening.

Here is the log

INFO: Get URL: http://www.filmaffinity.com/es/searc...±o&stype=title

if you paste this url into internet explorer it won't find anything, but if you paste http://www.filmaffinity.com/es/searc...1o&stype=title then it will work. How can this be fixed?

Thanks
__________________
HectorziN
HectorziN is offline   Reply With Quote
Old 2007-03-29, 13:32   #8
spiff
Grumpy Bastard Developer
 
spiff's Avatar
 
Join Date: Nov 2003
Posts: 7,715
spiff is on a distinguished road
Default

we need some way for the scraper to say that it wants iso not utf chars. will have a crack at it later.
__________________
Always read the XBMC online-manual, FAQ and search the forum before posting.
Do not e-mail XBMC-Team members directly asking for support. Read/follow the forum rules.
For troubleshooting and bug reporting please make sure you read this first.
spiff is offline   Reply With Quote
Old 2007-03-29, 14:48   #9
HectorziN
Member
 
HectorziN's Avatar
 
Join Date: Mar 2007
Location: Barcelona (Spain)
Posts: 90
HectorziN is on a distinguished road
Default

OK, thanks, I'll wait for the new scraper functionallity
__________________
HectorziN
HectorziN is offline   Reply With Quote
Old 2007-04-03, 18:03   #10
HectorziN
Member
 
HectorziN's Avatar
 
Join Date: Mar 2007
Location: Barcelona (Spain)
Posts: 90
HectorziN is on a distinguished road
Default

Any progress
is there a new definition of scraper languaje to explicit if dthe search using utf or iso?

Thanks a lot.

Also, anyone has test this scrapper? what do you think?
__________________
HectorziN
HectorziN is offline   Reply With Quote
Reply

Bookmarks


Currently Active Users Viewing This Thread: 2 (1 members and 1 guests)
Almaciguero
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +2. The time now is 17:38.


Protected by Akismet, We recommend WordPress blogs
Copyright © 2008, XBMC Project