Kinopoisk.ru (Russian Movies) Scraper


hamp
2009-02-12, 20:31
56

Kinopoisk.ru v2.008 (Russian)

List:
-Added fanart (screens and wallpapers).
-Added fanart from www.themoviedb.org (fix the).
-Added trailer from www.imdb.com.
-Added fanart from www.themoviedb.org +24 (fix the).

v2.008 - 60

Jim DiGriz
2009-02-19, 21:42
Thanks!

Short question: does it pull the movie info from the URL in the info file, like the IMDb scraper does?

If not could you please build it in?

spiff
2009-02-19, 23:06
please use trac

hamp
2009-02-22, 13:17
Jim DiGriz, where are you from? For anyone interested: I will soon make proper fanart support.

hamp
2009-02-25, 18:32
New version.

spiff
2009-02-25, 18:40
can you PLEASE submit a ticket on trac once you feel it is svn ready. i can't keep track of all these forum threads...

hamp
2009-02-26, 14:02
can you PLEASE submit a ticket on trac once you feel it is svn ready. i can't keep track of all these forum threads...

How?

Maxim
2009-02-26, 14:13
http://xbmc.org/trac

Use your forum account to log in. Note: BOTH Login and Password are Case-Sensitive.

hamp
2009-02-26, 15:08
<span style="font-size:13px;color:#666">The ([^<]+)</span>
How can I cut the leading "The" when it is there?

spiff
2009-02-26, 15:22
you want to match whether or not the The is there?


<expression>&lt;span[^&gt;]*&gt;(The )?([^&lt;]+)&lt;/span</expression>

hamp
2009-02-26, 15:28
I want to delete the "The".
The Dark Knight -> Dark Knight; Bolt -> Bolt.

<RegExp conditional="fanart" input="$$8" output="&lt;url function=&quot;GMA&quot;&gt;http://themoviedb.org/search?search%5Btext%5D=\1&lt;/url&gt;" dest="5+">
<RegExp input="$$1" output="\1&lt;" dest="8">
<expression>&lt;span style=&quot;font-size:13px;color:#666&quot;&gt;(The )?([^&lt;]+)&lt;/span&gt;</expression>
</RegExp>
<expression>The ([^&lt;]+)&lt;</expression>
</RegExp>

It does not work, and http://xbmc.org/wiki/?title=Scrapers has nothing about this.

spiff
2009-02-26, 16:55
well, the expression is fine... your usage of it is wrong though.


<RegExp conditional="fanart" input="$$8" output="&lt;url function=&quot;GMA&quot;&gt;http://themoviedb.org/search?search%5Btext%5D=\1&lt;/url&gt;" dest="5+">
<RegExp input="$$1" output="\2" dest="8">
<expression>&lt;span style=&quot;font-size:13px;color:#666&quot;&gt;(The )?([^&lt;]+)&lt;/span&gt;</expression>
</RegExp>
<expression/>
</RegExp>


your selection will be available in \2, since \1 may or may not hold (The )
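
To see why the second group is the one to use, here is a minimal sketch of the same expression in Python, outside XBMC. It assumes the XML entities are already decoded and that Python's `re` treats the optional group the same way the scraper's regex engine does; the names `TITLE_RE`, `wrap`, and `title_without_the` are made up for this illustration.

```python
import re

# Same pattern as in the <expression> above, with XML entities decoded.
# Group 1 is the optional "The " prefix, group 2 is the rest of the title.
TITLE_RE = re.compile(
    r'<span style="font-size:13px;color:#666">(The )?([^<]+)</span>'
)


def wrap(title):
    """Build a sample span like the one on the page (hypothetical helper)."""
    return '<span style="font-size:13px;color:#666">%s</span>' % title


def title_without_the(html):
    """Return the title with a leading "The " dropped, or None if no match."""
    match = TITLE_RE.search(html)
    # Group 2 always holds the title; group 1 is None when "The " is absent.
    return match.group(2) if match else None


print(title_without_the(wrap("The Dark Knight")))
print(title_without_the(wrap("Bolt")))
```

Because `(The )` includes the trailing space, a title that merely starts with those letters, such as "Theory", is left alone and lands whole in the second group.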

vdrfan
2009-02-26, 20:26
Is there a reason why you are doing normal web searches instead of using the themoviedb API? Using the API search should give better results and is much more fail safe.

hamp
2009-02-27, 15:46
Because XBMC hangs strangely when using the themoviedb API, so I had to go with the standard search.

Spiff, many thanks for the help in refining it.

GooglieS
2009-05-14, 20:38
This script does not load any information/art from kinopoisk! Is something broken?

vlavrinenko
2009-10-06, 15:14
It seems that kinopoisk.ru changed some internal layout details. Since the author still has not responded, I've made some modifications to the scraper myself. I don't see a way to attach a file, so you can just get it here: ftp://lvu.kiev.ua/KinoPoisk.xml. Maybe the author will want to incorporate these changes into his scraper.

I've tried to fix tmdb fanart scraping, but it works very poorly.

vovanprod
2009-11-07, 13:00
Looks like everything with Kinopoisk has completely stalled... And I wanted it so much...

Tossik
2009-11-08, 06:29
yes can someone please make it work!!!! please!!!!