View Full Version : Excalibur Scraper - Near to be the best (after little help) for +18 Adult Movies
Good evening. After looking the Excalibur scraper, i realized it need to be updated. Of course, i'm not able to correct the strings, but with little help, this scraper can become the best for porn movies. the first thing i did, is to update the thumb link : much better quility, without blue and withe strokes and higher resolution =).
result in xbmc :
http://cestvouslesbons.free.fr/screenshot001.jpg
http://cestvouslesbons.free.fr/screenshot002.jpg
As you can see, the new thumb is wonderful (not photoshoped, directly grabbed from excalibur, i simply changed the thumb link in the excalibur.xml)
Download the scraper : (updated 26 march 09)
http://xbmc.org/trac/raw-attachment/ticket/6155/Excalibur.xml
Bleckshire
2008-11-30, 03:31
Alright, gotta head to work but I'll post what I've done so far:
- Added the film runtime.
- Fixed the "Year" tag so it only picks up the year and not the whole date. (Notice your screenshot has a (6) next to the title because it's picking up the month part of the date.)
- Fixed the "Plot" tag so it doesn't pick up all the extra bs afterwards.
- Fixed the "Director" tag so it picks up the name of the director based on the name displayed, not the link. (I did this because there are a few low budget directors who Excalibur doesn't have profiles for; therefore, there aren't any links for them so the original script would pick up nothing.)
Minor annoyances
- For some reason, I can't get the last quote of the plot to stop being picked up. Not really a big deal so I left it alone.
When I get back home I'll clean up a few things and see what else I can do (rating, cast, etc.).
http://www.bleckshire.com/screenshot000.jpg
<scraper name="Excalibur" content="movies" thumb="excalibur.jpg">
<NfoUrl dest="3">
<RegExp input="$$1" output="<url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url>" dest="3">
<expression noclean="1">excaliburfilms.com/AdultDVD/([^.]*)</expression>
</RegExp>
</NfoUrl>
<CreateSearchUrl dest="3">
<RegExp input="$$1" output="<url>http://search.excaliburfilms.com/SmartSearch_result.cfm?title_in=\1</url>" dest="3">
<expression noclean="1"></expression>
</RegExp>
</CreateSearchUrl>
<GetSearchResults dest="6">
<RegExp input="$$5" output="<?xml version="1.0" encoding="iso-8859-1" standalone="yes"?><results>\1</results>" dest="6">
<RegExp input="$$1" output="\1" dest="4">
<expression><a href="([^/]*).htm"></expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>/1</title><url>http://www.excaliburfilms.com/AdultDVD/$$4.htm</url></entity>" dest="5">
<expression clear="yes">Buy ([^D]*),</expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>\2</title><url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url></entity>" dest="5+">
<expression repeat="yes"><a href="http://www\.excaliburfilms\.com/AdultDVD/([^.]*)\.htm" class="searchTitle12"><font class="searchTitle12">([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetSearchResults>
<GetDetails dest="3">
<RegExp input="$$5" output="<details>\1</details>" dest="3">
<RegExp input="$$1" output="<thumb>http://images.excaliburfilms.com/DVD/reviews/imagesCH3WY/largemoviepic/\1</thumb>" dest="5">
<expression>imagesCH3WY/([^"]*)</expression>
</RegExp>
<RegExp input="$$1" output="<title>\1</title>" dest="5+">
<expression>Buy ([^D]*) </expression>
</RegExp>
<RegExp input="$$1" output="<actor><name>\1</name></actor>" dest="5+">
<expression repeat="yes"><a[^>]*> ([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<genre>\1</genre>" dest="5+">
<expression>href='/DVD/[^>]*>([^<]*)</a><BR></expression>
</RegExp>
<RegExp input="$$1" output="<year>\1</year>" dest="5+">
<expression>/excal/release[^>]*>[0-9]*/[0-9]*/([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<runtime>\1 mins.</runtime>" dest="5+">
<expression>Time: </font><[^>]*>([0-9]*)</expression>
</RegExp>
<RegExp input="$$1" output="<plot>\1</plot>" dest="5+">
<expression>class="lineHeader">&quot;([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<studio>\1</studio>" dest="5+">
<expression>href='/dvd/adultstudios/[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<director>\1</director>" dest="5+">
<expression>>Director: </font>[^>]*>([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetDetails>
</scraper>
Woaaa !! so much better :D Can't wait the end of your work.
- For the "year" it's ok. i know why i get numbers like "6" before, thanks ;)
- About casting, scrapers are able to grad actress pictures for example ? i don't know if there is this feature in scrapers like grabbing infos from http://www.excaliburfilms.com/pornlist/starpgs/Sophia_Santi.htm
Also i noticed 2 issues :
- The plot is not complete :)
On your screenshot, we have :
Sophia Santi, Kortni Kiss, Sabrina Rose, Evie Delatosso, Savanah Gold. With Stoya, juicy boy/girl romps, anal, tons of tease and titty, My First Porn 10 has it all. Enjoy some first-class, good time Digital Playground humor interlaced between all the delicious stars, all of which makes My First Porn 10 insanely satisfying and extremely entertaining."
But here is the complete plot :
The’ My First Porn’ series is a comical sex-spectacle of Jack directing his first porn movie. Volume 10 is over 2 hours and features Stoya in a brand-new, amazing ass-a-liscious anal scene. The A-list cast includes sensational fuck stars, Sophia Santi, Kortni Kiss, Sabrina Rose, Evie Delatosso, Savanah Gold. With Stoya, juicy boy/girl romps, anal, tons of tease and titty, My First Porn 10 has it all. Enjoy some first-class, good time Digital Playground humor interlaced between all the delicious stars, all of which makes My First Porn 10 insanely satisfying and extremely entertaining.
2nd issue : If you can look the casting when you have time. Old excalibur scraper missed "stoya" actress for example. Dunno why ;)
I get excalibur.jpg if you need it ;)
http://cestvouslesbons.free.fr/excalibur.jpg
thanks again Bleckshire for your help, can't wait to see what you want to do :P
Bleckshire
2008-12-01, 02:43
Woaaa !! so much better :D Can't wait the end of your work.
- For the "year" it's ok. i know why i get numbers like "6" before, thanks ;)
- About casting, scrapers are able to grad actress pictures for example ? i don't know if there is this feature in scrapers like grabbing infos from http://www.excaliburfilms.com/pornlist/starpgs/Sophia_Santi.htm
Also i noticed 2 issues :
- The plot is not complete :)
On your screenshot, we have :
Sophia Santi, Kortni Kiss, Sabrina Rose, Evie Delatosso, Savanah Gold. With Stoya, juicy boy/girl romps, anal, tons of tease and titty, My First Porn 10 has it all. Enjoy some first-class, good time Digital Playground humor interlaced between all the delicious stars, all of which makes My First Porn 10 insanely satisfying and extremely entertaining."
But here is the complete plot :
The’ My First Porn’ series is a comical sex-spectacle of Jack directing his first porn movie. Volume 10 is over 2 hours and features Stoya in a brand-new, amazing ass-a-liscious anal scene. The A-list cast includes sensational fuck stars, Sophia Santi, Kortni Kiss, Sabrina Rose, Evie Delatosso, Savanah Gold. With Stoya, juicy boy/girl romps, anal, tons of tease and titty, My First Porn 10 has it all. Enjoy some first-class, good time Digital Playground humor interlaced between all the delicious stars, all of which makes My First Porn 10 insanely satisfying and extremely entertaining.
2nd issue : If you can look the casting when you have time. Old excalibur scraper missed "stoya" actress for example. Dunno why ;)
I get excalibur.jpg if you need it ;)
http://cestvouslesbons.free.fr/excalibur.jpg
thanks again Bleckshire for your help, can't wait to see what you want to do :P
I'll take a look at the casting. The most I'd be able to grab from the star info would be a thumbnail for each and that's depending on how they file them. I'll look into it though. And the plot actually pulls the complete plot. The reason that screenshot only shows the ending half of it is because in that view, the plot is actually scrolling. So I took the shot once it scrolled to the end of the plot. :P I'll take a look at the casting. I didn't even look at it before because I just figured it was already picking up everything.
I'll take a look at the casting. The most I'd be able to grab from the star info would be a thumbnail for each and that's depending on how they file them. I'll look into it though. And the plot actually pulls the complete plot. The reason that screenshot only shows the ending half of it is because in that view, the plot is actually scrolling. So I took the shot once it scrolled to the end of the plot. :P I'll take a look at the casting. I didn't even look at it before because I just figured it was already picking up everything.
guys..: thank you for your effort and your fantastic scrapter for adult content!
:laugh:
Bleckshire
2008-12-01, 14:24
Okay, this is probably about where I'll stop. Let me go over a few of the ups and downs of what I've done with this scraper:
- As I said in my previous post, runtime was added, the year has been changed to detect the year and not the month (or the whole mm/dd/yyyy date as it originally was), the plot should pick up JUST the plot and nothing extra, and the director should pick up correctly now (see below for a comment on this).
What I've added since:
- I was able to get the stars' thumbnails for those listed in the cast. It should display the main picture on that star's profile page. Here's the image displayed on Sabrina Rose's profile page:
http://images.excaliburfilms.com/pornlist/starpicsCH3WY/Sabrina_Rose.jpg
Here's a screenshot of the cast listing:
http://www.bleckshire.com/screenshot002.jpg
- That damn quote at the end of the plot was annoying the hell out of me so I rewrote the plot detection and figured out why it was doing that. Unfortunately, I couldn't remove it (at least not without compromising any other possible film plots I haven't seen yet) so instead, I got it to leave the opening quote, so now it detects the plot just as it is on the page, inside quotes.
Now, for the bad:
- Artik, I did figure out why it wasn't picking up Stoya when pulling the cast. It's related to the same issue I was mentioning with the director. The original script was using links (a href tags) to pull cast and directors. As with the director, if Excalibur does not have a profile for the star/director, then the page obviously won't create a link and just has the star/director listed in plain text. So I did what I did with the director and was able to detect Stoya in our example film. Problem with that was that it wasn't detecting any of the other stars. With the way Excalibur lists the cast, there's really no way of hopping around each star if some have profiles and some don't (someone more proficient in regexp might be able to do it, but it still seems like it'd be a lot of work). Now the reason my change to the director works is because they have only one director listed. I'm sure if they had multiple directors listed, the script wouldn't pull both. So, I left it as is. It will detect all stars in the cast that have profile pages. Granted, there are some films that have a complete cast with no profiles at all. If it hits one of those, it won't pull any of the cast obviously.
- Couldn't pull the rating either since Excalibur doesn't actually list a numbered rating to pull. They just have 'star-5.jpg' for example. Not much I can do with that.
Looks like that's just about everything I can change. Hope that helps ya out, artik and anyone else who wanted to use this.
<scraper name="Excalibur" content="movies" thumb="excalibur.jpg">
<NfoUrl dest="3">
<RegExp input="$$1" output="<url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url>" dest="3">
<expression noclean="1">excaliburfilms.com/AdultDVD/([^.]*)</expression>
</RegExp>
</NfoUrl>
<CreateSearchUrl dest="3">
<RegExp input="$$1" output="<url>http://search.excaliburfilms.com/SmartSearch_result.cfm?title_in=\1</url>" dest="3">
<expression noclean="1"></expression>
</RegExp>
</CreateSearchUrl>
<GetSearchResults dest="6">
<RegExp input="$$5" output="<?xml version="1.0" encoding="iso-8859-1" standalone="yes"?><results>\1</results>" dest="6">
<RegExp input="$$1" output="\1" dest="4">
<expression><a href="([^/]*).htm"></expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>/1</title><url>http://www.excaliburfilms.com/AdultDVD/$$4.htm</url></entity>" dest="5">
<expression clear="yes">Buy ([^D]*),</expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>\2</title><url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url></entity>" dest="5+">
<expression repeat="yes"><a href="http://www\.excaliburfilms\.com/AdultDVD/([^.]*)\.htm" class="searchTitle12"><font class="searchTitle12">([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetSearchResults>
<GetDetails dest="3">
<RegExp input="$$5" output="<details>\1</details>" dest="3">
<RegExp input="$$1" output="<thumb>http://images.excaliburfilms.com/DVD/reviews/imagesCH3WY/largemoviepic/\1</thumb>" dest="5">
<expression>imagesCH3WY/([^"]*)</expression>
</RegExp>
<RegExp input="$$1" output="<title>\1</title>" dest="5+">
<expression>Buy ([^D]*) </expression>
</RegExp>
<RegExp input="$$1" output="<actor><name>\2</name><thumb>http://images.excaliburfilms.com/pornlist/starpicsCH3WY/\1.jpg</thumb></actor>" dest="5+">
<expression repeat="yes"><a href="/pornlist/starpgs/([^\.]*)[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<genre>\1</genre>" dest="5+">
<expression>href='/DVD/[^>]*>([^<]*)</a><BR></expression>
</RegExp>
<RegExp input="$$1" output="<year>\1</year>" dest="5+">
<expression>/excal/release[^>]*>[0-9]*/[0-9]*/([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<runtime>\1 mins</runtime>" dest="5+">
<expression>Time: </font><[^>]*>([0-9]*)</expression>
</RegExp>
<RegExp input="$$1" output="<plot>\1</plot>" dest="5+">
<expression>Description:</font></a>[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<studio>\1</studio>" dest="5+">
<expression>href='/dvd/adultstudios/[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<director>\1</director>" dest="5+">
<expression>>Director: </font>[^>]*>([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetDetails>
</scraper>
Amazing work Bleckshire, many thx again. But, cast don't display the actress picture with the actual code ;) did you update it ? or is there something special to do ?
<scraper name="Excalibur" content="movies" thumb="excalibur.jpg">
<NfoUrl dest="3">
<RegExp input="$$1" output="<url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url>" dest="3">
<expression noclean="1">excaliburfilms.com/AdultDVD/([^.]*)</expression>
</RegExp>
</NfoUrl>
<CreateSearchUrl dest="3">
<RegExp input="$$1" output="<url>http://search.excaliburfilms.com/SmartSearch_result.cfm?title_in=\1</url>" dest="3">
<expression noclean="1"></expression>
</RegExp>
</CreateSearchUrl>
<GetSearchResults dest="6">
<RegExp input="$$5" output="<?xml version="1.0" encoding="iso-8859-1" standalone="yes"?><results>\1</results>" dest="6">
<RegExp input="$$1" output="\1" dest="4">
<expression><a href="([^/]*).htm"></expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>/1</title><url>http://www.excaliburfilms.com/AdultDVD/$$4.htm</url></entity>" dest="5">
<expression clear="yes">Buy ([^D]*),</expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>\2</title><url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url></entity>" dest="5+">
<expression repeat="yes"><a href="http://www\.excaliburfilms\.com/AdultDVD/([^.]*)\.htm" class="searchTitle12"><font class="searchTitle12">([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetSearchResults>
<GetDetails dest="3">
<RegExp input="$$5" output="<details>\1</details>" dest="3">
<RegExp input="$$1" output="<thumb>http://images.excaliburfilms.com/DVD/reviews/imagesCH3WY/largemoviepic/\1</thumb>" dest="5">
<expression>imagesCH3WY/([^"]*)</expression>
</RegExp>
<RegExp input="$$1" output="<title>\1</title>" dest="5+">
<expression>Buy ([^D]*) </expression>
</RegExp>
<RegExp input="$$1" output="<actor><name>\2</name><thumb>http://images.excaliburfilms.com/pornlist/starpicsCH3WY/\1.jpg</thumb></actor>" dest="5+">
<expression repeat="yes"><a href="/pornlist/starpgs/([^\.]*)[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<genre>\1</genre>" dest="5+">
<expression>href='/DVD/[^>]*>([^<]*)</a><BR></expression>
</RegExp>
<RegExp input="$$1" output="<year>\1</year>" dest="5+">
<expression>/excal/release[^>]*>[0-9]*/[0-9]*/([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<runtime>\1 mins</runtime>" dest="5+">
<expression>Time: </font><[^>]*>([0-9]*)</expression>
</RegExp>
<RegExp input="$$1" output="<plot>\1</plot>" dest="5+">
<expression>Description:</font></a>[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<studio>\1</studio>" dest="5+">
<expression>href='/dvd/adultstudios/[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<director>\1</director>" dest="5+">
<expression>>Director: </font>[^>]*>([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetDetails>
</scraper>
Bleckshire
2008-12-01, 21:37
I'm pretty sure I pasted the correct one. I just ran a couple test movies and they all pulled up the pictures fine. Let me know which particular movie you noticed it on and I'll check it out. I'll paste again just in case I somehow did accidentally change it.
<scraper name="Excalibur" content="movies" thumb="excalibur.jpg">
<NfoUrl dest="3">
<RegExp input="$$1" output="<url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url>" dest="3">
<expression noclean="1">excaliburfilms.com/AdultDVD/([^.]*)</expression>
</RegExp>
</NfoUrl>
<CreateSearchUrl dest="3">
<RegExp input="$$1" output="<url>http://search.excaliburfilms.com/SmartSearch_result.cfm?title_in=\1</url>" dest="3">
<expression noclean="1"></expression>
</RegExp>
</CreateSearchUrl>
<GetSearchResults dest="6">
<RegExp input="$$5" output="<?xml version="1.0" encoding="iso-8859-1" standalone="yes"?><results>\1</results>" dest="6">
<RegExp input="$$1" output="\1" dest="4">
<expression><a href="([^/]*).htm"></expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>/1</title><url>http://www.excaliburfilms.com/AdultDVD/$$4.htm</url></entity>" dest="5">
<expression clear="yes">Buy ([^D]*),</expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>\2</title><url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url></entity>" dest="5+">
<expression repeat="yes"><a href="http://www\.excaliburfilms\.com/AdultDVD/([^.]*)\.htm" class="searchTitle12"><font class="searchTitle12">([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetSearchResults>
<GetDetails dest="3">
<RegExp input="$$5" output="<details>\1</details>" dest="3">
<RegExp input="$$1" output="<thumb>http://images.excaliburfilms.com/DVD/reviews/imagesCH3WY/largemoviepic/\1</thumb>" dest="5">
<expression>imagesCH3WY/([^"]*)</expression>
</RegExp>
<RegExp input="$$1" output="<title>\1</title>" dest="5+">
<expression>Buy ([^D]*) </expression>
</RegExp>
<RegExp input="$$1" output="<actor><name>\2</name><thumb>http://images.excaliburfilms.com/pornlist/starpicsCH3WY/\1.jpg</thumb></actor>" dest="5+">
<expression repeat="yes"><a href="/pornlist/starpgs/([^\.]*)[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<genre>\1</genre>" dest="5+">
<expression>href='/DVD/[^>]*>([^<]*)</a><BR></expression>
</RegExp>
<RegExp input="$$1" output="<year>\1</year>" dest="5+">
<expression>/excal/release[^>]*>[0-9]*/[0-9]*/([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<runtime>\1 mins</runtime>" dest="5+">
<expression>Time: </font><[^>]*>([0-9]*)</expression>
</RegExp>
<RegExp input="$$1" output="<plot>\1</plot>" dest="5+">
<expression>Description:</font></a>[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<studio>\1</studio>" dest="5+">
<expression>href='/dvd/adultstudios/[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<director>\1</director>" dest="5+">
<expression>>Director: </font>[^>]*>([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetDetails>
</scraper>
please post a trac ticket (diff), then i will include in svn
I'm pretty sure I pasted the correct one. I just ran a couple test movies and they all pulled up the pictures fine. Let me know which particular movie you noticed it on and I'll check it out. I'll paste again just in case I somehow did accidentally change it.
i tried on my first porn movie 10 :)
i tried on my first porn movie 10 :)
[offtopic] artik: mind telling me which skin you are working with when you created those screenshots in your postings? it looks like a really cool skin.
thanks ;)
[offtopic] artik: mind telling me which skin you are working with when you created those screenshots in your postings? it looks like a really cool skin.
thanks ;)
mediastream 0.99, amazing skin ;)
http://www.teamrazorfish.co.uk/mediastream.html
http://xbmc.org/forum/showthread.php?t=40804
mediastream 0.99, amazing skin ;)
http://www.teamrazorfish.co.uk/mediastream.html
http://xbmc.org/forum/showthread.php?t=40804
indeed! thanks for the link, bro..
please post a trac ticket (diff), then i will include in svn
Let's work a little bit more on it, and that will be done ! :)
I'm pretty sure I pasted the correct one. I just ran a couple test movies and they all pulled up the pictures fine. Let me know which particular movie you noticed it on and I'll check it out. I'll paste again just in case I somehow did accidentally change it.
It seems it doesn't work here (refreshed my first porn, and tried on few others movies ...) but plot, runtime, director, everything work ;) :
http://cestvouslesbons.free.fr/screenshot111.jpg
Bleckshire
2008-12-02, 15:43
Hmm. I'll have to look into it in a bit. It seems to be working over here on various titles. Here's a shot of the results it pulls through the scraper testing exe:
http://www.bleckshire.com/exscrape.jpg
Looks like all the information, including the cast thumbs, are pulling. Ran it on my xbox as well and it worked. Can anyone else try this and let me know their results so we can narrow it down?
Looks like all the information, including the cast thumbs, are pulling. Ran it on my xbox as well and it worked. Can anyone else try this and let me know their results so we can narrow it down?
Ok, my fault sorry, "Grad actors thumb" must be checked on the xbmc settings ;) Now it's Ok !
But .... the actress size (very little) is due to the skin ?
Bleckshire
2008-12-03, 06:49
Ok, my fault sorry, "Grad actors thumb" must be checked on the xbmc settings ;) Now it's Ok !
But .... the actress size (very little) is due to the skin ?
Hehe. Glad it works for you now. Enjoy :P
EDIT: Most likely due to the skin. It's grabbing the main picture on the star's profile page. (ie.http://images.excaliburfilms.com/pornlist/starpicsCH3WY/Sophia_Santi.jpg)
Hehe. Glad it works for you now. Enjoy :P
EDIT: Most likely due to the skin. It's grabbing the main picture on the star's profile page. (ie.http://images.excaliburfilms.com/pornlist/starpicsCH3WY/Sophia_Santi.jpg)
That was what i thought ;) need to move a little bit the mediastream topic now ! :)
http://xbmc.org/forum/showthread.php?t=41495 lol
Oh Bleckshire, 2 things :
- Could you take the ticket as asked spiff :
please post a trac ticket (diff), then i will include in svn
- And 2nd thing : check your personnal messages ! :D
Bleckshire
2008-12-03, 21:45
Submitted the ticket and spiff already grabbed it.
jeffbacon
2008-12-07, 23:00
Forgive my ignorance but how do I actually integrate this scraper in my XBMC? I assume I copy the xml to a file and put it in a dir but can someone point me to the correct dir?
Bleckshire
2008-12-08, 05:52
Forgive my ignorance but how do I actually integrate this scraper in my XBMC? I assume I copy the xml to a file and put it in a dir but can someone point me to the correct dir?
Q:\system\scrapers\video\
That's the directory if you've got XBMC on an Xbox of course. I'm assuming it's the same or similar for other platforms. 'system\scrapers\video'
Anacotic
2009-02-09, 12:30
It doesn´t work for me.
Gets the Plot and some Actors but doesen´t get any Pictures...
It doesn´t work for me.
Gets the Plot and some Actors but doesen´t get any Pictures...
Excalibutfilms changed their jpg URL. Maybe Bleckshire or someone else can update them ?
Also, another bug : the title (for example "my first porn 10" is now grabbed as "my first porn 10, rent my first porn 10")
<scraper name="Excalibur" content="movies" thumb="excalibur.jpg" language="en">
<NfoUrl dest="3">
<RegExp input="$$1" output="<url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url>" dest="3">
<expression noclean="1">excaliburfilms.com/AdultDVD/([^.]*)</expression>
</RegExp>
</NfoUrl>
<CreateSearchUrl dest="3">
<RegExp input="$$1" output="<url>http://search.excaliburfilms.com/SmartSearch_result.cfm?title_in=\1</url>" dest="3">
<expression noclean="1"></expression>
</RegExp>
</CreateSearchUrl>
<GetSearchResults dest="6">
<RegExp input="$$5" output="<?xml version="1.0" encoding="iso-8859-1" standalone="yes"?><results>\1</results>" dest="6">
<RegExp input="$$1" output="\1" dest="4">
<expression><a href="([^/]*).htm"></expression>
</RegExp>
<!-- ajouter -f après $$4 -->
<RegExp input="$$1" output="<entity><title>/1</title><url>http://www.excaliburfilms.com/AdultDVD/$$4-f.htm</url></entity>" dest="5">
<expression clear="yes">Buy ([^D]*),</expression>
</RegExp>
<RegExp input="$$1" output="<entity><title>\2</title><url>http://www.excaliburfilms.com/AdultDVD/\1.htm</url></entity>" dest="5+">
<expression repeat="yes"><a href="http://www\.excaliburfilms\.com/AdultDVD/([^.]*)\.htm" class="searchTitle12"><font class="searchTitle12">([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetSearchResults>
<GetDetails dest="3">
<RegExp input="$$5" output="<details>\1</details>" dest="3">
<RegExp input="$$1" output="<thumbs><thumb>http://images.excaliburfilms.com/DVD/reviews/imagesBB020609/largemoviepic/\1</thumb>" dest="5+">
<expression>imagesBB020609/([^"]*)</expression>
</RegExp>
<RegExp input="$$1" output="<thumb>http://images.excaliburfilms.com/DVD/reviews/imagesBB020609/largemoviepic/\1-b.jpg</thumb></thumbs>" dest="5+">
<expression>imagesBB020609/([^\.]*)</expression>
</RegExp>
<RegExp input="$$1" output="<title>\1</title>" dest="5+">
<expression>Buy ([^D]*) </expression>
</RegExp>
<RegExp input="$$1" output="<actor><name>\2</name><thumb>http://images.excaliburfilms.com/pornlist/starpicsAA020309/\1.jpg</thumb></actor>" dest="5+">
<expression repeat="yes"><a href="/pornlist/starpgs/([^\.]*)[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<genre>\1</genre>" dest="5+">
<expression>href='/DVD/[^>]*>([^<]*)</a><BR></expression>
</RegExp>
<RegExp input="$$1" output="<year>\1</year>" dest="5+">
<expression>/excal/release[^>]*>[0-9]*/[0-9]*/([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<runtime>\1 mins</runtime>" dest="5+">
<expression>Time: </font><[^>]*>([0-9]*)</expression>
</RegExp>
<RegExp input="$$1" output="<plot>\1</plot>" dest="5+">
<expression>Description:</font></a>[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<studio>\1</studio>" dest="5+">
<expression>href='/dvd/adultstudios/[^>]*>([^<]*)</expression>
</RegExp>
<RegExp input="$$1" output="<director>\1</director>" dest="5+">
<expression>>Director: </font>[^>]*>([^<]*)</expression>
</RegExp>
<expression noclean="1"></expression>
</RegExp>
</GetDetails>
</scraper>
this is working and your getting the front and back cover in thumbnails not just the front
Anacotic
2009-02-20, 17:46
alright, but now i dont get actors ^^
ill take a look today to see if i can fix actors too
ill take a look today to see if i can fix actors too
Oh ! thanks tiben20 ! Can't wait for the actors fix ;)
Ok, the scraper has been updated. We have fixed the title grabbing bug.
It was "my first porn 10, rent my firt porn", now it's "my first porn 10"
My friend will fix the actor bug. I'll post it as soon as it will be done.
htpcrazy
2009-03-05, 06:12
i noticed a few issues when feeding some movies in this scraper. check with "Momma Knows Best". for some reason it does not pull the actors from this title. i have not tried with others but check it out.
a request. trying to figure out how xml works and was wondering if someone could paste the code to extract the full date of release instead of just the year? makes sorting easier. thanks
replace the regexp for actors with this and give it a try i cant give it a try since my ip is banned since i did a scan of 9600 movies
<RegExp input="$$1" output="<actor><name>\2</name><thumb>http://Images.ExcaliburFilms.com/pornlist/starpicsAA020309/\1.jpg</thumb></actor>" dest="5+">
<expression repeat="yes"><a href="http://www.excaliburfilms.com/pornlist/starpgs/pornlist/starpgs/([^\.]*)[^>]*>([^<]*)</expression>
</RegExp>
impressive collection mate.
my respects ;)
<RegExp input="$$1" output="<year>\1</year>" dest="5+">
<expression>/excal/release[^>]*>([0-9]*/[0-9]*/[^<]*)</expression>
</RegExp>
and for the date like 10/3/2007 its this one
lol spliff its not the movies in own its only a prog i did on my own which convert ed2k movies into fake .avi and i started to scrape the folder with thoes fake .avi and here i a am banned lol
htpcrazy
2009-03-05, 18:19
test both of your codes for actors and year. actors no long display and year only shows the month...is there a format that xmbc is expecting for "year"? or can it be any type of string?
year is just that - year. only year.
htpcrazy
2009-03-05, 18:50
ah thanks...wish i could sort by yyyy/mm/dd. easier to find most/least recent videos.
replace the regexp for actors with this and give it a try i cant give it a try since my ip is banned since i did a scan of 9600 movies
<RegExp input="$$1" output="<actor><name>\2</name><thumb>http://Images.ExcaliburFilms.com/pornlist/starpicsAA020309/\1.jpg</thumb></actor>" dest="5+">
<expression repeat="yes"><a href="http://www.excaliburfilms.com/pornlist/starpgs/pornlist/starpgs/([^\.]*)[^>]*>([^<]*)</expression>
</RegExp>
Tiben, no actors are grabbed with this code :S
oops i just pasted the code with an error.
http://www.excaliburfilms.com/pornlist/starpgs/pornlist/starpgs/
instead of
http://www.excaliburfilms.com/pornlist/starpgs/
<RegExp input="$$1" output="<actor><name>\2</name><thumb>http://Images.ExcaliburFilms.com/pornlist/starpicsAA020309/\1.jpg</thumb></actor>" dest="5+">
<expression repeat="yes"><a href="http://www.excaliburfilms.com/pornlist/starpgs/([^\.]*)[^>]*>([^<]*)</expression>
</RegExp>
oops i just pasted the code with an error.
http://www.excaliburfilms.com/pornlist/starpgs/pornlist/starpgs/
instead of
http://www.excaliburfilms.com/pornlist/starpgs/
Ok Tiben, but there is a bigger issue : Like said htpcrazy, if you try a movie like "momma knows best" the actors are not grabbed at all.
htpcrazy
2009-03-06, 23:34
found some issues here. i did a search for Naked Housewives . if you go through the browser the correct movie can be found as a first result. but through the scraper, it is not there. i'm guessing there is a bug to do with exact matches.
found some issues here. i did a search for Naked Housewives . if you go through the browser the correct movie can be found as a first result. but through the scraper, it is not there. i'm guessing there is a bug to do with exact matches.
Exactly, search engine has also a bug. For example, to find "I love Tory", i have to type : "i love t"
The scraper don't get results with "i love tory"
I would like to thanks vdrfan for his amazing work. EVERYTHING is corrected, and work !
- Titles are correctly grabbed (without "rent this dvd")
- Search works now like a charm ! for example : "I love Tory" or "Naked Housewives" finally get a result, and the good one ! :D
- Actors / actress are now fully grabbed (Ouch ! all your collection need to be reactualised, most of actors / actress weren't grabbed befor ....)
Many thx again vdrfan, really ! Here is the final code, pasted from the Excalibur.xml, in the svn build r18898
please refrain from filling up our forum db with those pastes. thx
please refrain from filling up our forum db with those pastes. thx
Sorry, never seen that in the rules or somewhere, I never supposed that disturbed you.
Next time, I'll attach a file.
Ok, new version.
Very little bug in the previous release with search results. here is the excalibur.xml :
http://xbmc.org/trac/raw-attachment/ticket/6155/Excalibur.xml
thanks. your xml is malformed though. opening in a browser before you publish is a good idea :)
thanks. your xml is malformed though. opening in a browser before you publish is a good idea :)
Sorry :S
Xml updated
Oops, seems as if a nasty space was added .. I will double check before committing to SVN later today.
are we in trouble again with excalibur?
a while ago i found "Raquel Darrian - Best Ass On The Planet" (btw: it IS the best ass on planet earth hehe) while searching for another title with best - today when scraping for "best ass on the planet" or "raquel darrian" i get no results back from excalibur.
edit: forgot to mention: version used is SVN 20720 win.
Working fine here using the correct "actress" name ;)
"Racquel Darrian" (ok)
"Best Ass On The Planet" (ok)
"Racquel Darrian Best Ass On The Planet" (ok)
While "Racquel Darrian - Best Ass On The Planet" will result in some other flicks from/with her due to the Excaliburfilms.com search engine not dealing with the "-".
Ghettorab
2009-07-06, 03:58
This version doesn't seem to work with the latest version of Plex. It used to work great with previous versions of Plex. It will not find any titles.
You might want to ask in the Plex forum for Plex related support? Scraper works fine in XBMC.