Regular Expressions

This is a discussion on Regular Expressions within the RUBY forums in Programming Languages category; I have a image url like this <img src ="http://www.ingolfwetrust.com/golf-central/content/binary/Davis-Love-III.jpg"> From this url i need to extract only the picture name using regular Expressions. That is I Want to Extract like this => Davis-Love-III.jpg. Any Advices -- Posted via http://www.ruby-forum.com/ ....

Go Back   Application Development Forum > Programming Languages > RUBY

Object Mix

Register FAQ Calendar Search Today's Posts Mark Forums Read
  #1  
Old 08-25-2008, 06:24 AM
Newb Newb
Guest
 
Default Regular Expressions

I have a image url like this <img src
="http://www.ingolfwetrust.com/golf-central/content/binary/Davis-Love-III.jpg">

From this url i need to extract only the picture name using regular
Expressions.
That is I Want to Extract like this => Davis-Love-III.jpg.
Any Advices
--
Posted via http://www.ruby-forum.com/.

Reply With Quote
  #2  
Old 08-25-2008, 06:46 AM
James Coglan
Guest
 
Default Re: Regular Expressions

[Note: parts of this message were removed to make it a legal post.]

2008/8/25 Newb Newb <hema@angleritech.com>

> I have a image url like this <img src
> ="
> http://www.ingolfwetrust.com/golf-ce...s-Love-III.jpg
> ">
>
> From this url i need to extract only the picture name using regular
> Expressions.
> That is I Want to Extract like this => Davis-Love-III.jpg.
> Any Advices




# url = "
http://www.ingolfwetrust.com/golf-central/content/binary/Davis-Love-III.jpg"
filename = url.scan(/[^\/]+/).last

This looks for all sections of the string that do not contain a slash and
picks the last one.

Reply With Quote
  #3  
Old 08-25-2008, 07:21 AM
Newb Newb
Guest
 
Default Re: Regular Expressions

>
> # url = "
> http://www.ingolfwetrust.com/golf-central/content/binary/Davis-Love-III.jpg"
> filename = url.scan(/[^\/]+/).last
>
> This looks for all sections of the string that do not contain a slash
> and
> picks the last one.



Thanks for the reply..it works but my url is like this

<img src
="http://www.ingolfwetrust.com/golf-central/content/binary/Davis-Love-III.jpg">
- <img src
="http://www.ingolfwetrust.com/golf-central/aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf">

so it gives me only this

aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf">


Pls Help me on this


--
Posted via http://www.ruby-forum.com/.

Reply With Quote
  #4  
Old 08-25-2008, 07:44 AM
Lex Williams
Guest
 
Default Re: Regular Expressions

> Thanks for the reply..it works but my url is like this
>
> <img src
> ="http://www.ingolfwetrust.com/golf-central/content/binary/Davis-Love-III.jpg">
> - <img src
> ="http://www.ingolfwetrust.com/golf-central/aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf">
>
> so it gives me only this
>
> aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf">
>
>
> Pls Help me on this


What do you want to do with
aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf ? Are you using it ?
If not , for the rest of the pictures , you could be calling
File.basename on them and get the last part only . For example , on
http://www.ingolfwetrust.com/golf-ce...s-Love-III.jpg
you will receive Davis-Love-III.jpg .
--
Posted via http://www.ruby-forum.com/.

Reply With Quote
  #5  
Old 08-25-2008, 08:13 AM
Newb Newb
Guest
 
Default Re: Regular Expressions

> What do you want to do with
> aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf ? Are you using it ?
> If not , for the rest of the pictures , you could be calling
> File.basename on them and get the last part only . For example , on
> http://www.ingolfwetrust.com/golf-ce...s-Love-III.jpg
> you will receive Davis-Love-III.jpg .



sorry yar ..Nothing Work On That.I get nil value

Any ways i used this expression to extract the imgage url =>
(/<img.*?>/)
It has given me image url.
But I want to get all the url's based on file extensions like jpg and
png..
That Is... if it contains image file extension,Only those image urls
has to fetched


Help Me
--
Posted via http://www.ruby-forum.com/.

Reply With Quote
  #6  
Old 08-25-2008, 08:36 AM
Lee Jarvis
Guest
 
Default Re: Regular Expressions

#!/usr/bin/ruby

require 'rubygems'
require 'hpricot'

ext = %w( .jpg .png .gif ... )
Hpricot(DATA.read).search("img") do |img|
name = File.basename(img.attributes['src'])
puts name if ext.include?(File.extname(name))
end

__END__
[~]$ blah
<img src="http://host.com/foo.jpg" />
<img src="http://ruby-lang.org/something/path/to/image.jpg" />
<img src="http://google.com/PNgdvsbajkDbnm976" />


#=>
foo.jpg
image.jpg
Reply With Quote
  #7  
Old 08-25-2008, 08:38 AM
Lee Jarvis
Guest
 
Default Re: Regular Expressions

#!/usr/bin/ruby

require 'rubygems'
require 'hpricot'

ext = %w( .jpg .png .gif ... )
Hpricot(DATA.read).search("img") do |img|
name = File.basename(img.attributes['src'])
puts name if ext.include?(File.extname(name))
end

__END__
[~]$ blah
<img src="http://host.com/foo.jpg" />
<img src="http://ruby-lang.org/something/path/to/image.jpg" />
<img src="http://google.com/PNgdvsbajkDbnm976" />
Reply With Quote
  #8  
Old 08-25-2008, 09:13 AM
Jesús Gabriel y Galán
Guest
 
Default Re: Regular Expressions

On Mon, Aug 25, 2008 at 2:13 PM, Newb Newb <hema@angleritech.com> wrote:
>> What do you want to do with
>> aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf ? Are you using it ?
>> If not , for the rest of the pictures , you could be calling
>> File.basename on them and get the last part only . For example , on
>> http://www.ingolfwetrust.com/golf-ce...s-Love-III.jpg
>> you will receive Davis-Love-III.jpg .

>
>
> sorry yar ..Nothing Work On That.I get nil value
>
> Any ways i used this expression to extract the imgage url =>
> (/<img.*?>/)
> It has given me image url.
> But I want to get all the url's based on file extensions like jpg and
> png..
> That Is... if it contains image file extension,Only those image urls
> has to fetched


Another way to extract the URL and get only the path:

require 'uri'

tag = '<img src="http://www.ingolfwetrust.com/golf-central/aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf">'
File.basename(URI.split(URI.extract(b)[0])[5])
=> "aggbug.ashx"

Then you can check with File.extname to check for the extensions you
want, as others have shown.

Hope this gives you other ideas,

Jesus.

Reply With Quote
  #9  
Old 08-27-2008, 07:38 AM
Newb Newb
Guest
 
Default Re: Regular Expressions

Hi all ..
I redefine my Question now..i want to get all the image url which has
jpg .png file extensions using regular expressions ..
Any Advices...
--
Posted via http://www.ruby-forum.com/.

Reply With Quote
  #10  
Old 08-27-2008, 09:02 AM
Damjan Rems
Guest
 
Default Re: Regular Expressions

Newb Newb wrote:
>>
>> # url = "
>> http://www.ingolfwetrust.com/golf-central/content/binary/Davis-Love-III.jpg"
>> filename = url.scan(/[^\/]+/).last
>>
>> This looks for all sections of the string that do not contain a slash
>> and
>> picks the last one.

>
>
> Thanks for the reply..it works but my url is like this
>
> <img src
> ="http://www.ingolfwetrust.com/golf-central/content/binary/Davis-Love-III.jpg">
> - <img src
> ="http://www.ingolfwetrust.com/golf-central/aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf">
>
> so it gives me only this
>
> aggbug.ashx?id=56f9e33a-6333-4daf-b4e1-137ddc051adf">
>
>
> Pls Help me on this


Althow it is not the most genius one ;-)

url.split("/").last.gsub(/"|>/,'')


by
TheR
--
Posted via http://www.ruby-forum.com/.

Reply With Quote
Reply


Thread Tools
Display Modes


All times are GMT -5. The time now is 08:37 PM.


Powered by vBulletin® Version 3.7.2
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Optimization by vBSEO 3.2.0
vB Ad Management by =RedTyger=

In an effort to better serve ads to our visitors, cookies are used on objectmix.com. For more information, check out our Privacy Policy.