@Leopold_Deux wrote:
Hi, I'm trying to use the manipulate plugin to extract the actual URL from a Google Alerts URL in an RSS feed. Right now a URL looks like this:
https://www.google.com/url?rct=j&sa=t&url=https://www.portada-online.com/2019/08/06/womens-soccer-keeps-getting-marketers-attention-marketing-soccer-marketing-news/&ct=ga&cd=CAIyGmZiZGYyZmRhNGExYjViMjc6Y29tOmVuOlVT&usg=AFQjCNG96Vp9TqDf9KFbY4jbLyx3i3XemA
whereas I just want the actual URL, without the Google parts at the beginning and end of it:
https://www.portada-online.com/2019/08/06/womens-soccer-keeps-getting-marketers-attention-marketing-soccer-marketing-news/
I'm at a loss with regexp, but I think it would be something like this:
MY_FEED:
  rss: https://www.google.com/alerts/feeds/blahblahblah
  accept_all: yes
  manipulate:
    - url:
        extract: ????????anyone can help out with the regexp here?
Edit: I see this might actually be a job for the urlrewrite plugin, but I can't figure it out. It lists Google among the supported sites, but I'm not sure whether that applies to my question, and I can't work out how to write the config so that it doesn't throw errors.
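To make the goal concrete, here is a minimal Python sketch of the extraction itself, using the example URL above. It is not FlexGet config and has not been tested against FlexGet. The function names `extract_with_regex` and `extract_with_query_parser` are made up for illustration. The assumption is that a pattern like `[?&]url=([^&]+)` could also serve as the `extract` value, if manipulate applies Python regular expressions and keeps the captured group.

```python
import re
from urllib.parse import urlparse, parse_qs

# Example Google Alerts redirect link, copied from the post above.
ALERT_URL = (
    "https://www.google.com/url?rct=j&sa=t"
    "&url=https://www.portada-online.com/2019/08/06/"
    "womens-soccer-keeps-getting-marketers-attention-marketing-soccer-marketing-news/"
    "&ct=ga&cd=CAIyGmZiZGYyZmRhNGExYjViMjc6Y29tOmVuOlVT"
    "&usg=AFQjCNG96Vp9TqDf9KFbY4jbLyx3i3XemA"
)

# Capture everything after "url=" up to the next "&".
# Assumption: the target URL itself contains no "&".
PATTERN = re.compile(r"[?&]url=([^&]+)")


def extract_with_regex(link):
    """Return the embedded target URL, or the link unchanged if none is found."""
    match = PATTERN.search(link)
    return match.group(1) if match else link


def extract_with_query_parser(link):
    """Same idea without a regexp: read the "url" query parameter."""
    values = parse_qs(urlparse(link).query).get("url")
    return values[0] if values else link


if __name__ == "__main__":
    print(extract_with_regex(ALERT_URL))
    print(extract_with_query_parser(ALERT_URL))
```

The query-parser variant also decodes percent-encoded characters, which the plain regexp does not, so the two could differ on links where the embedded URL is encoded.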