Cross-domain requests with jQuery

12 Jan 2010

Chris Heilmann recently posted on how to use YQL to make cross-domain requests, which would usually be prohibited due to the same-domain-policy. I already knew about YQL, but I had no idea that it allowed retrieval of HTML from other sites, via JSON, returned as a single string!

Instead of asking for JSON format, ask for XML, but also add a callback parameter to your query. Voila!

So, in short, YQL allows us to make cross-domain GET requests!

Chris also posted a demo!

With a bit of hacking, we can make jQuery work with YQL for all cross-domain GET requests. UPDATE: I’ve decided to put this in my “jQuery Plugins” repo at Github:

Cross-Domain Ajax mod @ Github

With this mod, any GET request made via jQuery.ajax to another domain will work!

$('#container').load('http://google.com'); // SERIOUSLY!
 
$.ajax({
    url: 'http://news.bbc.co.uk',
    type: 'GET',
    success: function(res) {
        var headline = $(res.responseText).find('a.tsh').text();
        alert(headline);
    }
});
 
// Works with $.get too!

Have fun!

Thanks for reading! Please share your thoughts with me on Twitter. Have a great day!

So far there's been 39 Responses to
“Cross-domain requests with jQuery”

Phunky January 12th, 2010 at 4:02 pm

The ability to scrape HTML from external services (and have it cached by Yahoo!) was one of the main things that excited me when I was at last years London Hackday.

I’ve been meaning to abuse… erm I mean test it for a while but just never got round to it.
Francois Botha January 12th, 2010 at 5:00 pm

And to make POST requests, see http://www.wait-till-i.com/2009/11/16/using-yql-to-read-html-from-a-document-that-requires-post-data/
Cedric Dugas January 12th, 2010 at 5:41 pm

Nice idea, really like it

Christopher January 12th, 2010 at 5:52 pm

Great idea!
YQL even supports SSL, so I would make the yql url dependent on the website’s protocol to avoid mixed content warnings in IE:

    var protocol = location.protocol,
        hostname = location.hostname,
        YQL = protocol + '//query.yahooapis.com/v1/public/yql?callback=?',
        query = 'select * from html where url="{URL}" and xpath="*"';

Besides that, you forgot an semicolon after the “var” declarations. 🙂

James January 12th, 2010 at 6:41 pm

@Francois, looking into that now 🙂 Thanks for the link!

@Christopher, thanks, fixed it!
Vasili January 13th, 2010 at 12:11 am

I use this for a little online recipe bookmarking script I made. It saves me so much time from having to copy and paste the title, URL, and image source. Now I just have to copy the URL, paste it into the URL field and the other two are filled. 😀
malsup January 13th, 2010 at 3:03 am

cool stuff.
Mathias Bynens January 13th, 2010 at 9:31 am

Hmmm… Seems like someone got inspired by your script, and added the same functionality to MooTools: http://mootools.net/shell/aUgSz/
Martin Kirk January 13th, 2010 at 1:25 pm

Yay….

Now when i visit homepages, that look EXACTLY like my bank, and even lets me log ind… will rob all my money…

and furthermore… Steal my Facebook, Gmail, and MSDN — just because i can’t tell the difference between a real login that works, and a fake login that works…

There are good reasons why browsers by default doesn’t support cross site Javascript post/get’s

Gaaahhh
Martin Kirk January 13th, 2010 at 1:27 pm

… Not to mention Content origin…

with Cross-Site JS… you may “steal” content without even using the server (which is done the old way)

By letting Clients grab content – you’ll expose the risk that content providers get hacked/grabbed by zillions of clients without knowing it…
James January 13th, 2010 at 1:30 pm

@Martin, wow, do you even understand how this works? It’s using JSONP to retrieve arbitrary HTML… And the HTML retrieved is that which is seen by the YQL proxy — not by you. This isn’t breaking the same-domain-policy or even trying to. There’s no way that an attacker can harness this in that way, – it’d probably be better for them just to go to the bank’s website and copy the source.
Martin Kirk January 13th, 2010 at 1:39 pm

@James

Yes, i perfectly understand whats going on – Heck i’ve grabbed HTML content from other sites myself – but always using a server-proxy:
client -> server -> X-Site -> sever -> client

the way i read your post you are doing:
client -> X-Site -> client

which is exactly what I’m talking about, no ?
James January 13th, 2010 at 1:40 pm

Nope. client -> YQL Server -> X-Site -> YQL Server -> client

How is client -> X-Site -> client even possible??
Martin Kirk January 13th, 2010 at 1:47 pm

Ahh… that makes me feel more safe 😀

i read that you retrieved HTML… not just JSON’like-data from a trusted server…

Cross-site Requests are not allowed – and only possible with custom plugins (afaik.)
Regent January 14th, 2010 at 9:16 am

Can we expect this in the official assembly of jQuery? 🙂
Eric January 14th, 2010 at 10:28 pm

Umm, is it just me or does anyone else not like the idea of relying on Yahoo’s proxy? If yahoo is down for some reason you’re going to be SOL. Not that this is all that likely, but still, it’s an extra dependency. I mean… are we even sure Yahoo will be in business in a year? 🙂
Mathias Bynens January 14th, 2010 at 11:01 pm

@Eric: If you want to use your own proxy, go ahead. Good luck.
Chris Heilmann January 19th, 2010 at 10:46 am

@Eric comments like this will eventually make me stop caring to build solutions like that. We built YQL because we run our own services on it. We then offer it to the world to make it better and what we get is “I wonder if they’ll be around in a year”. Why I fight the good fight in the company I start to wonder when I get messages like these back. I am quite sure that Yahoo will be around in a year’s time – otherwise I wouldn’t spend that much of my effort in there. If the people who constantly claim that Yahoo is dead while blatantly praising everything else some other companies or random startups do will be I am not too sure about.
David January 20th, 2010 at 2:17 pm

Is it possible, with this plugin, to get the current location of an iFrame?
I mean, I need to get the actual location.href of an iframe, but i get permission denied with simple javascript…
Clayton Carney January 20th, 2010 at 11:44 pm

Perhaps I’m just dense, but I encountered an anomaly with this plugin. I built a simple html file to test the

$(‘#container’).load(‘http://google.com’); // SERIOUSLY!

example. I opened the file in the browser and nothing happened. After stepping through with FireBug, I noticed the request was using a FILE:// protocol to the Yahoo URL. I made the follow change to the plugin:

//YQL = protocol + ‘/query.yahooapis.com/v1/public/yql?callback=?’,
YQL = ‘http://query.yahooapis.com/v1/public/yql?callback=?’,

and it worked fine. Any thoughts on how to modify this plugin so it would work in this situation? I often perform initial development on local files, before moving to a server.
James January 22nd, 2010 at 7:06 pm

@Regent, doubtful. I’m actually quite worried about this getting too popular. The horde of jQuery beginners will think this works like same-domain XHR… which it absolutely doesn’t!

@David, nope. You can’t get the navigated-too location of an external-domain iframe. But you can get its first location… iframeElement.src.

@Clayton, ahh, yes, I added the protocol thing so it would work with https. I’ve updated it, so it should now work locally too. See the commit: http://github.com/jamespadolsey/jQuery-Plugins/commit/3db614a8e3a04f871bccbbe8f18442850ddf19bd
Regent January 22nd, 2010 at 7:24 pm

@James, Yes, you’re right, this is a good reason.
If the Yahoo server is not working, can not use another alternative to 100% result?
For example CssHttpRequest – http://nb.io/hacks/csshttprequest
Eli February 6th, 2010 at 2:26 am

Just curious, but how would you install this in the simplest terms?

Also, is this basically an iFrame functions but with javascript? but without restraining the loaded page to the dimensions of an iframe but use the entire page as if it was loaded locally and not remotely?
Ben Klaswer March 25th, 2010 at 8:34 pm

is there a way to send some request to an extrnal domain (opened in a new popup window), and then the external domain after the user clicks a button set some parameters back to the initial server? We have full acces to both domain to put js on it, is there a way using Jquery

Regards,

Ben Klaswer
ape April 8th, 2010 at 9:40 am

neat & works like a charm 🙂

Unfortunately I can’t get it to work in IE 8 when I change the YQL query to xml for xml responses (select * from xml), am I missing something?
FF parses perfectly, only MSIE does not. Json response is ok, though, but jQuery.find() nor filter does work 🙁
Kate April 27th, 2010 at 7:10 pm

This works great for me when I make the Ajax call immediately upon loading, but if I try to make the call after clicking a button, I get an error response. I’m trying to do this:

$(‘button’).click(function() {
$.ajax({
//AJAX code here
});
});

Clicking the button gives me an error, whereas it works fine immediately upon loading the page if I just have this:

$.ajax({
//AJAX code here
});

I must be missing something obvious…any thoughts?

Thanks!
Kate April 27th, 2010 at 7:29 pm

Disregard my above comment. I figured it out, and it was something unique to my site.

Thanks for the great plug-in! 🙂
Oggyb May 3rd, 2010 at 9:27 pm

Thank you, James. This plugin absolutely made my day!
Oggyb May 3rd, 2010 at 9:41 pm

Having said the above, I’m noticing an anomaly in the content returned from YQL.

I’m using the plugin to .load() content from public Google Calendar events, the url for which is a variable parsed by jQuery. Sometimes the content returned is in German, and sometimes it is (correctly) in English!

Is this the fault of YQL, something I did, something you did, or something Google did? I can’t work it out.

Thanks.
Swashata June 28th, 2010 at 8:27 am

Okay looks like modifying this portion of your plugin solves the error call!
if (_success && data.results[0] != undefined) { // Fake XHR callback. _success.call(this, { responseText: data.results[0] // YQL screws with s // Get rid of them .replace(/]+?/>|/gi, '') }, 'success'); } else { o.error.call(this, 'not received', 'data is null'); }
Please let me know if I am correct or not 🙂
kk August 12th, 2010 at 11:19 am

so what wolud be the best practice to pass xpath parametar , now you use xpath=”*” , but what if i want to filter something on page , for example xpath=’//div[@class=”someContents”]’ … and/or use limit and offset keywords ?
tsu August 17th, 2010 at 8:44 pm

Ermm, and this is so great because…? I’m routinely doing screen-scraping of OPS (other people’s sites) via PHP’s get-contents() function on my own server. This way I get the same code I see in Firebug, which is important to me for extraction. And I don’t have to rely on Yahoo.
Nikita Rybak August 18th, 2010 at 6:49 pm

Thanks for the hack!
One thing, since nobody seemed to note it before.

YQL will refuse to return content if webmaster has banned robots from his site (I tested query on yahoo site and response was very clear). In particular, I was trying to get information from google maps business pages (like http://www.google.com/maps/place?cid=17434047103649409317)
christoff August 19th, 2010 at 12:27 am

Perhaps someone can help me, i need to have jQuery render an xml feed in html cross domain from http://clinicaltrials.gov/search?term=%22lyme+disease%22&studyxml=true for instance.

tried the following but it did not work.

$(document).ready(function(){
$.ajax({
type: “GET”,
url: “http://clinicaltrials.gov/search?term=%22lyme+disease%22&studyxml=true”,
dataType: “xml”,
success: function(xml) {
$(xml).find(‘site’).each(function(){
var nct_id = $(this).attr(‘nct_id’);
var title = $(this).find(‘title’).text();
var url = $(this).find(‘url’).text();
var condition_summary = $(this).find(‘condition_summary’).text();
var condition_summary = $(this).find(‘condition_summary’).text();
$(”).html(‘‘+title+’‘).appendTo(‘#page-wrap’);
$(this).find(‘desc’).each(function(){
var brief = $(this).find(‘brief’).text();
var long = $(this).find(‘long’).text();
$(”).html(brief).appendTo(‘#link_’+id);
$(”).html(long).appendTo(‘#link_’+id);
});
});
}
});
});
Martin Kirk August 20th, 2010 at 1:08 pm

#christoff

you need a serverside-proxy…

James September 1st, 2010 at 12:34 pm

Hey I’m getting a strange error:

Uncaught ReferenceError: jsonp1283340405175 is not defined

Any ideas? Seems like the JSON is not being parsed correctly?

Here’s my ajax request;

$.ajax({
    url: "http://userscripts.org/scripts/show/81657",
    type: "GET",
    success: function(res) {
        var ver = $(res.responseText).find('#summary').text().match(/[v([0-9.]+)]/);
        console.info(ver, "versus", version);
    }
});

Marius September 9th, 2010 at 3:18 pm

data.results[0] is undefined, in firebug console.
– any clue what am I doing wrong ?
intsam September 17th, 2010 at 8:20 pm

Great solution for the cross domain issue. But it didn’t work for some URLs. This URL that I’m trying to load contains most of the dynamic content instead of static. Don’t know if it’s the issue. And the other problem is this didn’t work for me in IE. Worked perfectly in Firefox. Is YQL is browser dependent?
Sebastian September 18th, 2010 at 1:32 am

@James
When I change google.com to facebook.com in the jQuery.ajax function you give in your examples and tests I get an error in my console, that is data.results[0] is undefined. Can someone please give me some advice with this? Here is the code snippet:

$.ajax({ type: 'GET', url: 'http://www.facebook.com', success: function(html){ process(html); }, error: function(){ debug("ajax error"); } }); {/code}

Cross-domain requests with jQuery

So far there's been 39 Responses to “Cross-domain requests with jQuery”

So far there's been 39 Responses to
“Cross-domain requests with jQuery”