| By Yakov Fain | Article Rating: |
|
| January 26, 2006 12:00 AM EST | Reads: |
85,352 |
To read local file streams, a program has to specify the file's location, i.e. "c:\practice\training.html". The same procedure is valid for reading of the remote files: just open the stream over the network. Java has a class URL that will help you to connect to a remote computer on the Internet.
At first, create an instance of the class URL:
try{
URL xyz = new URL("http://www.xyz.com:80/training.html");
}
catch(MalformedURLException e){
e.printStackTrace();
}
The MalformedURLException could be thrown if a non-valid URL has been used, for example missed protocol if you forgot to start URL with http://, extra spaces, etc. The MalformedURLException does not indicate that the remote machine has problems - just check the spelling of the URL.
Creation of the URL object does not establish the connection with the remote machine: you'll still need to open a stream to read it. Usually you have to perform the following steps to read a file from the Internet:
Step 1. Create and instance of the class URL
Step 2. Create an instance of the class URLConnection and open a connection using the URL instance from step 1.
Step 3. Get a reference to an input stream of this object by calling the method URLConnection.getInputStream()
Step 4. Read the data from the stream (use the buffered reader to speed up the process).
Since the streams from the package java.io are being used here for the read/write operations, you'll have to handle I/O exceptions.
The server you are trying to connect to has to be up and running and, in case of using http protocol, the special software (Web Server) has to be "listening to" the port that you specified in the URL instance. By default, Web servers are listening to the port number 80.
The program below reads and prints on the system console the content of the file index.html from yahoo.com. Obviously, to test this program your computer has to be connected to the Internet.
import java.net.*;
import java.io.*;
public class WebSiteReader {
public static void main(String args[]){
String nextLine;
URL url = null;
URLConnection urlConn = null;
InputStreamReader inStream = null;
BufferedReader buff = null;
try{
// Create the URL obect that points
// at the default file index.html
url = new URL("http://www.yahoo.com" );
urlConn = url.openConnection();
inStream = new InputStreamReader(
urlConn.getInputStream());
buff= new BufferedReader(inStream);
// Read and print the lines from index.html
while (true){
nextLine =buff.readLine();
if (nextLine !=null){
System.out.println(nextLine);
}
else{
break;
}
}
} catch(MalformedURLException e){
System.out.println("Please check the URL:" +
e.toString() );
} catch(IOException e1){
System.out.println("Can't read from the Internet: "+
e1.toString() );
}
}
}
The class WebSiteReader explicitly creates the URLConnection object. Strictly speaking we could get away just with the class URL:
URL url = new URL("http://www.yahoo.com");
InputStream in = url.getInputStream();
Buff= new BufferedReader(new InputStreamReader(in));
The reason why you may consider using the URLConnection class is that it could give you some additional control over the I/O process. For example, by calling its method setDoInput(true) you could allow (or disallow) downloads.
Connecting Through HTTP Proxy Servers
Most of the companies use firewalls for security reasons and their employees reach the Internet through the HTTP proxy server. Check the settings of your Internet browser to find out the host name and port number of the proxy server. If you are using Microsoft Internet Explorer check the menu Internet Options | Connections | LAN Setting. Netscape Navigator has proxy settings under Preferences | Advanced | Proxies.
While Java Applets know parameters of the proxy servers because they live inside the browser that has the proper settings, Java applications should set the parameters of the proxy server, for example:
System.setProperty("http.proxyHost","xyz.com");
System.setProperty("http.proxyPort", 8080);
If you do not want to hardcode these value, pass them to the program from the command line:
c:\practice>java -Dhttp.proxyHost=xyz.com -Dhttp.proxyPort=8080 WebSiteReader
If you run this program without specifying the proxy settings, you'll get the UnknownHostException.
How to Download Files From the Internet
By using the class URL with input streams, we should be able to download practically any file (images, music, binary files) from the unsecured Internet site. The trick is in proper opening of the file stream. Let‚s write the class FileDownload that takes a URL and the file name as a command line arguments and copies remote file on your local disk.
import java.io.DataOutputStream;
import java.io.FileOutputStream;
import java.io.DataInputStream;
import java.net.URL;
import java.net.URLConnection;
class FileDownloader{
public static void main(String args[]){
if (args.length!=2){
System.out.println(
"Proper Usage: java FileDownloader RemoteFileURL LocalFileName");
System.exit(0);
}
DataInputStream in=null;
DataOutputStream out=null;
FileOutputStream fOut=null;
try{
URL remoteFile=new URL(args[0]);
URLConnection fileStream=remoteFile.openConnection();
// Open the input streams for the remote file
fOut=new FileOutputStream(args[1]);
// Open the output streams for saving this file on disk
out=new DataOutputStream(fOut);
in=new DataInputStream(fileStream.getInputStream());
// Read the remote on save save the file
int data;
while((data=in.read())!=-1){
fOut.write(data);
}
System.out.println("Download of " + args[0] + " is complete." );
} catch (Exception e){
e.printStackTrace();
} finally {
try{
in.close();
fOut.flush();
fOut.close();
} catch(Exception e){e.printStackTrace();}
}
}
}
To download the Yahoo's main page into c:\temp directory start this program as follows:
java FileDownloader http://www.yahoo.com/index.html c:\\temp\\yahoo.html
The Stock Quote Program
In this section we'll write the program that can read stock market quotes from the Internet. There are many Internet sites providing stock market quotes, and 20 minutes delayed quotes are free. Wall Street companies subscribe for the real-time market data feed. One of the popular Internet sites is Yahoo and the URL for getting stock prices is http://finance.yahoo.com Point your Web browser to this site and get the price quote of any stock symbol. Note the URL of the resulting Web page in your browser. For example, if you've selected the symbol IBM, the URL would look like this:
http://finance.yahoo.com/q?s=IBM
Right click on this page and select the View Source from the popup menu to see the HTML contents of this page: you'll see lots of HTML tags and the information about the IBM's trading will be buried somewhere deep inside the file:
... Last Trade:</TD><TD class=yfnc_tabledata1><BIG><B>95.32</B> ...
The next step is to modify the URL in our class WebSiteReader to print the content of the page about the symbol IBM:
url = new URL("http://finance.yahoo.com/q?s=IBM");
You can also store the whole page in a Java String variable instead of printing it. Just change the while loop to look as follows:
String theWholePage;
while (txt =buff.readLine() != null ){
theWholePage=theWholePage + txt;
}
If you add some smart tokenizing of theWholePage to get rid of all HTML tags and everything but Last Trade value, you can create your own little GUI Stock Quote screen. While this approach is useful to sharpen you tokenizing skills, it may not be the best solution, especially if Yahoo will change the wording of this page. That's why we'll be using another Yahoo's URL that provide stock quotes in a cleaner comma separated values format (CSV).
Here's the URL that should be used for the IBM's symbol:
http://quote.yahoo.com/d/quotes.csv?s=IBM&f=sl1d1t1c1ohgv&e=.csvThis URL would produce a string that looks something like this (the price quotes are not real):
"IBM",95.32,"1/16/2004","5:01pm",+1.30,95.00,95.35,94.71,9305000The next class StockQuoter prints the price quote for the symbol that is specified as a command line argument.
import java.net.*;
import java.io.*;
import java.util.StringTokenizer;
public class StockQuoter {
String csvString;
URL url = null;
URLConnection urlConn = null;
InputStreamReader inStream = null;
BufferedReader buff = null;
StockQuoter(String symbol){
try{
url = new
URL("http://quote.yahoo.com/d/quotes.csv?s="
+ symbol + "&f=sl1d1t1c1ohgv&e=.csv" );
urlConn = url.openConnection();
inStream = new
InputStreamReader(urlConn.getInputStream());
BufferedReader buff= new BufferedReader(inStream);
// get the quote as a csv string
csvString =buff.readLine();
// parse the csv string
StringTokenizer tokenizer = new
StringTokenizer(csvString, ",");
String ticker = tokenizer.nextToken();
String price = tokenizer.nextToken();
String tradeDate = tokenizer.nextToken();
String tradeTime = tokenizer.nextToken();
System.out.println("Symbol: " + ticker +
" Price: " + price + " Date: " + tradeDate
+ " Time: " + tradeTime);
} catch(MalformedURLException e){
System.out.println("Please check the spelling of the URL:"
+ e.toString() );
} catch(IOException e1){
System.out.println("Can't read from the Internet: " +
e1.toString() );
}
finally{
try{
inStream.close();
buff.close();
}catch(Exception e){
e.printStackTrace();
}
}
}
public static void main(String args[]){
if (args.length==0){
System.out.println(
"Sample Usage: java StockQuoter IBM");
System.exit(0);
}
StockQuoter sq = new StockQuoter(args[0]);
}
}
To see the latest price of IBM stock start the StockQuoter program as follows:
c:\practice>java StockQuoter IBM
In this lesson I tried to show you that working with the streams over the net may be as simple as dealing with files on your local disk. These days Java runs remote-controlled Mars rovers, and this is not a rocket science anymore - just open a Java stream that points at Mars.
Published January 26, 2006 Reads 85,352
Copyright © 2006 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
- Your First Java Program
- Intro to Object-Oriented Programming with Java
- Methods, Constructors, Overloading and Access Levels
- Java Exceptions
- Java Streams Basics
- Java Serialization - Lesson 7
- Java Serialization
- Teaching Kids Programming: Even Younger Kids Can Learn Java
- Taking a Sun Java Studio Creator for a Drive
- Java Basics: Introduction to Java Threads, Part 1
- Java Gotchas: Instance Variables Hiding
- Java Basics: Introduction to Java Threads, Part 2
- SYS-CON Webcast: Eclipse IDE for Students, Useful Eclipse Tips & Tricks
- Java Basics: Lesson 11, Java Packages and Imports (Live Video Education)
More Stories By Yakov Fain
Yakov Fain is a Managing Director of Farata Systems, consulting, training and product company. He has authored several Java books, dozens of technical articles. SYS-CON Books released his latest co-authored book , Rich Internet Applications with Adobe Flex and Java: Secrets of the Masters in Spring 2007. Sun Microsystems has nominated and awarded Yakov with the title Java Champion. He leads the Princeton Java Users Group. He is an Adobe Certified Flex Instructor. Currently Yakov works on the book for O'Reilly "Enterprise Application Development with Flex". He twits at twitter.com/yfain.
![]() |
Dave Mason 10/18/07 03:00:39 AM EDT | |||
Hello Yakov, |
||||
![]() |
nitin 11/27/04 02:34:32 AM EST | |||
hi, |
||||
![]() |
MIchael Behrens 02/08/04 08:17:43 PM EST | |||
Great Article! Keep''m coming! |
||||
![]() |
Ferruccio Spagna 01/31/04 10:13:33 AM EST | |||
I am very happy with your code, Yakov. |
||||
![]() |
Yakov Fain 01/31/04 08:59:53 AM EST | |||
In this lesson I do this using HTTP GET request by attaching parameters to the URL after a question mark separated with an ampersand. Fo HTTP POST, after getting the URLConnection stream, do something like this: urlConn.setRequestProperty("Content-Type", "application/x-www-form-urlencoded"); String myData = "myParam1=" + URLEncoder.encode ("abc") + DataOutputStream encodedParams = new DataOutputStream (urlConn.getOutputStream ()); encodedParams.writeBytes (myData); |
||||
![]() |
Ferruccio Spagna 01/31/04 08:29:47 AM EST | |||
Ok. What you say occurs when you have a page with a form. Well known. But my question was: you want to read the stream (see your last article) and perhaps manipulate it. You then use the code: |
||||
![]() |
Yakov Fain 01/31/04 08:16:45 AM EST | |||
Well, here''s the sample from my book "The Java Tutorial for the Real World". 1. An HTML form has 2 fields to perform a book search: Find a book Enter a word from the book title: 2. The servlet FindBooks runs on the server side, gets the parameters and sends back to the browser a page that reads that this book cost $65: public class FindBooks extends javax.servlet.http.HttpServlet { public void doGet(HttpServletRequest req, out.println(""); out.println("the book "+title+" costs only $65"); out.println("Please enter your credit card number"); out.println(""); |
||||
![]() |
Ferruccio Spagna 01/31/04 07:33:44 AM EST | |||
Thank you for your answer. I have several good Java books, but I didn''t find how I can read a CGI URL stream passing POST parameters in the calling instruction anywhere. Tell me please in a few words the way, if it is possible. |
||||
![]() |
Yakov Fain 01/30/04 07:38:59 PM EST | |||
Ferruccio, Java Servlets and JavaServer Pages technologies deal with HTTP Post and Get requests, parameters, etc. Wait for my lesson on servlets, or get a good book if you need to know the answer now. |
||||
![]() |
Ferruccio Spagna 01/30/04 06:10:50 PM EST | |||
I add to my above question: how can I pass the parameter values calling the URL? |
||||
![]() |
Ferruccio Spagna 01/30/04 05:59:24 PM EST | |||
What about reading a CGI URL accepting METHOD=POST parameters? |
||||
![]() |
Yakov Fain 01/29/04 11:27:25 PM EST | |||
Dennis, You can run on the remote machine one of the following: RMI server, SocketServer, a Servlet,FTP server... that has to create an instance of the class java.io.File that points at your directory. The File.list() will return the list of files in this directory as a String array. Now create instances of File for each of the array elements and call File.lastModify() to check the timestamp. After finding the file with the proper date, send its URL (or a stream reference) to the client for reading. |
||||
![]() |
Dennis Christopher 01/29/04 01:44:43 PM EST | |||
I found the article useful, and above average in clarity. I am wondering if anyone knows how you set up the |
||||
![]() |
Josh Davis 01/28/04 12:49:02 PM EST | |||
Yakov, You should mention that HttpURLConnection may ''hang'' when contacting servers that don''t behave correctly. This can cause a lot of problems in a server side application. The solutions are: |
||||
![]() |
John Pantone 01/26/04 01:55:42 PM EST | |||
Very useful, clear. |
||||
![]() |
Yakov Fain 01/23/04 09:49:54 PM EST | |||
If the proxy requires authentication, set the following properties: System.setProperty("http.proxyUser", "JLarkin"); You may also try using the class java.net.Authenticator |
||||
![]() |
John Larkin 01/23/04 11:28:08 AM EST | |||
Our proxy server requires a user_id and password. |
||||
![]() |
Selvan Rajan 01/23/04 10:10:41 AM EST | |||
What would be the case to deal with cookies from some of the web sites? Especially it becomes cumbersome, if there is a redirection involved after setting the cookie. |
||||
![]() |
Debashish 01/23/04 02:07:51 AM EST | |||
In the following para : I think the second line of code should be : |
||||
- The Case for Single-Purpose Services
- Oracle-Sun: IBM Reportedly Behind Delay
- Current Trends in the Data Management Market
- Cloud BI & Amazon VPC
- IBM Puts Systems Chief on Leave of Absence
- GITEX TECHNOLOGY WEEK 2009 Exhibitor Profiles
- The Curious Case of Build Release Management eBook
- Cloud-Oriented Switch Start-up Valued at $230M
- Tips for Efficient PaaS Application Design
- Reporting Solutions Using Crystal Reports for Eclipse
- Java Kicks Ruby on Rails in the Butt
- Interviewing Java Developers With Tears in My Eyes
- The Case for Single-Purpose Services
- Oracle-Sun: IBM Reportedly Behind Delay
- Open Source Selenium Web Application Testing System
- Configuring the PHP Version of Eclipse 3.5 RC4
- Current Trends in the Data Management Market
- What Could You Do with Your Code in 20 Lines or Less?
- JSR Watch: Here’s to Progress
- Free Software Foundation Waves Developers Off Mono & C#
- Java Developer's Journal Exclusive: 2006 "JDJ Editors' Choice" Awards
- The i-Technology Right Stuff
- Creating Web Applications with the Eclipse Web Tools Project
- Eclipse Special: Remote Debugging Tomcat & JBoss Apps with Eclipse
- The Next Programming Models, RIAs and Composite Applications
- Where Are RIA Technologies Headed in 2008?
- How to Bring Eclipse 3.1, J2SE 5.0, and Tomcat 5.0 Together
- SYS-CON Webcast: Eclipse IDE for Students, Useful Eclipse Tips & Tricks
- Eclipse: The Story of Web Tools Platform 0.7
- "Eclipse 3.0 is a Great Leap Forward," Says JDJ's Dudney



































