[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: HttpApi Question

From: Scott Klement <sk@xxxxxxxxxxxxxxxx>
To: HTTPAPI and FTPAPI Projects <ftpapi@xxxxxxxxxxxxxxxxxxxxxx>
Subject: Re: HttpApi Question
Date: Thu, 10 Oct 2013 02:18:57 -0500

Mike,

HTTP contains protocol data, the meta-information that makes HTTP work,and then it also has the data that's being transferred, whcih I shallrefer to as the 'payload' of the transaction. Since the payload is 8-bitbinary data, it can contain any conceivable sequence of bytes...therefore there needs to be some way for HTTP to communicate when the'payload' has finished.... It can't tell that based on the contents ofthe data, because the data can literally contain anything.

There are 3 different ways that HTTP can communicate when the transferof the 'payload' is complete:

1) The most common method is to send a content-length in themeta-information. Then the receiver knows exactly how many bytes toexpect. It knows the file is complete when it has received that many bytes.

This is most common when downloading a file that's on disk... in thiscase the server knows at the outset exactly how big the file is (inbytes) and can just send the length. But, imagine a situation wheredata is being generated by an application... in this case, the servermay not know the total size of the document at the outset. Theapplication may be sending megabytes of data, to the server, and theserver won't want to buffer the whole thing just to calculate the length-- so it receives one buffer of data at a time from the application,then writes it to the client, then receives the next buffer, etc, untilit reaches the end. In this case it won't know until the end what thetotal length is... so content-length cannot be used.

2) It can use "chunked" transfer-coding. In this case, the HTTP serverwill receive one buffer full from the application, and immediately sendit on to the client. Each buffer full is called a "chunk". HTTP willsend the length of the chunk, followed by the data, then will send thelength of the next chunk, followed by more data, etc. This repeatsuntil the server has no more data to send, in which case it sends alength of 0 to tell the client that the transfer is complete. This way,the server does not need to know the total payload size at the outset,it only needs to know how much it's sending in each chunk...

3) The original way of sending data (which is not recommended as ofHTTP/1.1, but is still supported for backward compatibility with olderHTTP specs) is to indicate the end of the data by disconnecting. Inthis method, the server sends "connection: Close" in the meta-data totell the client to expect that it will be disconnected when the transferis compelte. then, it just streams data until it has no more, andfinally disconnects. The server doesn't need to know the total lengthat the outset, because it does not send a length. When the applicationstops sending it data, it sends it on to the client and then disconnectsto indicate that it's done.

This 3rd method was the original way of doing it in old HTTP specs, butis not recommended because the client cannot detect whether theconnection was dropped due to a communications error, or whether it wasdropped due to the end of the payload being reached. Although thismethod is deprecated, it is still allowed for backward compatibilty inthe HTTP specs.

Anyway, it would seem that the server that Nancy is talking to is usingthis 3rd method. In that case, the "Connection broken" is completelynormal and expected -- it's how the eerver indicates the end of the data.

But, if THAT is the case, then Nancy would not be getting a return valueof -1 from http_url_post -- she'd be getting a 1 = success. She shouldnot be getting an error code from httpapi in that case.

She said that she's getting success immediately followed by an error --and I don't understand what she means by that. If she's actuallygetting an error in the return code from httpapi, then something isstill wrong, and we need to dig deeper.

But, if she's getting a successful response from http_url_post, but ismerely seeing the "connection broken" message in the debug/trace log,then there's no problem at all... everything is working correctly. Inthis case, she is incorrectly interpreting a normal message in the debuglog as an error.... she shouldn't be using the debug log to determineif an error occurred. She should be using the return code fromhttp_url_post.

But, I don't know which of these situations (or potentially, acompletely different one that I haven't thought of) is occurring. Thisis why I need clarification on what she means by "successful responsefollowed immediately by an error".



On 10/9/2013 4:15 PM, Mike Krebs wrote:

One thing I don't see is that your program is getting a length of the data the server is sending. Someone with protocol knowledge would have to answer if that is acceptable but it would mean that recvdoc would have to continue until it received "EOF" (which it will do - right Scott?). But it would not know if it received the entire document because it wouldn't know how long the document is. I wonder if that has something to do with the connection broken problem?


-----------------------------------------------------------------------
This is the FTPAPI mailing list.  To unsubscribe, please go to:
http://www.scottklement.com/mailman/listinfo/ftpapi
-----------------------------------------------------------------------

Follow-Ups:
- Re: HttpApi Question
  - From: naschueller

References:
- Re: HttpApi Question
  - From: naschueller
- RE: HttpApi Question
  - From: Mike Krebs

Prev by Date: RE: HttpApi Question
Next by Date: Re: HttpApi Question
Previous by thread: RE: HttpApi Question
Next by thread: Re: HttpApi Question
Index(es):
- Date
- Thread