ebxml-transport message

Subject: To do: Re:RE: Regarding the Thursday ebXML Conf Call
From: MKudela@uc-council.org
To: dick@8760.com, drummond@onramp.net, ebXML-Transport@lists.oasis-open.org, kit@mitre.org
Date: Fri, 3 Mar 2000 08:35:18 -0500




Subject
        To do: Re:RE: Regarding the Thursday ebXML Conf Call



Starts:

                         Priority:

Due:

                         Type:
                                    Group To Do


                         |-------------|
                         | [ ] Repeats |
                         |-------------|
Owner:
        Melanie Kudela/Princeton/UCC
Assignments already sent

                                                                        To:
               dick@8760.com, drummond@onramp.net,
               ebXML-Transport@lists.oasis-open.org, kit@mitre.org
                                                                        cc:
               rawlins@metronet.com



|------------------|
| [ ] Mark Private |
|------------------|
            Others cannot see any details about this event.
|---------------|
| [ ] Notify me |
|---------------|
            Have Notes notify you before the event.
Categorize:




Description:


__________________




    What was presented to us during our call today was that you were
selecting
mime as "the" ebXML solution.  If that is not the case, then we don't have
an
issue.  Although I personally can't understand why anyone would want to use
an
antiquated technology such as e-mail to forward XML, especially in a
business to
business exchange, I do see the need to accomodate folks desires.  If on
the
other hand, you are selecting mime as "the" ebXML solution, then I think
that
you will loose participation and support of the bulk of the XML community.




Rik, it appears the Requirements group needs to be made aware that people
will want to send XML documents via E-mail and E-mail is not a W3C
specification.

Dick




Hi,
Regarding the EBXML Transport/R/P conference today, my understanding is
that the group is pursuing a mime-based envelope structure? I mentioned
that at the EBXML Requirements teleconference, which happened to be
immediately after the TRP teleconf, and they weren't happy about that
direction and hoped the TRP group will revisit that decision. In
particular, it was suggested that there is an EBXML requirement for all
EBXML specifications to be compliant with W3C technical specificatons.
(Though Mike Rawlins said that the transport may be an exception.) I
cite the relevant section from the EBXML Requirements document, which is
to be released to the full EBXML body for review this Monday.

Kit.


1.2.1 ebXML Vision
The ebXML vision is to deliver:

"A single set of internationally agreed upon technical specifications
that consist of common XML semantics and related document structures to
facilitate global trade."

This single set of ebXML technical specifications will create a Single
Global Electronic Market.  To create this single global electronic
market, this single set of ebXML technical specifications:

 - is fully compliant with W3C XML technical specifications holding a
recommended status,
 - provides for interoperability within and between ebXML compliant
trading partner applications,
 -  maximizes interoperability and efficiency while providing a
transition path from accredited electronic data interchange (EDI)
standards and developing XML business standards, and
 - will be submitted to an appropriate internationally recognized
standards body for accreditation as an international standard.









Perhaps we should do a better job of drawing a clear distinction between
the outer wrapper of an ebXML message (which is where MIME comes in) and
the various components of the message. As I recall we agreed that one of
the key components of an ebXML message is the header. The header will be an
XML document and the header analysis currently under way will presumably
spell out a generic header and an extension mechanism for handling optional
or domain specific headers. There will presumably be a number of other
components (e.g. routing info, manifest ...) that will all be in XML. I
thought we agreed that we want to deal with message payloads that may or
may not be XML documents. Again assuming I am on the right track, then the
question remains. How do we glue multiple objects, some in XML and others
in XYZ format, into a discreet message. I think MIME may serve a very
narrow purpose, i.e. as the glue. Is there an alternative glue standard
that is pure XML *and* can be described using an XML schema language ? Just
because multiple objects are glued together using MIME, does not
subsequently bind us to an SMTP infrastructure.  I think we all agree that
we would like to take a transport neutral approach. Am I correct in
assuming that the purist view is that we should only deal with XML payloads
?





__________________


> The problem that is being addressed by mime encapsulation only arises
when
> there is a need to transmit multiple XML Message bodies and provide
support
> for arbitrarily large binary objects (i.e. images). Most participants
seemed
> to agree that this type of mixed transmission would not be well suited to
> further encapsulation in another XML envelope. A case was sited were only
> one of the messages is desired out of a potentially large multi-part
block.
> With most parser implementations a pure XML encapsulation would only
allow
> the use of SAX parsing to avoid processing the entire message.

I am not sure that this would be possible, but just thinking loudly. Would
this work?

For large files:
1) Compose a XML msg where payload is XML as well. Payload is user specific
so it might be that some of them would need to transfer 30MB image file.
That file could be embedded (CDATA) or it could be referenced (URI) or
whatever else.
2) Send it via ANY reliable protocol. Who cares about packets; requirement
is that the whole XML msg is going to be delivered to the receiving end.
3) Use XML parser on receiving side. If SAX, go to manifest and find that
there is a big file in the payload. Save it somewhere. If DOM good luck?

For recursive "EnvelopeAsAPayload" pattern:
1) On receiving end while digesting manifest, discover that payload has
another envelop(s). Process those recursively.

In both cases MIME is possible, but not required. Anyway, this might be a
good candidate for an early prototype (proof of concept)?

Nikola




__________________


Nikola:

<SNIP>
> With most parser implementations a pure XML encapsulation would only
allow
> the use of SAX parsing to avoid processing the entire message.
</SNIP>

Whoa!! This is bogus.  In the context of valid XML, both SAX and DOM must
parse the entire docoment to compare it with the schema/DTD to ascertain
whether or not it is valid.  A SAX parser can stop parsing if it encounters
syntax error in well formed /valid XML.  You are talking about parsing, not
handlers for encountered instances.

<Nikola>
I am not sure that this would be possible, but just thinking loudly. Would
this work?
For large files:
1) Compose a XML msg where payload is XML as well.</NIKOLA>
This would constitute an XML message.  It is redundant to mention an XML
message and XML payload.

<NIKOLA> Payload is user specific
so it might be that some of them would need to transfer 30MB image file.
That file could be embedded (CDATA) or it could be referenced (URI) or
whatever else.</NIKOLA>
First of all,  you cannot declare binary data as CDATA.  Any handlers will
puke on the input.  "Embed" is a member function of the XLink:show
namespace
which issues a direction to include the referenced data into the document.
If the XLink:actuate is selected to "onLoad", your handler will try to
interpret pure binary data.  The MIME type for XML is Text.  Third - I
cannot think of any rational how a 30mb image file can possibly be
considered part of an XML e-business transaction.  It could be the
"subject"
of the transaction, but XML is a text language.

<NIKOLA>
2) Send it via ANY reliable protocol. Who cares about packets; requirement
is that the whole XML msg is going to be delivered to the receiving
end.</NIKOLA>
I believe that TCP/IP - HTTP is the basis for which this will be built.
The
protocol MUST be O/S and Platform independant.  We cannot accept anything
less. HTTP has many of the built in error handling routines we need for
ebXML and is also easily integrated with security protocols.

<NIKOLA>
3) Use XML parser on receiving side. If SAX, go to manifest and find that
there is a big file in the payload. Save it somewhere. If DOM good
luck?</NIKOLA>
Which XML Parser?  If you are referring to the Perl parser (XML::Parser),
the parser is built on top of James Clark's XPat parser in C.  The parser
does not actually embed the link.  It just reads the text.  It is the
handlers that act upon the parsed XML that perform this functionality.
Therefore, a link to a 30 mb file will parse just as quickly as a link to a
1 mb file.

If you are building the message blocks for transporting,  the construction
of the messaging can be a very important aspect for efficiency.  I do not
think you should force people to use either DOM or SAX as part of a design
rule.




__________________


Nikola:

<SNIP>
> With most parser implementations a pure XML encapsulation would only
allow
> the use of SAX parsing to avoid processing the entire message.
</SNIP>

Whoa!! This is bogus.  In the context of valid XML, both SAX and DOM must
parse the entire docoment to compare it with the schema/DTD to ascertain
whether or not it is valid.  A SAX parser can stop parsing if it encounters
syntax error in well formed /valid XML.  You are talking about parsing, not
handlers for encountered instances.

<Nikola>
I am not sure that this would be possible, but just thinking loudly. Would
this work?
For large files:
1) Compose a XML msg where payload is XML as well.</NIKOLA>
This would constitute an XML message.  It is redundant to mention an XML
message and XML payload.

<NIKOLA> Payload is user specific
so it might be that some of them would need to transfer 30MB image file.
That file could be embedded (CDATA) or it could be referenced (URI) or
whatever else.</NIKOLA>
First of all,  you cannot declare binary data as CDATA.  Any handlers will
puke on the input.  "Embed" is a member function of the XLink:show
namespace
which issues a direction to include the referenced data into the document.
If the XLink:actuate is selected to "onLoad", your handler will try to
interpret pure binary data.  The MIME type for XML is Text.  Third - I
cannot think of any rational how a 30mb image file can possibly be
considered part of an XML e-business transaction.  It could be the
"subject"
of the transaction, but XML is a text language.

<NIKOLA>
2) Send it via ANY reliable protocol. Who cares about packets; requirement
is that the whole XML msg is going to be delivered to the receiving
end.</NIKOLA>
I believe that TCP/IP - HTTP is the basis for which this will be built.
The
protocol MUST be O/S and Platform independant.  We cannot accept anything
less. HTTP has many of the built in error handling routines we need for
ebXML and is also easily integrated with security protocols.

<NIKOLA>
3) Use XML parser on receiving side. If SAX, go to manifest and find that
there is a big file in the payload. Save it somewhere. If DOM good
luck?</NIKOLA>
Which XML Parser?  If you are referring to the Perl parser (XML::Parser),
the parser is built on top of James Clark's XPat parser in C.  The parser
does not actually embed the link.  It just reads the text.  It is the
handlers that act upon the parsed XML that perform this functionality.
Therefore, a link to a 30 mb file will parse just as quickly as a link to a
1 mb file.

If you are building the message blocks for transporting,  the construction
of the messaging can be a very important aspect for efficiency.  I do not
think you should force people to use either DOM or SAX as part of a design
rule.




Message text written by "Duane Nickull"
><NIKOLA> Payload is user specific
so it might be that some of them would need to transfer 30MB image file.
That file could be embedded (CDATA) or it could be referenced (URI) or
whatever else.</NIKOLA>
First of all,  you cannot declare binary data as CDATA.<

>>>>>>>>>>
Technically the way to do this is using UUENCODING  - but that makes for a
massive file, but doable.

Since the URI is the best way to go - you can cache content, you can ping
headers via tcp/ip to get size and date/time stamp, before you decide to
haul it over the wire.

30MB files are not unusual for medical imagery - in fact full body MRI's -
150MB,
and CatScans, Heart Scans, where there is a real time element as well -
bigger.

I think we should ensure these can be handled - since bandwidth is
increasing
orders of magnitude.  So five years from now - this stuff will be small
potatos.

The business need drives the technical solution requirements as it should.

DW.


Message text written by "Duane Nickull"
><NIKOLA> Payload is user specific
so it might be that some of them would need to transfer 30MB image file.
That file could be embedded (CDATA) or it could be referenced (URI) or
whatever else.</NIKOLA>
First of all,  you cannot declare binary data as CDATA.<

>>>>>>>>>>
Technically the way to do this is using UUENCODING  - but that makes for a
massive file, but doable.

Since the URI is the best way to go - you can cache content, you can ping
headers via tcp/ip to get size and date/time stamp, before you decide to
haul it over the wire.

30MB files are not unusual for medical imagery - in fact full body MRI's -
150MB,
and CatScans, Heart Scans, where there is a real time element as well -
bigger.

I think we should ensure these can be handled - since bandwidth is
increasing
orders of magnitude.  So five years from now - this stuff will be small
potatos.

The business need drives the technical solution requirements as it should.

DW.



Thanks Duane.
I hoped that you'd check the scenario. However, as you can conclude reading
the first paragraph, we might have two requirements to fulfill:

1)
<Duane>
> Third - I
> cannot think of any rational how a 30mb image file can possibly be
> considered part of an XML e-business transaction.  It could be the
"subject"
> of the transaction, but XML is a text language.
</Duane>

It looks to me that there might be a business transaction that needs to
include very large files, which would mean that delivery of the file
(embedded, referenced, attached, ??? via payload) has to be part of the
agreed document exchange. This would entail that it might require (among
other things) later auditing as well.

2)
<NIKOLA>
1) Compose a XML msg where payload is XML as well.
</NIKOLA>

<Duane>
This would constitute an XML message.  It is redundant to mention an XML
message and XML payload.
</Duane>

Messages might need to include other messages (possible recursively). In
that case one of the possible candidate solutions was to put lower level
envelopes inside the higher level payloads. If you don't treat payload as
an
XML content then how you parse, handle, ... your payload? And sometimes 1)
your payload needs to (embed, reference, ???) large files.

> If you are building the message blocks for transporting,  the
construction
> of the messaging can be a very important aspect for efficiency.  I do not
> think you should force people to use either DOM or SAX as part of a
design
> rule.

I agree. DOM and SAX were just mentioned and used as possible candidate
methods. MIME was also one of the possible candidates. It would be ideal
that we just have "XML". However, how we format an ebXML msg would
influence
what method we use in order to be efficient and vice versa. As always in a
real world, your "Data Model" is not independent of how you "Process your
Data".

<Nikola>
Anyway, this might be a good candidate for an early prototype (proof of
concept)?
</Nikola>





Thanks Duane.
I hoped that you'd check the scenario. However, as you can conclude reading
the first paragraph, we might have two requirements to fulfill:

1)
<Duane>
> Third - I
> cannot think of any rational how a 30mb image file can possibly be
> considered part of an XML e-business transaction.  It could be the
"subject"
> of the transaction, but XML is a text language.
</Duane>

It looks to me that there might be a business transaction that needs to
include very large files, which would mean that delivery of the file
(embedded, referenced, attached, ??? via payload) has to be part of the
agreed document exchange. This would entail that it might require (among
other things) later auditing as well.

2)
<NIKOLA>
1) Compose a XML msg where payload is XML as well.
</NIKOLA>

<Duane>
This would constitute an XML message.  It is redundant to mention an XML
message and XML payload.
</Duane>

Messages might need to include other messages (possible recursively). In
that case one of the possible candidate solutions was to put lower level
envelopes inside the higher level payloads. If you don't treat payload as
an
XML content then how you parse, handle, ... your payload? And sometimes 1)
your payload needs to (embed, reference, ???) large files.

> If you are building the message blocks for transporting,  the
construction
> of the messaging can be a very important aspect for efficiency.  I do not
> think you should force people to use either DOM or SAX as part of a
design
> rule.

I agree. DOM and SAX were just mentioned and used as possible candidate
methods. MIME was also one of the possible candidates. It would be ideal
that we just have "XML". However, how we format an ebXML msg would
influence
what method we use in order to be efficient and vice versa. As always in a
real world, your "Data Model" is not independent of how you "Process your
Data".

<Nikola>
Anyway, this might be a good candidate for an early prototype (proof of
concept)?
</Nikola>



<David>
> First of all,  you cannot declare binary data as CDATA.<
>
> >>>>>>>>>>
> Technically the way to do this is using UUENCODING  - but that makes for
a
> massive file, but doable.
</David>

<http://www.ucc.ie/xml/>
(It would, however, be possible to include a text-encoded transformation of
a binary file as a CDATA marked section, using something like UUencode with
the markup characters ] and > removed from the map so that they could not
occur and be misinterpreted.)
</http://www.ucc.ie/xml/>

<David>
> I think we should ensure these can be handled - since bandwidth is
> increasing
> orders of magnitude.  So five years from now - this stuff will be small
> potatos.
</David>

I agree. And, five years from now we might have many other things improved
(X..., ...). And we hope ebXML will be everywhere.

<David>
> The business need drives the technical solution requirements as it
should.
</David>

Absolutely. Many times implementers give solutions (how) without justifing
it (why).