Stefano Zacchirolichanged title from Review metadata deposit specs of a metadataonly deposit to Review metadata deposit specs for metadata-only deposit
changed title from Review metadata deposit specs of a metadataonly deposit to Review metadata deposit specs for metadata-only deposit
After this morning's meeting with @vlorentz and @ardumont:
We will keep the metadata-only deposit specs with the idea of a separate namespace swh for which we need to write the schema (not sure we have that).
This way, the xml with metadata has a section where the identified artifact is mentioned:
Reference a snapshot, revision or release:
With ${type} in {snp (snapshot), rev (revision), rel (release) }:<swh:deposit> <swh:reference> <swh:object id="swh:1:${type}:aaaaaaaaaaaaaa..."/> </swh:reference></swh:deposit>
We need to add to the list of types: directory and content
The possibility to deposit metadata on an origin should be implemented as well, but is not suited for institutional repositories (e.g HAL).
Reference an origin:
I think we would want to "mention" SWHIDs there, by replacing <swh:object id=" with either <swh:swhid id=" or <swh:object swhid=" (weak preference for the latter)
Additionally, should the SWHID be a core SWHID, or do we allow context? In the latter case, what do we do if there's a line context?
I don't recall what the conclusion was about the proposal of <swh:swhid>$actual_swid</swh:swhid> which i found simpler and clearer.
(I have no clue if that proposal is irrelevant or not)
I guess a question which could help answering that also would be "Do we intend to add other attributes to swh:object"?
! In #2312 (closed), @ardumont wrote:
I don't recall what the conclusion was about the proposal of <swh:swhid>$actual_swid</swh:swhid> which i found simpler and clearer.
We didn't conclude anything, I said I'd think about it ;)
Since it's a simple text value, it should be an attribute, IMO. No point in allowing content in that tag
I don't have a preference, but I do think that we don't want clients to dismember the SWHID into Option C.
So if we say that the burden of the understanding of the context is on our side, we should go with A or B.
@vlorentz is right when saying that the element is only text and is not a complexe element (where other elements are included).
@ardumont is right when saying that the use of only an element looks clearer, but we should use that only if there is a reason to include more elements in the identified object
So the questions are:
do we think we will need that outside of the scenario we have seen yesterday (metadata-only deposit)?
and do we think that on the long-term maybe option C will have a "raison d'être"?
the evolution to json-ld will be easier with what schema?
Actually, I prefer A2, to make the distinction between origins (identified by an URL, <swh:origin url=...) and objects (identified by a SWHID, <swh:object swhid='...)
Actually, I prefer A2, to make the distinction between origins (identified by an URL, <swh:origin url=...) and objects (identified by a SWHID, <swh:object swhid='...)