I have a zip file that contains two XML files that I'll call the catalog file and the price file.
In my jobflow, I move the zip file from a remote location to the DATAIN_DIR and then try to read each of the XML files using XML Extract in a couple of different places. According to this documentation link: http://doc.cloveretl.com/documentation/ ... aders.html
I should be able to access the inner contents of the zip and use wildcards, i.e.
In my case, the paths I am trying to resolve do contain wildcards. I will know the exact name of the zip file because it is passed in from a file event listener, and I will know the name of the inner folder because I can parse it from the zip file name. What I won't know at runtime is the exact inner filenames, only that they match catalog*.xml and price*.xml. So I am trying to set the path as (for example)
(where the parameters are set properly)
Unfortunately, when I do this, XML Extract does not raise an error, but it doesn't find anything either.
If I use the same underlying catalog*.xml file unzipped, it extracts properly. (Basically, I had a fully working jobflow with xml files and now am trying to retrofit it to work with the zip files, to no avail.)
To simplify the scenario I tried doing a simple hard-coding of the actual file name for testing purposes based on the suggested format, i.e.
(where the inner folder name is catalog_5_123 and the catalog xml file inside it is called catalog_5_abc.xml)
and it still extracts nothing. The jobflow does not fail, though, whereas it does fail if I give it an invalid zip file name.
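One way to rule out a mismatch between the path I am passing and what is actually stored in the archive is to list the zip entries outside CloverETL. The following is a minimal Python sketch of that check, not anything CloverETL provides: `matching_entries` is a hypothetical helper name, and the matching uses Python's `fnmatch`, which may not behave exactly like CloverETL's own wildcard handling.

```python
import fnmatch
import zipfile


def matching_entries(zip_path, pattern):
    """Return the zip entry names whose full inner path matches a glob pattern."""
    with zipfile.ZipFile(zip_path) as archive:
        # namelist() returns full inner paths (folder included), with forward slashes.
        return [name for name in archive.namelist()
                if fnmatch.fnmatchcase(name, pattern)]


# Hypothetical usage, with the folder and file names from the test described above:
#   matching_entries("some.zip", "catalog_5_123/catalog*.xml")
```

In this sketch a pattern such as `price*.xml` will not match an entry stored under an inner folder, because the entry name includes the folder prefix. Printing the full `namelist()` output would show whether the inner folder is really named the way I expect.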
I am open to unzipping the file first if I have to, but my understanding is that this should work, so my first choice would be to get it working. If I do have to unzip the file, I would need to do it from my jobflow after FTPing the file, and I don't see a component for that in the jobflow.
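For reference, the unzip step itself is small if it can be done by an external script. Below is a minimal Python sketch under that assumption; `extract_matching` is a hypothetical helper name, and how (or whether) such a script could be called from a jobflow is exactly the part I don't know.

```python
import fnmatch
import zipfile


def extract_matching(zip_path, pattern, target_dir):
    """Extract entries matching a glob pattern into target_dir; return the extracted paths."""
    extracted = []
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                continue  # skip directory entries
            if fnmatch.fnmatchcase(name, pattern):
                # extract() recreates the inner folder structure under target_dir
                # and returns the path of the file it wrote.
                extracted.append(archive.extract(name, path=target_dir))
    return extracted
```

The returned paths could then be fed to the existing, already working XML Extract setup that reads plain XML files.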
Does anyone have an idea how to get the XmlExtract component to read the inner files of the zip file or, failing that, how I can unzip the files from the jobflow after FTPing them locally?
Thanks in advance!