Page 1 of 1

Unable to identify the header and footer of the pdf using ranorex spy

Posted: Wed May 22, 2019 5:57 am
by Priyanshu
Hi Team,
I want to read the pdf using ranorex spy where my pdf contains more than 1 page, my first page data is overflown to the second page but when I try to spy the data in second page I am unable to identify where its the data which is overflown to the second page or it is the header data. Please suggest the solution.

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Thu May 23, 2019 7:42 am
by odklizec
Hi,

At first, are you sure there is enabled accessibility for given PDF? Are you able to track anything at all in your PDF? Could you please upload the PDF file in question?

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Fri May 24, 2019 7:54 am
by Priyanshu
Hi ,

Yes accessibility is enabled for the PDF. Also I am able to track each of the element in the pdf. I am unable to attach pdf file here.

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Fri May 24, 2019 8:03 am
by odklizec
Hi,

Well, if you are able to track other PDF elements, except the header and footer, then there is most probably nothing you can do about them? I would suggest to contact directly Ranorex support (via support form), but I'm sure they will need to see the PDF in question.

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Fri May 24, 2019 9:57 am
by Priyanshu
Hi ,
Actually I am able to read the data but I am unable to identify if its a header data or normal pdf content. Ranorex is not forming any such hierarchy from which I would be able to understand if its a header/footer or its normal content.

Problem statement:

I need to verify (match ) the pdf content from my application and make sure that data is same printed as the application showing. But data is coming under various labels and each label section have different content. Now Ranorex spy given me Xpath for every single row/identifier different and in case data is reached to second page it start first reading the second page header then normal content, but I am unable to identify if its a header data or normal pdf content so that I can match with application data.

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Mon May 27, 2019 7:49 am
by odklizec
Hi,

Sadly, without seeing an example PDF, not necessarily the production one, it's next to impossible to suggest something reliable. Could you please create an example PDF, which would match the structure of your problematic PDF, but filled with "lorem ipsum" text? ;)

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Mon May 27, 2019 1:11 pm
by Priyanshu
Hi,

I have mailed the PDF over the mail [email protected].

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Mon May 27, 2019 1:26 pm
by odklizec
Well, in case you want to contact support, you should use their support form, available here:
https://www.ranorex.com/support-query/
They no longer answer support emails via [email protected] address.

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Mon May 27, 2019 1:29 pm
by Priyanshu
Please suggest where I can send the PDF? Here I see pdf extension is not supported to attach file

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Mon May 27, 2019 1:31 pm
by odklizec
Zip it then. This should solve the problem with attaching unsupported extensions. Also, simple renaming of file extension should help too ;)

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Mon May 27, 2019 1:33 pm
by Priyanshu
PFA

Re: Unable to identify the header and footer of the pdf using ranorex spy

Posted: Tue May 28, 2019 8:39 pm
by Support Team
Hi Priyanshu,

Thank you for these files. Unfortunately, PDF elements traditionally do not include much detail about their structure and object and usually only contain basic information such as the text itself for accessibility applications to utilize (such as a screen reader). PDFs do not have automation in mind and we must work with what we get.

If you are wanting to validate the text in the PDF, Ranorex is able to read this specifically as long as the PDF reader allows (I personally use Firefox). If you are wanting to validate the design/layout of the PDF, then image-based automation is likely your best option.

The below screenshot shows Ranorex successfully tracking the PDF text (loaded in Firefox) on the third page. All text objects on all pages were properly recognized by Ranorex.
pg3.png
I hope this information helps!

Regards,
Ned