Python to Load Binary Documents to SAP HANA
Python is a widely used general-purpose, high-level programming language. Its design philosophy emphasizes code readability, and its syntax allows programmers to express concepts in fewer lines of code than would be possible in languages such as C++ or Java.
What is Unstructured Data?. Unstructured data (or unstructured information) refers to information that either does not have a pre-defined data model or is not organized in a pre-defined manner. Unstructured information is typically text-heavy, but may contain data such as dates, numbers, and facts as well.
How to Load PDF or XML files (Unstructured data) to SAP HANA by using Python
1) Download Python for windows from the following link click here
2) Download Python odbc drivers from the following link click here
3) Install python in windows directory C:\Python33\ as shown in the picture. Click on finish to complete the installation process.
4) Next install downloaded Python ODBC driver. Click on finish to complete the installation process.
Now we are ready to load the (PDF or XML ) Unstructured files to SAP HANA Cloud Platform. Before doing that make sure that you followed the prerequisite steps.
If you are loading data to SAP HANA on Premises. Then you can skip the step of creating the tunel.