I was looking through example projects and found a step “Convert 2D Points to 3D Points”.
It takes as input a (mask) image and a Pose and outputs a Point Cloud (XYZ-Normal).
I am trying to understand how this step works internally. How can an image and a 3D pose be combined to produce a point cloud?
Can you please explain?
Is the image projected as a plane into the 3D world, with the image center at the pose center? And if so, how is the size of the image plane scaled?
So the point cloud that is generated is always a plane?
Thanks. So the image (i.e., the mask) is expected to be derived via "Perspective Projection" from the point cloud in camera coordinates, and the pose is the center of that point cloud.
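In other words, I picture the forward direction roughly as a plain pinhole projection of the point cloud into the image. Just my own sketch to check my understanding (not Mech-Vision's actual code; fx, fy, cx, cy are the camera intrinsics):

```python
import numpy as np

def project_to_mask(points_cam, fx, fy, cx, cy, width, height):
    """Project 3D points (Nx3, camera coordinates) into a binary mask image."""
    mask = np.zeros((height, width), dtype=np.uint8)
    X, Y, Z = points_cam[:, 0], points_cam[:, 1], points_cam[:, 2]
    u = np.round(fx * X / Z + cx).astype(int)   # pinhole model, no distortion
    v = np.round(fy * Y / Z + cy).astype(int)
    valid = (Z > 0) & (u >= 0) & (u < width) & (v >= 0) & (v < height)
    mask[v[valid], u[valid]] = 255
    return mask
```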
For what objects can I use this step?
I assume that this step is only meant to be used for flat surfaces, and only for a single object at a time?
I tried this step naively (on purpose) on a whole, filled KLT.
Just for my understanding: how are the mask image and the pose combined to get a point cloud? Via an inverse pinhole model (and a distortion model)? And how is the pose orientation taken into account?
I assume a distortion model is also applied, because when I manually set the pose orientation to e.g. 45°, the resulting point cloud plane looks like a parabola from the side.
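To make my question concrete, here is the mechanism I am guessing at: turn every mask pixel into a viewing ray with the inverse pinhole model and intersect that ray with the plane defined by the pose (position = a point on the plane, Z axis of the orientation = plane normal). A rough numpy sketch of that guess (all names are mine, distortion is left out, and this is not the actual implementation):

```python
import numpy as np

def mask_to_plane_cloud(mask, K, plane_point, plane_normal):
    """Back-project mask pixels onto the plane defined by the pose.

    mask:         HxW image, non-zero pixels belong to the object
    K:            3x3 camera intrinsic matrix
    plane_point:  pose position = a point on the plane (camera frame)
    plane_normal: pose Z axis = plane normal (camera frame, unit length)
    """
    v, u = np.nonzero(mask)                              # mask pixel coordinates
    pixels = np.stack([u, v, np.ones_like(u)]).astype(float)
    rays = (np.linalg.inv(K) @ pixels).T                 # inverse pinhole: Nx3 ray directions
    # Ray-plane intersection: find t with (t * ray - plane_point) . normal = 0
    t = (plane_point @ plane_normal) / (rays @ plane_normal)
    points = rays * t[:, None]                           # XYZ points on the plane
    normals = np.tile(plane_normal, (len(points), 1))    # one normal per point
    return points, normals                               # matches the XYZ-Normal output
```

In this simplified sketch the output is always a perfectly flat plane; lens distortion is left out, which is the part I suspect is behind the parabola shape I am seeing.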
“The understanding is correct. This step only applies to planar point clouds. A segmentation algorithm (such as deep learning or clustering) can be used to segment out the planar point clouds first.”
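(For anyone else reading this later: one common way to obtain such a planar point cloud is to fit the dominant plane with RANSAC and keep only its inliers, for example with Open3D. This is just an illustration of the segmentation idea mentioned above, not a Mech-Vision step, and the file name is a placeholder.)

```python
import open3d as o3d

# Load the raw scene cloud (placeholder file name).
pcd = o3d.io.read_point_cloud("scene.ply")

# Fit the dominant plane with RANSAC and split the cloud into
# the planar part (inliers) and everything else.
plane_model, inliers = pcd.segment_plane(distance_threshold=0.005,
                                         ransac_n=3,
                                         num_iterations=1000)
planar_part = pcd.select_by_index(inliers)
rest = pcd.select_by_index(inliers, invert=True)
```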