'real-time' 태그의 글 목록

일	월	화	수	목	금	토
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

'real-time'에 해당되는 글 9건

2010.09.18 Zenitum's 4th Open Lab
2010.05.30 test: composing OpenCV Iplimage and OpenGL graphics in one window screen
2010.03.03 ARToolKit test log
2010.02.23 virtual studio 구현: workflow
2010.02.10 Seong-Woo Park & Yongduek Seo & Ki-Sang Hong <Real-Time Camera Calibration for Virtual Studio>
2009.11.08 Branislav Kisačanin & Vladimir Pavlović & Thomas S. Huang <Real-Time Vision for Human-Computer Interaction>
2009.07.23 Brian Williams, Georg Klein and Ian Reid <Real-Time SLAM Relocalisation>
2009.03.31 A. J. Davison <Real-time simultaneous localisation and mapping with a single camera>
2009.03.26 Civera, Davison & Montiel <Inverse Depth Parametrization for Monocular SLAM>

ISMAR 2010: B O R D E R L E S S (0)	2010.10.16
David M. L. Williams 서강대 영상대학원 특강 (0)	2010.10.07
2010 공개 SW 개발자대회 2차 기술세미나 - 모바일 오픈소스 플랫폼 '안드로이드' (0)	2010.08.24
3D 영상산업의 전망과 동향 (Stereo Pictures 성필문 회장) (0)	2010.03.24
RoSEC 2010 winter school (0)	2010.01.14

test: composing OpenCV Iplimage and OpenGL graphics in one window screen

2010. 5. 30. 01:59

보호되어 있는 글입니다.
내용을 보시려면 비밀번호를 입력하세요.

password

http://www.hitl.washington.edu/artoolkit/

ARToolKit Patternmaker
Automatically create large numbers of target patterns for the ARToolKit, by the University of Utah.

ARToolKit-2.72.tgz 다운로드

http://www.openvrml.org/

DSVideoLib
A DirectShow wrapper supporting concurrent access to framebuffers from multiple threads. Useful for developing applications that require live video input from a variety of capture devices (frame grabbers, IEEE-1394 DV camcorders, USB webcams).

openvrml on macports
http://trac.macports.org/browser/trunk/dports/graphics/openvrml/Portfile

galaxy:~ lym$ port search openvrml
openvrml @0.17.12 (graphics, x11)
    a cross-platform VRML and X3D browser and C++ runtime library
galaxy:~ lym$ port info openvrml
openvrml @0.17.12 (graphics, x11)
Variants:    js_mozilla, mozilla_plugin, no_opengl, no_x11, player, universal,
             xembed

OpenVRML is a free cross-platform runtime for VRML and X3D available under the
GNU Lesser General Public License. The OpenVRML distribution includes libraries
you can use to add VRML/X3D support to an application. On platforms where GTK+
is available, OpenVRML also provides a plug-in to render VRML/X3D worlds in Web
browsers.
Homepage:    http://www.openvrml.org/

Build Dependencies:   pkgconfig
Library Dependencies: boost, libpng, jpeg, fontconfig, mesa, libsdl
Platforms:            darwin
Maintainers:          raphael@ira.uka.de openmaintainer@macports.org
galaxy:~ lym$ port deps openvrml
openvrml has build dependencies on:
    pkgconfig
openvrml has library dependencies on:
    boost
    libpng
    jpeg
    fontconfig
    mesa
    libsdl
galaxy:~ lym$ port variants openvrml
openvrml has the variants:
    js_mozilla: Enable support for JavaScript in the Script node with Mozilla
    no_opengl: Do not build the GL renderer
    xembed: Build the XEmbed control
    player: Build the GNOME openvrml-player
    mozilla_plugin: Build the Mozilla plug-in
    no_x11: Disable support for X11
    universal: Build for multiple architectures

openvrml 설치

galaxy:~ lym$ sudo port install openvrml
Password:
---> Fetching boost-jam
---> Attempting to fetch boost-jam-3.1.17.tgz from http://nchc.dl.sourceforge.net/boost
---> Verifying checksum(s) for boost-jam
---> Extracting boost-jam
---> Applying patches to boost-jam
---> Configuring boost-jam
---> Building boost-jam
---> Staging boost-jam into destroot
---> Installing boost-jam @3.1.17_0
---> Activating boost-jam @3.1.17_0
---> Cleaning boost-jam
---> Fetching boost
---> Attempting to fetch boost_1_39_0.tar.bz2 from http://nchc.dl.sourceforge.net/boost
---> Verifying checksum(s) for boost
---> Extracting boost
---> Applying patches to boost
---> Configuring boost
---> Building boost
---> Staging boost into destroot
---> Installing boost @1.39.0_2
---> Activating boost @1.39.0_2
---> Cleaning boost
---> Fetching libsdl
---> Attempting to fetch SDL-1.2.13.tar.gz from http://distfiles.macports.org/libsdl
---> Verifying checksum(s) for libsdl
---> Extracting libsdl
---> Applying patches to libsdl
---> Configuring libsdl
---> Building libsdl
---> Staging libsdl into destroot
---> Installing libsdl @1.2.13_6
---> Activating libsdl @1.2.13_6
---> Cleaning libsdl
---> Fetching glut
---> Verifying checksum(s) for glut
---> Extracting glut
---> Configuring glut
---> Building glut
---> Staging glut into destroot
---> Installing glut @3.7_3
---> Activating glut @3.7_3
---> Cleaning glut
---> Fetching xorg-dri2proto
---> Attempting to fetch dri2proto-2.1.tar.bz2 from http://distfiles.macports.org/xorg-dri2proto
---> Verifying checksum(s) for xorg-dri2proto
---> Extracting xorg-dri2proto
---> Configuring xorg-dri2proto
---> Building xorg-dri2proto
---> Staging xorg-dri2proto into destroot
---> Installing xorg-dri2proto @2.1_0
---> Activating xorg-dri2proto @2.1_0
---> Cleaning xorg-dri2proto
---> Fetching xorg-glproto
---> Attempting to fetch glproto-1.4.10.tar.bz2 from http://distfiles.macports.org/xorg-glproto
---> Verifying checksum(s) for xorg-glproto
---> Extracting xorg-glproto
---> Configuring xorg-glproto
---> Building xorg-glproto
---> Staging xorg-glproto into destroot
---> Installing xorg-glproto @1.4.10_0
---> Activating xorg-glproto @1.4.10_0
---> Cleaning xorg-glproto
---> Fetching mesa
---> Attempting to fetch MesaLib-7.4.3.tar.bz2 from http://nchc.dl.sourceforge.net/mesa3d
---> Attempting to fetch MesaGLUT-7.4.3.tar.bz2 from http://nchc.dl.sourceforge.net/mesa3d
---> Attempting to fetch AppleSGLX-57.tar.bz2 from http://xquartz.macosforge.org/downloads/src/
---> Verifying checksum(s) for mesa
---> Extracting mesa
---> Applying patches to mesa
---> Configuring mesa
---> Building mesa
---> Staging mesa into destroot
---> Installing mesa @7.4.3_0+hw_render
---> Activating mesa @7.4.3_0+hw_render
---> Cleaning mesa
---> Fetching openvrml
---> Attempting to fetch openvrml-0.17.12.tar.gz from http://nchc.dl.sourceforge.net/openvrml
---> Verifying checksum(s) for openvrml
---> Extracting openvrml
---> Configuring openvrml
---> Building openvrml
---> Staging openvrml into destroot
---> Installing openvrml @0.17.12_0
---> Activating openvrml @0.17.12_0
---> Cleaning openvrml

cd ~/Desktop/ARToolKit/lib/SRC/ARvrml

                    make

                    cd ~/Desktop/ARToolKit/examples/simpleVRML

                    make

                    cd ~/Desktop/ARToolKit/bin

                    ./simpleVRML

ARToolKit-2.72.1 설치 후 테스트

graphicsTest on the bin directory
-> This test confirms that your camera support ARToolKit graphics module with OpenGL.

videoTest on the bin directory
-> This test confirms that your camera supports ARToolKit video module and ARToolKit graphics module.

simpleTest on the bin directory
-> You need to notice that better the format is similar to ARToolKit tracking format, faster is the acquisition (RGB more efficient).

simple.c

#ifdef _WIN32 #include <windows.h> #endif #include <stdio.h> #include <stdlib.h> #ifndef __APPLE__ #include <GL/gl.h> #include <GL/glut.h> #else #include <OpenGL/gl.h> #include <GLUT/glut.h> #endif #include <AR/gsub.h> #include <AR/video.h> #include <AR/param.h> #include <AR/ar.h> #include <ciostream> // // Camera configuration. // #ifdef _WIN32 char *vconf = "Data\\WDM_camera_flipV.xml"; #else char *vconf = ""; #endif int xsize, ysize; int thresh = 100; int count = 0; char *cparam_name = "Data/camera_para.dat"; ARParam cparam; char *patt_name = "Data/patt.hiro"; int patt_id; double patt_width = 80.0; double patt_center[2] = {0.0, 0.0}; double patt_trans[3][4]; static void init(void); static void cleanup(void); static void keyEvent( unsigned char key, int x, int y); static void mainLoop(void); static void draw( void ); int main(int argc, char **argv) { glutInit(&argc, argv); init(); // starting the video capture, reading in the marker and camera parameters arVideoCapStart(); // video starting in the real-time state argMainLoop( NULL, keyEvent, mainLoop ); // defined in "include/AR/gsub.c" return (0); } static void keyEvent( unsigned char key, int x, int y) { /* quit if the ESC key is pressed */ if( key == 0x1b ) { printf("*** %f (frame/sec)\n", (double)count/arUtilTimer()); cleanup(); exit(0); } } /* main loop */ static void mainLoop(void) { ARUint8 *dataPtr; ARMarkerInfo *marker_info; int marker_num; int j, k; /* grab a vide frame */ if( (dataPtr = (ARUint8 *)arVideoGetImage()) == NULL ) { arUtilSleep(2); return; } if( count == 0 ) arUtilTimerReset(); count++; argDrawMode2D(); argDispImage( dataPtr, 0,0 ); /* detect the markers in the video frame */ if( arDetectMarker(dataPtr, thresh, &marker_info, &marker_num) < 0 ) { cleanup(); exit(0); } arVideoCapNext(); /* check for object visibility */ k = -1; for( j = 0; j < marker_num; j++ ) { if( patt_id == marker_info[j].id ) { if( k == -1 ) k = j; else if( marker_info[k].cf < marker_info[j].cf ) k = j; } } if( k == -1 ) { argSwapBuffers(); return; } /* get the transformation between the marker and the real camera */ arGetTransMat(&marker_info[k], patt_center, patt_width, patt_trans); // http://www.hitl.washington.edu/artoolkit/documentation/tutorialcamera.htm cout << "werwstsfg" << endl; printf("TEST %f %f %f\n",patt_trans[0][3],patt_trans[1][3],patt_trans[2][3]); draw(); argSwapBuffers(); } static void init( void ) { ARParam wparam; /* open the video path */ if( arVideoOpen( vconf ) < 0 ) exit(0); /* find the size of the window */ if( arVideoInqSize(&xsize, &ysize) < 0 ) exit(0); printf("Image size (x,y) = (%d,%d)\n", xsize, ysize); /* set the initial camera parameters */ if( arParamLoad(cparam_name, 1, &wparam) < 0 ) { printf("Camera parameter load error !!\n"); exit(0); } arParamChangeSize( &wparam, xsize, ysize, &cparam ); arInitCparam( &cparam ); printf("*** Camera Parameter ***\n"); arParamDisp( &cparam ); if( (patt_id=arLoadPatt(patt_name)) < 0 ) { printf("pattern load error !!\n"); exit(0); } /* open the graphics window */ argInit( &cparam, 1.0, 0, 0, 0, 0 ); } /* cleanup function called when program exits */ static void cleanup(void) { arVideoCapStop(); arVideoClose(); argCleanup(); } static void draw( void ) { double gl_para[16]; GLfloat mat_ambient[] = {0.0, 0.0, 1.0, 1.0}; GLfloat mat_flash[] = {0.0, 0.0, 1.0, 1.0}; GLfloat mat_flash_shiny[] = {50.0}; GLfloat light_position[] = {100.0,-200.0,200.0,0.0}; GLfloat ambi[] = {0.1, 0.1, 0.1, 0.1}; GLfloat lightZeroColor[] = {0.9, 0.9, 0.9, 0.1}; argDrawMode3D(); argDraw3dCamera( 0, 0 ); glClearDepth( 1.0 ); glClear(GL_DEPTH_BUFFER_BIT); glEnable(GL_DEPTH_TEST); glDepthFunc(GL_LEQUAL); /* load the camera transformation matrix */ argConvGlpara(patt_trans, gl_para); glMatrixMode(GL_MODELVIEW); glLoadMatrixd( gl_para ); glEnable(GL_LIGHTING); glEnable(GL_LIGHT0); glLightfv(GL_LIGHT0, GL_POSITION, light_position); glLightfv(GL_LIGHT0, GL_AMBIENT, ambi); glLightfv(GL_LIGHT0, GL_DIFFUSE, lightZeroColor); glMaterialfv(GL_FRONT, GL_SPECULAR, mat_flash); glMaterialfv(GL_FRONT, GL_SHININESS, mat_flash_shiny); glMaterialfv(GL_FRONT, GL_AMBIENT, mat_ambient); glMatrixMode(GL_MODELVIEW); glTranslatef( 0.0, 0.0, 25.0 ); glutSolidCube(50.0); glDisable( GL_LIGHTING ); glDisable( GL_DEPTH_TEST ); }

"hiro" 패턴을 쓰지 않으면, 아래와 같은 에러가 난다.

/Users/lym/ARToolKit/build/ARToolKit.build/Development/simpleTest.build/Objects-normal/i386/simpleTest ; exit;
galaxy:~ lym$ /Users/lym/ARToolKit/build/ARToolKit.build/Development/simpleTest.build/Objects-normal/i386/simpleTest ; exit;
Using default video config.
Opening sequence grabber 1 of 1.
vid->milliSecPerFrame: 200 forcing timer period to 100ms
Video cType is raw , size is 320x240.
Image size (x,y) = (320,240)
Camera parameter load error !!
logout

Using default video config.
Opening sequence grabber 1 of 1.
vid->milliSecPerFrame: 200 forcing timer period to 100ms
Video cType is raw , size is 320x240.
Image size (x,y) = (320,240)
*** Camera Parameter ***
--------------------------------------
SIZE = 320, 240
Distortion factor = 159.250000 131.750000 104.800000 1.012757
350.47574 0.00000 158.25000 0.00000
0.00000 363.04709 120.75000 0.00000
0.00000 0.00000 1.00000 0.00000
--------------------------------------
Opening Data File Data/object_data2
About to load 2 Models
Read in No.1
Read in No.2
Objectfile num = 2

arGetTransMat() 안에서 다음과 같이 pattern의 transformation 값을 출력해 보면,

// http://www.hitl.washington.edu/artoolkit/documentation/tutorialcamera.htm
printf("camera transformation: %f %f %f\n",conv[0][3],conv[1][3],conv[2][3]);

결과:

console

camera transformation: 134.438993 63.934746 582.012800
camera transformation: 134.445606 63.981777 582.120969
camera transformation: 134.474482 63.995219 582.242088
camera transformation: 134.599202 63.998890 582.630168
camera transformation: 134.501440 63.963350 582.269908
camera transformation: 134.464995 64.013854 582.242347
camera transformation: 134.490045 63.956372 582.209032
camera transformation: 134.375223 63.789206 581.551681
camera transformation: 133.561691 63.159733 577.815148
camera transformation: 133.063396 62.927971 575.690113
camera transformation: 133.355195 63.043104 577.132167
camera transformation: 134.613795 63.954793 582.183804
camera transformation: 132.159546 64.070513 574.724387
camera transformation: 132.448489 64.937645 575.654565
camera transformation: 130.686699 65.617613 570.876666
camera transformation: 130.650742 65.840462 571.732330
camera transformation: 130.636143 65.874965 573.631585
camera transformation: 129.504212 56.174073 571.607662
camera transformation: 125.830031 48.411508 566.542108
camera transformation: 121.581157 45.285999 569.393613
camera transformation: 123.683377 47.387303 571.546352
camera transformation: 127.458933 44.409366 568.928211
camera transformation: 127.303034 44.345058 568.159484
camera transformation: 127.320462 44.350160 568.224561
camera transformation: 127.317729 44.349189 568.212422
camera transformation: 127.317729 44.349189 568.212422
camera transformation: 125.300218 43.641056 559.530004
camera transformation: 127.269746 44.332084 568.002352
camera transformation: 127.314772 44.348305 568.201544
camera transformation: 127.328986 44.353467 568.264290
camera transformation: 127.328986 44.353467 568.264290
camera transformation: 134.859914 41.818072 563.541940
camera transformation: 135.040310 41.877534 564.294626
camera transformation: 135.043507 41.878547 564.307919
camera transformation: 135.043507 41.878547 564.307919
camera transformation: 135.043507 41.878547 564.307919
camera transformation: 130.805179 40.514050 546.854285
camera transformation: 134.889481 41.829859 563.688319
camera transformation: 135.047962 41.880133 564.327580
camera transformation: 135.047962 41.880133 564.327580
camera transformation: 135.047962 41.880133 564.327580
camera transformation: 145.248889 34.185486 561.683418
camera transformation: 145.056709 34.137696 560.948388
camera transformation: 145.056709 34.137696 560.948388
camera transformation: 145.056709 34.137696 560.948388
camera transformation: 145.056709 34.137696 560.948388
camera transformation: 141.044529 33.130566 545.431075
camera transformation: 144.985976 34.118918 560.662976
camera transformation: 145.057722 34.137896 560.951561
camera transformation: 145.057722 34.137896 560.951561
camera transformation: 145.057722 34.137896 560.951561
camera transformation: 153.656796 18.847826 551.173961
camera transformation: 153.459454 18.820515 550.460694
camera transformation: 153.463400 18.821020 550.474774
camera transformation: 153.463400 18.821020 550.474774
camera transformation: 153.463400 18.821020 550.474774
camera transformation: 150.756045 18.471968 541.053654
camera transformation: 153.457933 18.819963 550.450362
camera transformation: 153.471652 18.822038 550.502303
camera transformation: 153.471652 18.822038 550.502303
camera transformation: 153.471652 18.822038 550.502303
camera transformation: 165.753777 10.789852 542.625784
camera transformation: 165.872430 10.798618 543.003766
camera transformation: 165.861243 10.797709 542.967712
camera transformation: 165.861243 10.797709 542.967712
camera transformation: 165.861243 10.797709 542.967712
camera transformation: 159.707526 10.325843 522.933657
camera transformation: 165.749957 10.789214 542.611588
camera transformation: 165.878724 10.799263 543.027862
camera transformation: 165.858931 10.797578 542.960740
camera transformation: 165.858931 10.797578 542.960740
camera transformation: 172.080657 1.469299 534.761847
camera transformation: 172.099660 1.470041 534.825697
camera transformation: 172.105059 1.470117 534.842380
camera transformation: 172.105059 1.470117 534.842380
camera transformation: 172.105059 1.470117 534.842380
camera transformation: 166.665623 1.366321 518.259388
camera transformation: 171.958367 1.467311 534.398567
camera transformation: 172.100170 1.470062 534.827885
camera transformation: 172.101379 1.469965 534.828683
camera transformation: 172.101379 1.469965 534.828683
camera transformation: 181.319872 -6.361278 526.585438
camera transformation: 181.274748 -6.360755 526.433490
camera transformation: 181.253058 -6.360225 526.371230
camera transformation: 181.253058 -6.360225 526.371230
camera transformation: 181.253058 -6.360225 526.371230
camera transformation: 178.239568 -6.285195 517.597418
camera transformation: 181.243052 -6.360170 526.334529
camera transformation: 181.262355 -6.360482 526.395503
camera transformation: 181.262355 -6.360482 526.395503
camera transformation: 181.262355 -6.360482 526.395503
camera transformation: 187.108940 -10.223686 510.799056
camera transformation: 187.181645 -10.227215 510.978572
camera transformation: 187.181645 -10.227215 510.978572
camera transformation: 187.181645 -10.227215 510.978572
camera transformation: 187.181645 -10.227215 510.978572
camera transformation: 183.952885 -10.095289 502.048962
camera transformation: 187.138129 -10.225454 510.860204
camera transformation: 187.186616 -10.227454 510.990564
camera transformation: 187.186616 -10.227454 510.990564
camera transformation: 187.186616 -10.227454 510.990564
camera transformation: 174.882900 -17.728211 507.700497
camera transformation: 175.151320 -17.750571 508.526338
camera transformation: 175.156303 -17.750970 508.543547
camera transformation: 175.156303 -17.750970 508.543547
camera transformation: 175.156303 -17.750970 508.543547
camera transformation: 173.093356 -17.563969 502.840939
camera transformation: 175.132943 -17.749048 508.472818
camera transformation: 175.147617 -17.750226 508.517538
camera transformation: 175.147617 -17.750226 508.517538
camera transformation: 175.147617 -17.750226 508.517538
camera transformation: 153.570679 -27.610874 523.025575
camera transformation: 154.835853 -27.811536 527.263384
camera transformation: 154.855814 -27.814620 527.336320
camera transformation: 154.855814 -27.814620 527.336320
camera transformation: 154.855814 -27.814620 527.336320
camera transformation: 152.299460 -27.392362 519.749682
camera transformation: 154.752070 -27.798192 526.972641
camera transformation: 154.827218 -27.810047 527.230484
camera transformation: 154.858447 -27.815063 527.341550
camera transformation: 154.840860 -27.812225 527.285658
camera transformation: 135.840483 -41.072535 553.345517
camera transformation: 136.128073 -41.152366 554.504789
camera transformation: 136.136155 -41.154777 554.540872
camera transformation: 136.136155 -41.154777 554.540872
camera transformation: 136.136155 -41.154777 554.540872
camera transformation: 130.527637 -39.583364 532.044687
camera transformation: 135.988306 -41.113532 553.943304
camera transformation: 136.142041 -41.156307 554.559791
camera transformation: 136.142041 -41.156307 554.559791
camera transformation: 136.142041 -41.156307 554.559791
camera transformation: 114.644838 -45.377904 573.264490
camera transformation: 115.199061 -45.580086 576.005279
camera transformation: 115.229438 -45.591515 576.169804
camera transformation: 115.246340 -45.597695 576.252987
camera transformation: 115.246340 -45.597695 576.252987
camera transformation: 113.754556 -45.048098 568.589400
camera transformation: 115.200177 -45.581112 576.034506
camera transformation: 115.235757 -45.593964 576.202107
camera transformation: 115.245920 -45.597558 576.251805
camera transformation: 115.245920 -45.597558 576.251805
camera transformation: 99.671642 -40.181365 582.198352
camera transformation: 100.462758 -40.471438 586.962569
camera transformation: 100.537384 -40.500194 587.472050
camera transformation: 100.549497 -40.504513 587.541289
camera transformation: 100.549497 -40.504513 587.541289
camera transformation: 97.303318 -39.355658 570.579586
camera transformation: 100.336305 -40.427915 586.316769
camera transformation: 100.547949 -40.504219 587.544305
camera transformation: 100.548709 -40.504451 587.544697
camera transformation: 100.548709 -40.504451 587.544697
camera transformation: 89.621585 -31.117271 596.138707
camera transformation: 90.219712 -31.290746 599.985966
camera transformation: 90.329912 -31.322731 600.684517
camera transformation: 90.328693 -31.322303 600.672256
camera transformation: 90.327473 -31.321874 600.659986
camera transformation: 87.759503 -30.579438 586.130418
camera transformation: 90.158155 -31.275121 599.754395
camera transformation: 90.313555 -31.317626 600.555676
camera transformation: 90.312882 -31.317428 600.555444
camera transformation: 90.312882 -31.317428 600.555444
camera transformation: 71.029270 -24.465717 602.374578
camera transformation: 70.940132 -24.442041 601.728877
camera transformation: 70.905596 -24.432504 601.444114
camera transformation: 70.905596 -24.432504 601.444114
camera transformation: 70.905596 -24.432504 601.444114
camera transformation: 68.768061 -23.836713 585.531494
camera transformation: 70.761778 -24.392775 600.274063
camera transformation: 70.901723 -24.431512 601.417396
camera transformation: 70.899923 -24.430914 601.396803
camera transformation: 70.899923 -24.430914 601.396803
camera transformation: 48.950365 -26.084962 601.595042
camera transformation: 48.933292 -26.081172 601.647515
camera transformation: 48.907404 -26.070042 601.356804
camera transformation: 48.907438 -26.070153 601.365086
camera transformation: 48.908143 -26.070507 601.373649
camera transformation: 47.153553 -25.289153 579.698461
camera transformation: 48.752213 -26.003037 599.555228
camera transformation: 48.887848 -26.061948 601.158725
camera transformation: 48.906954 -26.070190 601.376902
camera transformation: 48.906954 -26.070190 601.376902
camera transformation: 36.527862 -27.859885 601.451937
camera transformation: 36.678154 -27.949073 603.625633
camera transformation: 36.699226 -27.961495 603.914756
camera transformation: 36.699226 -27.961495 603.914756
camera transformation: 36.699226 -27.961495 603.914756
camera transformation: 34.828000 -26.821094 576.608529
camera transformation: 36.532837 -27.864475 601.649382
camera transformation: 36.672854 -27.945472 603.510826
camera transformation: 36.696060 -27.959637 603.870801
camera transformation: 36.696060 -27.959637 603.870801
camera transformation: 35.748520 -26.890392 599.448608
camera transformation: 35.952229 -27.020554 603.539403
camera transformation: 35.983429 -27.041974 604.319312
camera transformation: 35.983462 -27.043056 604.402832
camera transformation: 35.983320 -27.043073 604.409701
camera transformation: 33.960297 -25.864785 576.135985
camera transformation: 35.748413 -26.917867 602.033525
camera transformation: 35.951659 -27.027518 604.189188
camera transformation: 35.971207 -27.037715 604.385121
camera transformation: 35.972959 -27.037579 604.303563
camera transformation: 38.696380 -24.720939 614.161095
camera transformation: 38.183800 -24.450617 606.346543
camera transformation: 38.134450 -24.424857 605.615128
camera transformation: 38.135159 -24.425102 605.615481
camera transformation: 38.135159 -24.425102 605.615481
camera transformation: 36.853820 -23.745396 586.509814
camera transformation: 38.056856 -24.382855 604.374995
camera transformation: 38.136416 -24.425786 605.632439
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 36.853820 -23.745396 586.509814
camera transformation: 38.056856 -24.382855 604.374995
camera transformation: 38.136416 -24.425786 605.632439
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 36.853820 -23.745396 586.509814
camera transformation: 38.056856 -24.382855 604.374995
camera transformation: 38.136416 -24.425786 605.632439
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 38.135450 -24.425204 605.617310
camera transformation: 7.229969 -28.426098 487.494944
camera transformation: 1.061351 -37.746923 490.163596
camera transformation: 1.039532 -38.012456 494.388390
camera transformation: -1.890196 -40.741575 489.196950
camera transformation: -1.905179 -40.851093 490.795875
camera transformation: -1.913698 -40.848960 490.823090
camera transformation: -1.913698 -40.848960 490.823090
camera transformation: -1.913698 -40.848960 490.823090
camera transformation: -1.275810 -46.033944 570.939355
camera transformation: -1.360000 -46.027133 571.481545
camera transformation: -1.360001 -46.027141 571.481641
camera transformation: -1.360001 -46.027141 571.481641
camera transformation: -1.360001 -46.027141 571.481641
camera transformation: -4.856311 -45.941600 492.994856
camera transformation: -3.554997 -42.126784 489.440430
camera transformation: -5.463370 -44.468718 490.659087
camera transformation: -9.441343 -43.715187 491.482322
camera transformation: -9.587181 -44.110448 496.792789
camera transformation: -9.609268 -44.160040 497.482136
camera transformation: -9.608472 -44.160158 497.478898
camera transformation: -9.608472 -44.160158 497.478898
camera transformation: -10.481053 -50.965780 590.361852
camera transformation: -10.621224 -50.926162 590.569728
camera transformation: -10.629070 -50.921812 590.551891
camera transformation: -10.629097 -50.921897 590.546394
camera transformation: -10.636885 -50.917353 590.524294
camera transformation: -6.014300 -46.196831 498.526090
camera transformation: -6.009832 -46.082883 497.231266
camera transformation: -6.010114 -46.077167 497.168853
camera transformation: -6.010114 -46.077167 497.168853
camera transformation: -6.010114 -46.077167 497.168853
camera transformation: -3.463019 -44.233782 493.963337
camera transformation: -4.785040 -43.402498 496.156243
camera transformation: -4.783746 -43.479250 496.922893
camera transformation: -4.785978 -43.496124 497.102079
camera transformation: -4.785978 -43.496124 497.102079
camera transformation: -2.223724 -40.852272 492.668115
camera transformation: -2.255911 -40.924862 493.830850
camera transformation: -2.271698 -40.922372 493.898586
camera transformation: -2.272499 -40.926932 493.964230
camera transformation: -2.273289 -40.926828 493.967891
camera transformation: -0.843473 -38.095411 491.335923
camera transformation: -0.931192 -38.413990 495.584437
camera transformation: -0.934646 -38.412155 495.581977
camera transformation: -0.936060 -38.406104 495.520218
camera transformation: -0.936060 -38.406104 495.520218
camera transformation: 1.160322 -40.264248 495.616755
camera transformation: 1.161880 -40.256411 495.494743
camera transformation: 1.161880 -40.256411 495.494743
camera transformation: 1.161880 -40.256411 495.494743
camera transformation: 1.161880 -40.256411 495.494743
camera transformation: 3.723426 -39.508401 504.989547
camera transformation: 2.678958 -37.770843 496.488485
camera transformation: 2.410065 -37.547308 495.340346
camera transformation: 2.410310 -37.557735 495.465695
camera transformation: 2.412940 -37.558864 495.465748
camera transformation: 4.203019 -39.053885 496.367027
camera transformation: 4.204413 -39.092548 496.951603
camera transformation: 4.204475 -39.096509 497.013231
camera transformation: 4.203762 -39.096434 497.017567
camera transformation: 4.203762 -39.096434 497.017567
camera transformation: 5.836132 -34.680737 472.145396
camera transformation: 5.595480 -36.378936 496.482830
camera transformation: 5.600604 -36.497013 497.981744
camera transformation: 5.599892 -36.506204 498.105048
camera transformation: 5.599892 -36.506204 498.105048
camera transformation: 8.653772 -36.053810 496.365853
camera transformation: 8.656689 -36.070246 496.641526
camera transformation: 8.655321 -36.066243 496.579657
camera transformation: 8.655321 -36.066243 496.579657
camera transformation: 8.655321 -36.066243 496.579657
camera transformation: 11.681594 -37.460802 533.449922
camera transformation: 8.750170 -37.668607 528.373758
camera transformation: 8.706002 -36.202737 498.655489
camera transformation: 8.659246 -36.073637 496.682038
camera transformation: 8.656511 -36.065632 496.558303
camera transformation: 10.115806 -34.679194 497.429046
camera transformation: 10.106989 -34.657666 497.079133
camera transformation: 10.103787 -34.649678 496.950432
camera transformation: 10.103787 -34.649678 496.950432
camera transformation: 10.103787 -34.649678 496.950432
camera transformation: 13.357781 -37.088943 550.459166
camera transformation: 10.836519 -37.433833 555.303929
camera transformation: 10.101605 -35.371871 512.485432
camera transformation: 10.113543 -34.698273 497.824009
camera transformation: 10.102475 -34.651602 497.000985
camera transformation: 18.350948 -27.329877 502.107301
camera transformation: 18.321398 -27.294670 501.341088
camera transformation: 18.322018 -27.294771 501.336240
camera transformation: 18.321814 -27.294491 501.330843
camera transformation: 18.321814 -27.294491 501.330843
camera transformation: 22.829063 -30.847622 591.172408
camera transformation: 22.590924 -30.974564 597.758595
camera transformation: 22.590913 -30.974577 597.758539
camera transformation: 22.590902 -30.974590 597.758482
camera transformation: 22.590891 -30.974603 597.758426
camera transformation: 37.103910 -10.551708 515.807167
camera transformation: 47.377631 9.966732 526.726739
camera transformation: 49.596898 16.198552 526.553013
camera transformation: 56.476216 22.342972 528.741435

Feature List
* A simple framework for creating real-time augmented reality applications
* A multiplatform library (Windows, Linux, Mac OS X, SGI)
* Overlays 3D virtual objects on real markers ( based on computer vision algorithm)
* A multi platform video library with:

o multiple input sources (USB, Firewire, capture card) supported
o multiple format (RGB/YUV420P, YUV) supported
o multiple camera tracking supported
o GUI initializing interface

* A fast and cheap 6D marker tracking (real-time planar detection)
* An extensible markers patterns approach (number of markers fct of efficency)
* An easy calibration routine
* A simple graphic library (based on GLUT)
* A fast rendering based on OpenGL
* A 3D VRML support
* A simple and modular API (in C)
* Other language supported (JAVA, Matlab)
* A complete set of samples and utilities
* A good solution for tangible interaction metaphor
* OpenSource with GPL license for non-commercial usage

framework

"ARToolKit is able to perform this camera tracking in real time, ensuring that the virtual objects always appear overlaid on the tracking markers."

how to
1. 매 비디오 프레임 마다 사각형 모양을 찾기
2. 검은색 사각형에 대한 카메라의 상대적 위치를 계산
3. 그 위치로부터 컴퓨터 그래픽 모델이 어떻게 그려질지를 계산
4. 실제 영상의 마커 위에 모델을 그림

limitations
1. 추적하는 마커가 영상 안에 보일 때에만 가상 물체를 합성할 수 있음
2. 이 때문에 가상 물체들의 크기나 이동이 제한됨
3. 마커의 패턴의 일부가 가려지는 경우 가상 물체를 합성할 수 없음
4. range(거리)의 제한: 마커의 모양이 클수록 멀리 떨어진 패턴까지 감지할 수 있으므로 추적할 수 있는 volume(범위)이 더 커짐
(이때 거리는 pattern complexity (패턴의 복잡도)에 따라 달라짐: 패턴이 단순할수록 한계 거리가 길어짐)
5. 추적 성능이 카메라에 대한 마커의 상대적인 orientation(방향)에 따라 달라짐
: 마커가 많이 기울어 수평에 가까워질수록 보이는 패턴의 부분이 줄어들기 때문에 recognition(인식)이 잘 되지 않음(신뢰도가 떨어짐)
6. 추적 성능이 lighting conditions (조명 상태)에 따라 달라짐
: 조명에 의해 종이 마커 위에 reflection and glare spots (반사)가 생기면 마커의 사각형을 찾기가 어려워짐
: 종이 대신 반사도가 적은 재료를 쓸 수 있음

ARToolKit Vision Algorithm

Development
Initialization
1. Initialize the video capture and read in the marker pattern files and camera parameters. -> init()
Main Loop
2. Grab a video input frame. -> arVideoGetImage()
3. Detect the markers and recognized patterns in the video input frame. -> arDetectMarker()
4. Calculate the camera transformation relative to the detected patterns. -> arGetTransMat)
5. Draw the virtual objects on the detected patterns. -> draw()
Shutdown
6. Close the video capture down. -> cleanup()

ref.
http://king8028.tistory.com/entry/ARToolkit-simpletestc-%EC%84%A4%EB%AA%8512
http://kougaku-navi.net/ARToolKit.html

ARToolKit video configuration

camera calibration

Default camera properties are contained in the camera parameter file camera_para.dat, that is read in each time an application is started.

The program calib_dist is used to measure the image center point and lens distortion, while calib_param produces the other camera properties. (Both of these programs can be found in the bin directory and their source is in the utils/calib_dist and utils/calib_cparam directories.)

ARToolKit gives the position of the marker in the camera coordinate system, and uses OpenGL matrix system for the position of the virtual object.

ARToolKit API Documentation
http://artoolkit.sourceforge.net/apidoc/

ARMarkerInfo	Main structure for detected marker
ARMarkerInfo2	Internal structure use for marker detection
ARMat	Matrix structure
ARMultiEachMarkerInfoT	Multi-marker structure
ARMultiMarkerInfoT	Global multi-marker structure
ARParam	Camera intrinsic parameters
arPrevInfo	Structure for temporal continuity of tracking
ARVec	Vector structure

arVideoGetImage()

video.h

/**
* \brief get the video image.
*
* This function returns a buffer with a captured video image.
* The returned data consists of a tightly-packed array of
* pixels, beginning with the first component of the leftmost
* pixel of the topmost row, and continuing with the remaining
* components of that pixel, followed by the remaining pixels
* in the topmost row, followed by the leftmost pixel of the
* second row, and so on.
* The arrangement of components of the pixels in the buffer is
* determined by the configuration string passed in to the driver
* at the time the video stream was opened. If no pixel format
* was specified in the configuration string, then an operating-
* system dependent default, defined in <AR/config.h> is used.
* The memory occupied by the pixel data is owned by the video
* driver and should not be freed by your program.
* The pixels in the buffer remain valid until the next call to
* arVideoCapNext, or the next call to arVideoGetImage which
* returns a non-NULL pointer, or any call to arVideoCapStop or
* arVideoClose.
* \return A pointer to the pixel data of the captured video frame,
* or NULL if no new pixel data was available at the time of calling.
*/
AR_DLL_API ARUint8* arVideoGetImage(void);

ARParam

param.h

/** \struct ARParam
* \brief camera intrinsic parameters.
*
* This structure contains the main parameters for
* the intrinsic parameters of the camera
* representation. The camera used is a pinhole
* camera with standard parameters. User should
* consult a computer vision reference for more
* information. (e.g. Three-Dimensional Computer Vision
* (Artificial Intelligence) by Olivier Faugeras).
* \param xsize length of the image (in pixels).
* \param ysize height of the image (in pixels).
* \param mat perspective matrix (K).
* \param dist_factor radial distortions factor
*          dist_factor[0]=x center of distortion
*          dist_factor[1]=y center of distortion
*          dist_factor[2]=distortion factor
*          dist_factor[3]=scale factor
*/
typedef struct {
    int      xsize, ysize;
    double   mat[3][4];
    double   dist_factor[4];
} ARParam;

typedef struct {
    int      xsize, ysize;
    double   matL[3][4];
    double   matR[3][4];
    double   matL2R[3][4];
    double   dist_factorL[4];
    double   dist_factorR[4];
} ARSParam;

arDetectMarker()

ar.h 헤더 파일의 설명:

/**
* \brief main function to detect the square markers in the video input frame.
*
* This function proceeds to thresholding, labeling, contour extraction and line corner estimation
* (and maintains an history).
* It's one of the main function of the detection routine with arGetTransMat.
* \param dataPtr a pointer to the color image which is to be searched for square markers.
*                The pixel format depend of your architecture. Generally ABGR, but the images
*                are treated as a gray scale, so the order of BGR components does not matter.
*                However the ordering of the alpha comp, A, is important.
* \param thresh specifies the threshold value (between 0-255) to be used to convert
*                the input image into a binary image.
* \param marker_info a pointer to an array of ARMarkerInfo structures returned
*                    which contain all the information about the detected squares in the image
* \param marker_num the number of detected markers in the image.
* \return 0 when the function completes normally, -1 otherwise
*/
int arDetectMarker( ARUint8 *dataPtr, int thresh,
                    ARMarkerInfo **marker_info, int *marker_num );

You need to notice that arGetTransMat give the position of the marker in the camera coordinate system (not the reverse). If you want the position of the camera in the marker coordinate system you need to inverse this transformation (arMatrixInverse()).

XXXBK: not be sure of this function: this function must just convert 3x4 matrix to classical perspective openGL matrix. But in the code, you used arParamDecompMat that seem decomposed K and R,t, aren't it ? why do this decomposition since we want just intrinsic parameters ? and if not what is arDecomp ?

double arGetTransMat()

ar.h 헤더 파일의 설명:

/**
* \brief compute camera position in function of detected markers.
*
* calculate the transformation between a detected marker and the real camera,
* i.e. the position and orientation of the camera relative to the tracking mark.
* \param marker_info the structure containing the parameters for the marker for
*                    which the camera position and orientation is to be found relative to.
*                    This structure is found using arDetectMarker.
* \param center the physical center of the marker. arGetTransMat assumes that the marker
*              is in x-y plane, and z axis is pointing downwards from marker plane.
*              So vertex positions can be represented in 2D coordinates by ignoring the
*              z axis information. The marker vertices are specified in order of clockwise.
* \param width the size of the marker (in mm).
* \param conv the transformation matrix from the marker coordinates to camera coordinate frame,
*             that is the relative position of real camera to the real marker
* \return always 0.
*/
double arGetTransMat( ARMarkerInfo *marker_info,
                      double center[2], double width, double conv[3][4] )

arUtilMatInv()

ar.h 헤더 파일의 설명:

/**
* \brief Inverse a non-square matrix.
*
* Inverse a matrix in a non homogeneous format. The matrix
* need to be euclidian.
* \param s matrix input
* \param d resulted inverse matrix.
* \return 0 if the inversion success, -1 otherwise
* \remark input matrix can be also output matrix
*/
int arUtilMatInv( double s[3][4], double d[3][4] );

저작자표시 비영리 동일조건 (새창열림)

'Computer Vision' 카테고리의 다른 글

opencv: video capturing from a camera (4)	2010.03.13
Leordeanu & Hebert, "Unsupervised learning for graph matching" (0)	2010.03.04
Jonathan Mooser et al. "Tricodes: A Barcode-Like Fiducial Design for Augmented Reality Media" (0)	2010.03.02
"Design Patterns for Augmented Reality Systems" (0)	2010.03.02
virtual studio 구현: cross ratio test (0)	2010.02.26

posted by maetel

virtual studio 구현: workflow

2010. 2. 23. 00:47 Computer Vision

1> pattern identification 패턴 인식

rough preview

1) 무늬의 deep/light 색의 경계점들 찾기 edge detection
2) 찾은 점들을 직선으로 연결
3) 검출된 가로선과 세로선의 cross ratio와 실제 무늬의 cross ratio를 비교하여, 몇 번째 선인지 인식

detailed preview

1. initial identification process 초기 인식 과정 (특징점 인식)

1) chroma keying: RGB -> YUV 변환

2) gradient filtering: first-order derivative Gaussian filter (length = 7)
-1) 세로축에 대해 영상 축소 (1/4)하여 필터링
-2) Gx, Gy 절대값 비교하여 vertical / horizontal direction 판별
-3) 가로축에 대해

3) line fitting: lens distortion coefficient을 고려하여 이차곡선으로 피팅

4) identification
-1) 영상에서 찾아진 선들이 실제 무늬에서 몇 번째 선인지 인식
-2) feature points는 직선 식에 의해 피팅된 선들의 교점으로 정확하게 구할 수 있음

2. feature point tracking 실제 동작 과정 (특징점 위치 추적)
: feature points corresponding 검출된 특징점을 무늬의 교점과 매칭

1) intersection filter H (교점 필터)로 local maximum & minimum를 가지는 교점 검출

2) 검출된 교점의 부호를 판별하여 두 부류로 나눔

3) 이전 프레임에서의 교점의 위치를 기준으로 현재 프레임에서 검출된 교점에 대해 가장 가까운 이전 점을 찾음

* 다음 프레임에서 새로 나타난 특징점에 대해서도 이전 프레임에서의 카메라 변수를 이용해 실제 패턴 상의 교점을 영상으로 투영시켜 기준점으로 삼을 수 있음

2> real-time camera parameter extraction 실시간 카메라 변수 추출: Tsai's algorithm

1. determining image center 영상 중심 구하기: zooming
: using the center of expansion as a constant image-center

1) (lens distortion을 구하기 위한 초기화 과정에서) 정지된 카메라의 maximum zoom-out과 maximum zoom-in 상태에서 찾아서 인식한 특징점들을 저장

2) 두 개의 프레임에서 같은 점으로 나타난 특징점들을 연결한 line segments의 common intersection 교점을 계산

* 실제로 zooming은 여러 개의 lens들의 조합으로 작동하기 때문에 카메라의 zoom에 따라서 image center가 변하게 되지만, 이에 대한 표준 편차가 작으므로 무시하기로 함

2. lens distortion coefficient 계산
zooming이 없다면 고정된 값이 되므로 이하와 같이 매번 계산해 줄 필요가 없어짐

(1) f-k1 look-up table을 참조하는 방법
: zooming하는 과정에서 초점 거리 f와 렌즈 왜곡 변수 k1이 계속 변하게 되므로, 이에 대한 참조표를 미리 만들어 두고 나서 실제 동작 과정에서 참조
* 특징점들이 모두 하나의 평면에 존재하는 경우에는 초점거리 f와 카메라의 z 방향으로의 이동 Tz가 서로 coupled되기 때문에 카메라 변수가 제대로 계산되기 어렵다는 점을 고려하여 평면 상의 특징점들에 대해서 Tz/f를 인덱스로 사용하는 편법을 쓴다면, 카메라가 z 방향으로는 이동하지 않고 고정되어 있어야 한다는 (T1z = 0)조건이 붙게 됨

(2) collinearity를 이용하는 방법
: searching for k1 which maximally preserves collinearity 인식된 교점들에 대해 원래 하나의 직선에 속하는 점들이 왜곡 보상 되었을 때 가장 직선이 되게 하는 왜곡변수를 구함

1) 영상에서 같은 가로선에 속하는 교점들 (Xf, Yf) 가운데 세 개를 고름

2) 식7로부터 왜곡된 영상면 좌표 (Xd, Yd)를 구함

3) 식5로부터 왜곡 보상된 영상면 좌표 (Xu, Yu)를 구함

4) 식21과 같은 에러 함수 E(k1)를 정의

5) 영상에 나타난 N개의 가로선들에 대해서 E(k1) 값을 최소화하는 k1을 구함 (식 23) -> 비선형 최적화이나 iteration은 한 번

3. Tsai's algorithm
렌즈 왜곡 변수를 알면 카메라 캘리브레이션은 선형적 방법으로 구할 수 있게 됨

3> filtering
잡음으로 인해 검출된 교점에 오차가 생기므로 카메라변수가 틀려지게 됨
(->카메라가 정지해 있어도 카메라변수에 변화가 생겨 결과적으로 그래픽으로 생성된 가상의 무대에 떨림이 나타나게 됨)

averaging filter 평균 필터 (전자공학회논문지 제36권 S편 제7호 식19)

저작자표시 비영리 동일조건 (새창열림)

'Computer Vision' 카테고리의 다른 글

virtual studio 구현: cross ratio test (0)	2010.02.26
Chikara MATSUNAGA et al. "Optimal Grid Pattern for Automated Camera Calibration Using Cross Ratio" (0)	2010.02.26
chroma keying (0)	2010.02.22
3차원 인터페이스 시장조사 (1)	2010.02.22
Coelho et al. "An experimental evaluation of projective invariants" (0)	2010.02.22

posted by maetel

Seong-Woo Park & Yongduek Seo & Ki-Sang Hong <Real-Time Camera Calibration for Virtual Studio>

2010. 2. 10. 15:47 Computer Vision

Seong-Woo Park, Yongduek Seo, Ki-Sang Hong: Real-Time Camera Calibration for Virtual Studio. Real-Time Imaging 6(6): 433-448 (2000)
doi:10.1006/rtim.1999.0199

swPark_2000rtim.pdf

Real-Time Camera Calibration for Virtual Studio

Seong-Woo Park, Yongduek Seo and Ki-Sang Hong ¹

Dept. of E.E. POSTECH, San 31, Hyojadong, Namku, Pohang, Kyungbuk, 790-784, Korea

Abstract

In this paper, we present an overall algorithm for real-time camera parameter extraction, which is one of the key elements in implementing virtual studio, and we also present a new method for calculating the lens distortion parameter in real time. In a virtual studio, the motion of a virtual camera generating a graphic studio must follow the motion of the real camera in order to generate a realistic video product. This requires the calculation of camera parameters in real-time by analyzing the positions of feature points in the input video. Towards this goal, we first design a special calibration pattern utilizing the concept of cross-ratio, which makes it easy to extract and identify feature points, so that we can calculate the camera parameters from the visible portion of the pattern in real-time. It is important to consider the lens distortion when zoom lenses are used because it causes nonnegligible errors in the computation of the camera parameters. However, the Tsai algorithm, adopted for camera calibration, calculates the lens distortion through nonlinear optimization in triple parameter space, which is inappropriate for our real-time system. Thus, we propose a new linear method by calculating the lens distortion parameter independently, which can be computed fast enough for our real-time application. We implement the whole algorithm using a Pentium PC and Matrox Genesis boards with five processing nodes in order to obtain the processing rate of 30 frames per second, which is the minimum requirement for TV broadcasting. Experimental results show this system can be used practically for realizing a virtual studio.

전자공학회논문지 제36권 S편 제7호, 1999. 7
가상스튜디오 구현을 위한 실시간 카메라 추적 ( Real-Time Camera Tracking for Virtual Studio )
박성우 · 서용덕 · 홍기상 저 pp. 90~103 (14 pages)
http://uci.or.kr/G300-j12265837.v36n07p90

서지링크 한국과학기술정보연구원
가상스튜디오의 구현을 위해서 카메라의 움직임을 실시간으로 알아내는 것이 필수적이다. 기존의 가상스튜디어 구현에 사용되는 기계적인 방법을 이용한 카메라의 움직임 추적하는 방법에서 나타나는 단점들을 해결하기 위해 본 논문에서는 카메라로부터 얻어진 영상을 이용해 컴퓨터비전 기술을 응용하여 실시간으로 카메라변수들을 알아내기 위한 전체적인 알고리듬을 제안하고 실제 구현을 위한 시스템의 구성 방법에 대해 다룬다. 본 연구에서는 실시간 카메라변수 추출을 위해 영상에서 특징점을 자동으로 추출하고 인식하기 위한 방법과, 카메라 캘리브레이션 과정에서 렌즈의 왜곡특성 계산에 따른 계산량 문제를 해결하기 위한 방법을 제안한다.

DHJJIU_1999_v36Sn7_90.pdf

Practical ways to calculate camera lens distortion for real-time camera calibration
Pattern Recognition, Volume 34, Issue 6, June 2001, Pages 1199-1206
Seong-Woo Park, Ki-Sang Hong

swPark_2001jpr.pdf

generating virtual studio

Matrox Genesis boards
http://www.matrox.com/imaging/en/support/legacy/

http://en.wikipedia.org/wiki/Virtual_studio

http://en.wikipedia.org/wiki/Virtual_Studio_Technology

http://en.wikipedia.org/wiki/Chroma_key

camera tracking system : electromechanical / optical
pattern recognition
2D-3D pattern matches
planar pattern

feature extraction -> image-model matching & identification -> camera calibration

: to design the pattern by applying the concept of cross-ratio and to identify the pattern automatically

영상에서 찾아진 특징점을 자동으로 인식하기 위해서는 공간 상의 점들과 영상에 나타난 그것들의 대응점에 대해서 같은 값을 갖는 성질이 필요한데 이것을 기하적 불변량 (Geometric Invariant)이라고 한다. 본 연구에서는 여러 불변량 가운데 cross-ratio를 이용하여 패턴을 제작하고, 영상에서 불변량의 성질을 이용하여 패턴을 자동으로 찾고 인식할 수 있게 하는 방법을 제안한다.

Tsai's algorithm
R. Y. Tsai, A Versatile Camera Calibration Technique for High Accuracy 3-D Maching Vision Metrology Using Off-the-shelf TV Cameras and Lenses. IEEE Journal of Robotics & Automation 3 (1987), pp. 323–344.

direct image mosaic method
Sawhney, H. S. and Kumar, R. 1999. True Multi-Image Alignment and Its Application to Mosaicing and Lens Distortion Correction. IEEE Trans. Pattern Anal. Mach. Intell. 21, 3 (Mar. 1999), 235-243. DOI= http://dx.doi.org/10.1109/34.754589

Lens distortion
Richard Szeliski, Computer Vision: Algorithms and Applications: 2.1.6 Lens distortions & 6.3.5 Radial distortion

radial alignment constraint
"If we presume that the lens has only radial distortion, the direction of a distorted point is the same as the direction of an undistorted point."

cross-ratio http://en.wikipedia.org/wiki/Cross_ratio
: planar projective geometric invariance
- "pencil of lines"
http://mathworld.wolfram.com/CrossRatio.html
http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/MOHR_TRIGGS/node25.html
http://www.cut-the-knot.org/pythagoras/Cross-Ratio.shtml
http://web.science.mq.edu.au/~chris/geometry/

chap04.pdf

pattern identification

카메라의 움직임을 알아내기 위해서는 공간상에 인식이 가능한 물체가 있어야 한다. 즉, 어느 위치에서 보더라도 영상에 나타난 특징점을 찾을 수 있고, 공간상의 어느 점에 대응되는 점인지를 알 수 있어야 한다.

패턴이 인식 가능하기 위해서는 카메라가 어느 위치, 어느 자세로 보던지 항상 같은 값을 갖는 기하적 불변량 (Geometric Invariant)이 필요하다.

Coelho, C., Heller, A., Mundy, J. L., Forsyth, D. A., and Zisserman, A.1992. An experimental evaluation of projective invariants. In Geometric invariance in Computer Vision, J. L. Mundy and A. Zisserman, Eds. Mit Press Series Of Artificial Intelligence Series. MIT Press, Cambridge, MA, 87-104.

> initial identification process
extracting the pattern in an image: chromakeying -> gradient filtering: a first-order derivative of Gaussian (DoG) -> line fitting: deriving a distorted line (that is actually a curve) equation -> feature point tracking (using intersection filter)

R1x = 0

http://en.wikipedia.org/wiki/Difference_of_Gaussians

real-time camera parameter extraction

이상적인 렌즈의 optical axis가 영상면에 수직이고 변하지 않는다고 할 때, 영상 중심은 카메라의 줌 동작 동안 고정된 값으로 계산된다. (그러나 실제 렌즈의 불완전한 특성 때문에 카메라의 줌 동작 동안 영상 중심 역시 변하게 되는데, 이 변화량은 적용 범위 이내에서 2픽셀 이하이다. 따라서 본 연구에서는 이러한 변화를 무시하고 이상적인 렌즈를 가정하여 줌동작에 의한 영상 중심을 구하게 된다.)

For zoom lenses, the image centers vary as the camera zooms because the zooming operation is executed by a composite combination of several lenses. However, when we examined the location of the image centers, its standard deviation was about 2 pixels; thus we ignored the effect of the image center change.

calculating lens distortion coefficient

Zoom lenses are zoomed by a complicated combination of several lenses so that the effective focal length and distortion coefficient vary during zooming operations.

When using the coplanar pattern with small depth variation, it turns out that focal length and z-translation cannot be separated exactly and reliably even with small noise.

카메라 변수 추출에 있어서 공간상의 특징점들이 모두 하나의 평면상에 존재할 때는 초점거리와 z 방향으로의 이동이 상호 연관 (coupling)되어 계산값의 안정성이 결여되기 쉽다.

collinearity

Collinearity represents a property when the line in the world coordinate is also shown as a line in the image. This property is not preserved when the lens has a distortion.

Once the lens distortion is calculated, we can execute camera calibration using linear methods.

filtering

가상 스튜디오 구현에 있어서는 시간 지연이 항상 같은 값을 가지게 하는 것이 필수적이므로, 실제 적용에서는 예측 (prediction)이 들어가는 필터링 방법(예를 들면, Kalman filter)은 사용할 수가 없었다.

averaging filter 평균 필터

Orad http://www.orad.co.il

Evans & Sutherland http://www.es.com

저작자표시 비영리 동일조건 (새창열림)

'Computer Vision' 카테고리의 다른 글

Sawhney & Kumar "True Multi-Image Alignment and Its Application to Mosaicing and Lens Distortion Correction" (0)	2010.02.10
Gibbs et al. "Virtual Studios: An Overview" (0)	2010.02.10
Moons & Gool & Vergauwen [3D Reconstruction from Multiple Images] (0)	2010.02.09
Sola & Monin & Devy & Lemaire, "Undelayed initialization in bearing only SLAM" (0)	2010.02.09
2-D visual SLAM with Extended Kalman Filter 연습 (0)	2010.01.25

posted by maetel

Branislav Kisačanin & Vladimir Pavlović & Thomas S. Huang <Real-Time Vision for Human-Computer Interaction>

2009. 11. 8. 16:31 Computer Vision

Branislav Kisačanin & Vladimir Pavlović & Thomas S. Huang
Real-Time Vision for Human-Computer Interaction
(RTV4HCI)
Springer, 2005
(google book's overview)

2004 IEEE CVPR Workshop on RTV4HCI - Papers
http://rtv4hci.rutgers.edu/04/

Computer vision and pattern recognition continue to play a dominant role in the HCI realm. However, computer vision methods often fail to become pervasive in the field due to the lack of real-time, robust algorithms, and novel and convincing applications.

Keywords:

head and face modeling
map building
pervasive computing
real-time detection

Contents:

RTV4HCI: A Historical Overview.
- Real-Time Algorithms: From Signal Processing to Computer Vision.
- Recognition of Isolated Fingerspelling Gestures Using Depth Edges.
- Appearance-Based Real-Time Understanding of Gestures Using Projected Euler Angles.
- Flocks of Features for Tracking Articulated Objects.
- Static Hand Posture Recognition Based on Okapi-Chamfer Matching.
- Visual Modeling of Dynamic Gestures Using 3D Appearance and Motion Features.
- Head and Facial Animation Tracking Using Appearance-Adaptive Models and Particle Filters.
- A Real-Time Vision Interface Based on Gaze Detection -- EyeKeys.
- Map Building from Human-Computer Interactions.
- Real-Time Inference of Complex Mental States from Facial Expressions and Head Gestures.
- Epipolar Constrained User Pushbutton Selection in Projected Interfaces.
- Vision-Based HCI Applications.
- The Office of the Past.
- MPEG-4 Face and Body Animation Coding Applied to HCI.
- Multimodal Human-Computer Interaction.
- Smart Camera Systems Technology Roadmap.
- Index.

RTV4HCI: A Historical Overview

TurkOverviewHCI05.pdf

Matthew Turk (mturk@cs.ucsb.edu)
University of California, Santa Barbara
http://www.stanford.edu/~mturk/
http://www.cs.ucsb.edu/~mturk/

The goal of research in real-time vision for human-computer interaction is to develop algorithms and systems that sense and perceive humans and human activity, in order to enable more natural, powerful, and effective computer interfaces.

Computers in the Human Interaction Loop (CHIL)

perceptual interfaces
multimodal interfaces
post-WIMP(windows, icons, menus, pointer) interfaces

implicit user awareness or explicit user control

The user interface
- the software and devices that implement a particular model (or set of models) of HCI

Computer vision technologies must ultimately deliver a better "user experience".

B Shneiderman, Designing the User Interface: Strategies for Effective Human-Computer Interaction, Third Edition, Addison-Wesley, 1998.
: 1) time to learn 2) speed of performance 3) user error rates 4) retention over time 5) subjective satisfaction

- Presence and location (Face and body detection, head and body tracking)
- Identity (Face recognition, gait recognition)
- Expression (Facial feature tracking, expression modeling and analysis)
- Focus of attention (Head/face tracking, eye gaze tracking)
- Body posture and movement (Body modeling and tracking)
- Gesture (Gesture recognition, hand tracking)
- Activity (Analysis of body movement)

eg.
VIDEOPLACE (M W Krueger, Artificial Reality II, Addison-Wesley, 1991)
Magic Morphin Mirror / Mass Hallucinations (T Darrell et al., SIGGRAPH Visual Proc, 1997)

Principal Component Analysis (PCA)
Linear Discriminant Analysis (LDA)
Gabor Wavelet Networks (GWNs)
Active Appearance Models (AAMs)
Hidden Markov Models (HMMs)

Identix Inc.
Viisage Technology Inc.
Cognitec Systems

- MIT Medial Lab
ALIVE system (P Maes et al., The ALIVE system: wireless, full-body interaction with autonomous agents, ACM Multimedia Systems, 1996)
PFinder system (C R Wren et al., Pfinder: Real-time tracking of the human body, IEEE Trans PAMI, pp 780-785, 1997)
KidsRoom project (A Bobick et al., The KidsRoom: A perceptually-based interactive and immersive story environment, PRESENCE: Teleoperators and Virtual Environments, pp 367-391, 1999)

Flocks of Features for Tracking Articulated Objects

KolschBook05.pdf

Mathias Kolsch (kolsch@nps.edu
Computer Science Department, Naval Postgraduate School, Monterey
Matthew Turk (mturk@cs.ucsb.edu)
Computer Science Department, University of California, Santa Barbara

flocking behavior

http://www.movesinstitute.org/~kolsch/HandVu/HandVu.html

Visual Modeling of Dynamic Gestures Using 3D Appearance and Motion Features

Visual Modeling of Dynamic Gestures Using 3D Appearance and.pdf

Guangqi Ye (grant@cs.jhu.edu), Jason J. Corso, Gregory D. Hager
Computational Interaction and Robotics Laboratory
The Johns Hopkins University

Map Building from Human-Computer Interactions

arsenioCVPRWorkshop.pdf

http://groups.csail.mit.edu/lbr/mars/pubs/pubs.html#publications
Artur M. Arsenio (arsenio@csail.mit.edu)
Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology

Vision-Based HCI Applications
Eric Petajan (eric@f2f-inc.com)
face2face animation, inc.
eric@f2f-inc.com

The Office of the Past

kim04office.pdf

Jiwon Kim (jwkim@cs.washington.edu), Steven M. Seitz (seitz@cs.washington.edu)
University of Washington
Maneesh Agrawala (maneesh@microsoft.com)
Microsoft Research
Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 10 - Volume 10 Page: 157 Year of Publication: 2004
http://desktop.google.com
http://grail.cs.washington.edu/projects/office/
http://www.realvnc.com/

Smart Camera Systems Technology Roadmap
Bruce Flinchbaugh (b-flinchbaugh@ti.com)
Texas Instruments

'Computer Vision' 카테고리의 다른 글

A. J. Haug "A Tutorial on Bayesian Estimation and Tracking Techniques Applicable to Nonlinear and Non-Gaussian Processes" (1)	2009.11.16
Particle filter 연습 (1)	2009.11.10
Kalman filter 연습 (1)	2009.11.05
M. Armstrong & A. Zisserman <Robust object tracking> (0)	2009.10.29
R. L. Thompson et al. <Providing synthetic views for teleoperation using visual pose tracking in multiple cameras> (1)	2009.10.27

posted by maetel

Brian Williams, Georg Klein and Ian Reid <Real-Time SLAM Relocalisation>

2009. 7. 23. 18:53 Computer Vision

Brian Williams, Georg Klein and Ian Reid
(Department of Engineering Science, University of Oxford, UK)
Real-Time SLAM Relocalisation

williams_etal_iccv2007.pdf

In Proceedings of the International Conference on Computer Vision, Rio de Janeiro, Brazil, 2007
demo 1
demo 2

• real-time, high-accuracy localisation and mapping during tracking
• real-time (re-)localisation when when tracking fails
• on-line learning of image patch appearance so that no prior training or map structure is required and features are added and removed during operation.

Lepetit's image patch classifier (feature appearance learning)
=> integrating the classifier more closely into the process of map-building
(by using classification results to aid in the selection of new points to add to the map)

> recovery from tracking failure: local vs. global
local - particle filter -> rich feature descriptor
global - proximity using previous key frames

- based on SceneLib (Extended Kalman Filter)
- rotational (and a degree of perspective) invariance via local patch warping
- assuming the patch is fronto-parallel when first seen
http://freshmeat.net/projects/scenelib/

active search

innovation covariance

joint compatibility test

randomized lists key-point recognition algorithm
1. randomized: (2^D - 1) tests -> D tests
2. independent treatment of classes
3. binary leaf scores (2^D * C * N bits for all scores)
4. intensity offset
5. explicit noise handing

training the classifier

The RANSAC (Random Sample Consensus) Algorithm

ref.
Davison, A. J. and Molton, N. D. 2007.
MonoSLAM: Real-Time Single Camera SLAM. IEEE Trans. Pattern Anal. Mach. Intell. 29, 6 (Jun. 2007), 1052-1067. DOI= http://dx.doi.org/10.1109/TPAMI.2007.1049

Vision-based global localization and mapping for mobile robots
Se, S. Lowe, D.G. Little, J.J. (MD Robotics, Brampton, Ont., Canada)

Lepetit, V. 2006.
Keypoint Recognition Using Randomized Trees. IEEE Trans. Pattern Anal. Mach. Intell. 28, 9 (Sep. 2006), 1465-1479. DOI= http://dx.doi.org/10.1109/TPAMI.2006.188

Lepetit, V., Lagger, P., and Fua, P. 2005.
Randomized Trees for Real-Time Keypoint Recognition. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cvpr'05) - Volume 2 - Volume 02 (June 20 - 26, 2005). CVPR. IEEE Computer Society, Washington, DC, 775-781. DOI= http://dx.doi.org/10.1109/CVPR.2005.288

Fischler, M. A. and Bolles, R. C. 1981.
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 6 (Jun. 1981), 381-395. DOI= http://doi.acm.org/10.1145/358669.358692

'Computer Vision' 카테고리의 다른 글

OpenCV 1.0 설치 on Mac OS X (0)	2009.07.27
cameras on mac os x (0)	2009.07.27
Durrant-Whyte & Bailey "Simultaneous localization and mapping" (0)	2009.07.22
임현, 이영삼 <이동로봇의 동시간 위치인식 및 지도작성(SLAM)> (3)	2009.07.21
Georg Klein & David Murrayt <Parallel Tracking and Mapping for Small AR Workspaces> (0)	2009.07.15

posted by maetel

A. J. Davison <Real-time simultaneous localisation and mapping with a single camera>

2009. 3. 31. 21:10 Computer Vision

Real-time simultaneous localisation and mapping with a single camera

Davison, A.J.
Dept. of Eng. Sci., Oxford Univ., UK;

Real-Time Simultaneous Localisation and Mapping with a Sing.pdf

This paper appears in: Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
Publication Date: 13-16 Oct. 2003
On page(s): 1403-1410 vol.2
ISBN: 0-7695-1950-4
INSPEC Accession Number: 7971070
Digital Object Identifier: 10.1109/ICCV.2003.1238654
Current Version Published: 2008-04-03

'Computer Vision' 카테고리의 다른 글

Rosten & Drummond <Machine learning for high-speed corner detection> (0)	2009.04.03
Montemerlo & Thrun <Simultaneous localization and mapping with unknown data association using FastSLAM> (0)	2009.03.31
<A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking> (0)	2009.03.31
Frank C. Park and Bryan J. Martin <Robot Sensor Calibrattion: Solving AX = XB on the Euclidean Group> (0)	2009.03.31
Rao-Blackwellized Particle Filter (0)	2009.03.31

posted by maetel

Civera, Davison & Montiel <Inverse Depth Parametrization for Monocular SLAM>

2009. 3. 26. 19:56 Computer Vision

Inverse Depth Parametrization for Monocular SLAM
Civera, J. Davison, A.J. Montiel, J.

InverseDepthParametrizationforMonocularSLAM.pdf

This paper appears in: Robotics, IEEE Transactions on
Publication Date: Oct. 2008
Volume: 24, Issue: 5
On page(s): 932-945
ISSN: 1552-3098
INSPEC Accession Number: 10301459
Digital Object Identifier: 10.1109/TRO.2008.2003276
First Published: 2008-10-03
Current Version Published: 2008-10-31

Javier Civera, Departamento de Informática e Ingeniería de Sistemas, Universidad de Zaragoza

Andrew J. Davison, Reader in Robot Vision at the Department of Computing, Imperial College London

Jose Maria Martinez Montiel, Robotics and Real Time Group, Universidad de Zaragoza

monocular simultaneous localization and mapping (SLAM)

representation of uncertainty

the standard extended Kalman filter (EKF)

direct parametrization of the inverse depth of features

feature initialization

camera motion estimates

6-D state vector --> converted to the Euclidean XYZ form

linearity index => automatic detection and conversion to maintain maximum efficiency

I. Introduction

monocular camera
: projective sensor measuring the beairng of image features

monocular (adj) 단안(單眼)(용)의, 외눈의

A stereo camera is a type of camera with two or more lenses. This allows the camera to simulate human binocular vision.

structure from motion = SFM
1) feature matching
2) global camera location & scene feature position estimates

sliding window processing

Sliding Window Protocol is a bi-directional data transmission protocol used in the data link layer (OSI model) as well as in TCP (transport layer of the OSI model). It is used to keep a record of the frame sequences sent and their respective acknowledgements received by both the users.

In robotics and computer vision, visual odometry is the process of determining the position and orientation of a robot by analyzing the associated camera images.

Odometry is the use of data from the movement of actuators to estimate change in position over time. Odometry is used by some robots, whether they be legged or wheeled, to estimate (not determine) their position relative to a starting location.

visual SLAM

probabilistic filtering approach

initializing uncertain depth estimates for distance features

Gaussian distributions implicit in the EKF

a new feature parametrization that is able to smoothly cope with initialization of features at all depth - even up to "infinity" - within the standard EKF framework: direct parametrization of inverse depth relative to the camera position from which a feature was first observed

A. Delayed and Undelayed Initialization

main map; main probabilistic state; main state vector

test for inclusion

delayed initialization
> treating newly detected features separately from the main map to reduce depth uncertainty before insertion into the full filter (with a standard XYZ representation)
- Features that retain low parallax over many frames (those very far from the camera or close to the motion epipole) are usually rejected completely because they never pass the test for inclusion
> (in 2-D and simulation) Initialization is delayed until the measurement equation is approximately Gaussian and the point can be safely triangulated.
> 3-D monocular vision with inertial sensing + auxiliary particle filter (in high frame rate sequence)

undelayed initialization
> While features with highly uncertain depths provide little information on camera translation, they are extremely useful as bearing references for orientation estimation.
: a multiple hypothesis scheme, initializing features at various depths and pruning those not reobserved in subsequent images
> Gaussian sum filter approximated by a federated information sharing method to keep the computational overhead low
-> to spread the Gaussian depth hypotheses along the ray according to inverse depth

Davision's particle method --> (Sola et al.) Gaussian sum filter --> (Civera at al.) new inverse depth scheme

A Gaussian sum is more efficient representation than particles (efficient enough that the separate Gaussians can call be put into the main state vector), but not as efficient as the single Gaussian representation that the inverse depth parametrization aalows.

B. Points at Infinity

efficient undelayed initialization + features at all depths (in outdoor scenes)

Point at infinity: a feature that exhibits no parallax during camera motion due to its extreme depth
-> not used for estimating camera translationm but for estimating rotation

The homogeneous coordinate systems of visual projective geometry used normally in SFM allow explicit representation of points at infinity(, and they have proven to play an important role during offline structure and motion estimation).

sequential SLAM system

Montiel and Davison: In special case where all features are known to be infinite -- in very-large-scale outdoor scenes or when the camera rotates on a tripod -- SLAM in pure angular coordinates turns the camera into a real-time visual compass.

Our probabilistic SLAM algorithm must be able to represent the uncertainty in depth of seemingly infinite features. Observing no parallax for a feature after 10 units of camera translation does tell us something about its depth -- it gives a reliable lower bound, which depends on the amount of motion made by the camera (if the feature had been closer than this, we would have observed parallax).

The explicit consideration of uncertainty in the locations of points has not been previously required in offline computer vision algorithms, but is very important in a more difficult online case.

C. Inverse Depth Representation

There is a unified and straightforward parametrization for feature locations that can handle both initialization and standard tracking of both close and very distant features within the standard EKF framework.

standard tracking

An explicit parametrization of the inverse depth of a feature along a semiinfinite ray from the position from which it was first viewed allows a Gaussian distribution to cover uncertainty in depth that spans a depth range from nearby to infinity, and permits seamless crossing over to finite depth estimates of features that have been apparently infinite for long periods of time.

linearity index + inverse depth parametrization

The projective nature of a camera means that the image measurement process is nearly linear in this inverse depth coordinate.

Inverse depth appears in the relation between image disparity and point depth in a stereo vision; it is interpreted as the parallax with respect to the plane at infinity. (Hartley and Zisserman)

Inverse depth is used to relate the motion field induced by scene points with the camera velocity in optical flow analysis.

modified polar coordinates

target motion analysis = TMA

EKF-based sequential depth estimation from camera-known motion

multibaseline stereo

matching robustness for scene symmetries

sequential EKF process using inverse depth
( ref. Stochastic Approximation and Rate-Distortion Analysis for Robust Structure and Motion Estimation )

undelayed initialization for 2-D monocular SLAM
( ref. A unified framework for nearby and distant landmarks in bearing-only SLAM )

FastSLAM-based system for monocular SLAM
( ref. Ethan Eade & Tom Drummond, Scalable Monocular SLAM )

special epipolar update step

FastSLAM

( ref. Civera, J. Davison, A.J. Montiel, J.M.M., Inverse Depth to Depth Conversion for Monocular SLAM
J. Montiel and A. J. Davison “A visual compass based on SLAM,” )

loop-closing

II. State Vector Definition

handheld camera motion
> constant angular and linear velocity model

quaternion

'Computer Vision' 카테고리의 다른 글

Ethan Eade & Tom Drummond <Scalable Monocular SLAM> (0)	2009.03.27
people in SLAM (0)	2009.03.27
camera calibration 09-02-16 (0)	2009.02.16
Special Issue on Visual SLAM (IEEE Transactions on Robotics, Vol. 24, No. 5) (0)	2009.02.14
IMU sensor calibration (0)	2009.02.06

posted by maetel

돛단배

Tag

Notice

Recent Post

Recent Comment

Recent Trackback

Archive

My Link

calendar

Category

'real-time'에 해당되는 글 9건

Zenitum's 4th Open Lab

4th Open Lab

September 14, 2010 | Written by admin

'Footmarks' 카테고리의 다른 글

test: composing OpenCV Iplimage and OpenGL graphics in one window screen

ARToolKit test log

'Computer Vision' 카테고리의 다른 글

virtual studio 구현: workflow

'Computer Vision' 카테고리의 다른 글

Seong-Woo Park & Yongduek Seo & Ki-Sang Hong <Real-Time Camera Calibration for Virtual Studio>

Abstract

'Computer Vision' 카테고리의 다른 글

Branislav Kisačanin & Vladimir Pavlović & Thomas S. Huang <Real-Time Vision for Human-Computer Interaction>

'Computer Vision' 카테고리의 다른 글

Brian Williams, Georg Klein and Ian Reid <Real-Time SLAM Relocalisation>

'Computer Vision' 카테고리의 다른 글

A. J. Davison <Real-time simultaneous localisation and mapping with a single camera>

'Computer Vision' 카테고리의 다른 글

Civera, Davison & Montiel <Inverse Depth Parametrization for Monocular SLAM>

'Computer Vision' 카테고리의 다른 글

티스토리툴바

Search

Tag

Notice

Recent Post

Recent Comment

Recent Trackback

Archive

My Link

calendar

Category

'real-time'에 해당되는 글 9건

September 14, 2010 | Written by admin

'Footmarks' 카테고리의 다른 글

'Computer Vision' 카테고리의 다른 글

'Computer Vision' 카테고리의 다른 글

Abstract

'Computer Vision' 카테고리의 다른 글

'Computer Vision' 카테고리의 다른 글

'Computer Vision' 카테고리의 다른 글

'Computer Vision' 카테고리의 다른 글

'Computer Vision' 카테고리의 다른 글

티스토리툴바