Autonomous robots are at advanced stage in various fields, and they are expected to autonomously work at the scenes of nursing care or medical care in the near future. In this paper, we focus on object counting task by images. Since the number of objects is not a mere physical quantity, it is difficult for conventional phisical sensors to measure such quantity and an intelligent sensing with higher-order recognition is required to accomplish such counting task. It is often that we count the number of objects in various situations. In the case of several objects, we can recognize the number at a glance. On the other hand, in the case of a dozen of objects, the task to count the number might become troublesome. Thus, simple and easy way to enumerate the objects automatically has been expected. In this study, we propose a method to recognize the number of objects by image. In general, the target object to count varies according to user's request. In order to accept the user's various requests, the region belonging to the desired object in the image is selected as a template. Main process of the proposed method is to search and count regions which resembles the template. To achieve robustness against spatial transformation, such as translation, rotation, and scaling, scale-invariant feature transform (SIFT) is employed as a feature. To show the effectiveness, the proposed method is applied to few images containing everyday objects, e.g., binders, cans etc.