研究者詳細 - 史　又華

写真a

シ　ヨウカ

史　又華

Scopus 論文情報

論文数: 82 Citation: 582 h-index: 13

Click to view the Scopus page. The data was downloaded from Scopus API in May 19, 2026, via http://api.elsevier.com and http://www.scopus.com .

Google Scholar 情報（Citations per year）

Citation: 798 h-index: 14 i10-index: 22

Click to view the Google Scholar page.

Scopus 情報

所属

理工学術院基幹理工学部

職名

教授

学位

博士（工学） ( 早稲田大学 )

ホームページ

http://www.eps.sci.waseda.ac.jp/teachers_popup/shi.html

学歴

　

-

2005年

早稲田大学工学研究科電子・情報通信学

所属学協会

　

　

　

電子情報通信学会
　

　

　

情報処理学会
　

　

　

IEEE
　

　

　

応用物理学会
　

　

　

人工知能学会

研究分野

計算機システム / 情報セキュリティ / 電子デバイス、電子機器

研究キーワード

高信頼性設計、エネルギーハーベスティング回路設計、AIハードウェア設計

受賞

IEEK Best Paper Award

2012年11月

論文

Bennet's Doubler-Extended Converter With Optimized Bias for Enhanced Energy Extraction From Triboelectric Nanogenerators

Yirui Su, Youhua Shi

IEEE Transactions on Power Electronics 2025年09月

DOI

Scopus

2

被引用数

(Scopus)
A Novel Security Threat Model for Automated AI Accelerator Generation Platforms

Chao Guo, Youhua Shi

IEEE Access 2025年

DOI

Scopus

2

被引用数

(Scopus)
DSE-Based Hardware Trojan Attack for Neural Network Accelerators on FPGAs

Chao Guo, Masao Yanagisawa, Youhua Shi

IEEE Transactions on Neural Networks and Learning Systems 2024年10月

　概要を見る

Over the past few years, the emergence and development of design space exploration (DSE) have shortened the deployment cycle of deep neural networks (DNNs). As a result, with these open-sourced DSE, we can automatically compute the optimal configuration and generate the corresponding accelerator intellectual properties (IPs) from the pretrained neural network models and hardware constraints. However, to date, the security of DSE has received little attention. Therefore, we explore this issue from an adversarial perspective and propose an automated hardware Trojan (HT) generation framework embedded within DSE. The framework uses an evolutionary algorithm (EA) to analyze user-input data to automatically generate the attack code before placing it in the final output accelerator IPs. The proposed HT is sufficiently stealthy and suitable for both single and multifield-programmable gate array (FPGA) designs. It can also implement controlled accuracy degradation attacks and specified category attacks. We conducted experiments on LeNet, VGG-16, and YOLO, respectively, and found that for the LeNet model trained on the CIFAR-10 dataset, attacking only one kernel resulted in 97.3% of images being classified in the category specified by the adversary and reduced accuracy by 59.58%. Moreover, for the VGG-16 model trained on the ImageNet dataset, attacking eight kernels can cause up to 96.53% of the images to be classified into the category specified by the adversary and causes the model's accuracy to decrease to 2.5%. Finally, for the YOLO model trained on the PASCAL VOC dataset, attacking with eight kernels can cause the model to identify the target as the specified category and cause slight perturbations to the bounding boxes. Compared to the un-compromised designs, the look-up tables (LUTs) overhead of the proposed HT design does not exceed 0.6%.

DOI

Scopus

4

被引用数

(Scopus)
A Dual-Output Rectifier-Based Self-Powered Interface Circuit for Triboelectric Nanogenerators

Yirui Su, Masao Yanagisawa, Youhua Shi

IEEE Transactions on Power Electronics 39 ( 6 ) 6630 - 6634 2024年06月

　概要を見る

Triboelectric nanogenerators (TENGs) offer a cost-effective solution for harvesting energy in Internet of Things (IoT) devices. However, their practical application is limited due to extremely high output voltage and low intrinsic capacitance, alongside the nonself-powered nature of current interface circuits and low transfer efficiency resulting from output voltage asymmetry. Addressing these issues, this letter introduces a dual-output rectifier-based interface circuit, innovatively designed to rectify TENG output into two distinct voltage magnitudes, optimizing for energy harvesting and switching generation. The experimental results validate our approach, showing gains of 2.75 and 2.34 times in terms of maximum output power over a full-wave rectifier (FWR)-based design at 2 and 3 Hz, respectively. Furthermore, under identical frequency and load conditions (1 MΩ at 2 and 3 Hz), the output gains reached 152 and 160 times that of the FWR. Our approach brings about a significant advancement in TENG integration for low-frequency and low-load IoT devices. This letter is accompanied by two videos demonstrating the charge response of 1 MΩ load at 2 and 3 Hz, respectively.

DOI

Scopus

11

被引用数

(Scopus)
Transition Detector-Based Radiation-Hardened Latch for Both Single- And Multiple-Node Upsets

Saki Tajima, Masao Yanagisawa, Youhua Shi

IEEE Transactions on Circuits and Systems II: Express Briefs 67 ( 6 ) 1114 - 1118 2020年06月

　概要を見る

This brief presents an output transition detector-based radiation-hardened latch (TDRHL) for reliability improvement. With an error recovery assistant logic and an in-situ transition detector, for any radiation induced single- and double-node upsets, the proposed TDRHL can 1) provide full self-recovery capability and 2) generate a warning signal for architecture-level recovery when soft errors cause the latch output flipped. The evaluation results show that TDRHL outperforms state-of-the-art double-node upset tolerant designs with addition error detection capability, and up to 5.0X power-delay-product improvement can be achieved.

DOI

Scopus

24

被引用数

(Scopus)
Robust secure scan design against scan-based differential cryptanalysis

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEEE Transactions on Very Large Scale Integration (VLSI) Systems 20 ( 1 ) 176 - 181 2012年01月

　概要を見る

Scan technology carries the potential risk of being misused as a side channel to leak out the secrets of crypto cores. The existing scan-based attacks could be viewed as one kind of differential cryptanalysis, which takes advantages of scan chains to observe the bit changes between pairs of chosen plaintexts so as to identify the secret keys. To address such a design/test challenge, this paper proposes a robust secure scan structure design for crypto cores as a countermeasure against scan-based attacks to maintain high security without compromising the testability. © 2006 IEEE.

DOI

Scopus

24

被引用数

(Scopus)
Improved launch for higher TDF coverage with fewer test patterns

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 29 ( 8 ) 1294 - 1299 2010年08月

　概要を見る

Due to the limitations of scan structure, the second vector in transition delay test is usually applied either by shift operation or by functional launch, which possibly results in unsatisfying transition delay fault (TDF) coverage. To overcome such a limitation for higher TDF coverage, a novel improved launch delay test technique that combines the pros of launch-on-shift and launch-on-capture tests is introduced in this paper. The proposed method can achieve near perfect TDF coverage with fewer test patterns without the need for a global fast scan enable signal. Experimental results on ISCAS89 and ITC99 benchmark circuits are included to show the effectiveness of the proposed method. © 2010 IEEE.

DOI

Scopus
Robust Training to Secure Automated AI Accelerator Generation Against Malicious Platforms

Chao Guo, Youhua Shi

2026年

DOI

Scopus
Strategy for Improving Cycle of Maximized Energy Output of Triboelectric Nanogenerators

Yirui Su, Masao Yanagisawa, Youhua Shi

Proceedings of the 2023 International Conference on IC Design and Technology, ICICDT 2023 131 - 135 2023年

　概要を見る

Triboelectric nanogenerator (TENG) is a new energy harvesting technology proposed in recent years. However, the output energy per cycle from TENG to a load has been proven to be confined within Cycle of Maximized Energy Output (CMEO) which is always used as a common standard to evaluate the performance of a TENG. In this work, a new energy extraction strategy is proposed that utilizes the energy of TENG's negative half cycle to complete the pre-biasing for TENG's positive half cycle, which realizes the output of TENG per cycle beyond the CMEO limit. The simulation results show that the output energy per cycle of this strategy is 1.74 times that of CMEO.

DOI

Scopus

1

被引用数

(Scopus)
An Area-Power-Efficient Multiplier-less Processing Element Design for CNN Accelerators

Jiaxiang Li, Masao Yanagisawa, Youhua Shi

Proceedings of International Conference on ASIC 2023年

　概要を見る

Machine learning has achieved remarkable success in various domains. However, the computational demands and memory requirements of these models pose challenges for deployment on privacy-secured or wearable edge devices. To address this issue, we propose an area-power-efficient multiplier-less processing element (PE) in this paper. Prior to implementing the proposed PE, we apply a power-of-2 dictionary-based quantization to the model. We analyze the effectiveness of this quantization method in preserving the accuracy of the original model and present the standard and a specialized diagram illustrating the schematics of the proposed PE. Our evaluation results demonstrate that our design achieves approximately 30% lower power consumption and 35% smaller core area compared to a conventional multiplication-and-accumulation (MAC) PE. Moreover, the applied quantization reduces the model size and operand bit-width, resulting in reduced on-chip memory usage and energy consumption for memory accesses.

DOI

Scopus

1

被引用数

(Scopus)
Scalable Hardware Efficient Architecture for Parallel FIR Filters with Symmetric Coefficients

Jinghao Ye, Masao Yanagisawa, Youhua Shi

Electronics (Switzerland) 11 ( 20 ) 2022年10月

　概要を見る

Symmetric convolutions can be utilized for potential hardware resource reduction. However, they have not been realized in state-of-the-art transposed block FIR designs. Therefore, we explore the feasibility of symmetric convolution in transposed parallel FIRs and propose a scalable hardware efficient parallel architecture. The proposed design inserts delay elements after multipliers for temporal reuse of intermediate tap products. By doing this, the number of required multipliers can be reduced by half. As a result, we can achieve up to 3.2× and 1.64× area efficiency improvements over the modern transposed block method on reconfigurable and fixed designs, respectively. These results confirm the effectiveness of the proposed STB-FIR architecture for hardware-efficient, high-speed signal processing.

DOI

Scopus

5

被引用数

(Scopus)
Dataflow Optimization through Exploring Single-Layer and Inter-Layer Data Reuse in Memory-Constrained Accelerators

Jinghao Ye, Masao Yanagisawa, Youhua Shi

Electronics (Switzerland) 11 ( 15 ) 2022年08月

　概要を見る

Off-chip memory access has become the performance and energy bottleneck in memory-constrained neural network accelerators. To provide a solution for the energy efficient processing of various neural network models, this paper proposes a dataflow optimization method for modern neural networks by exploring the opportunity of single-layer and inter-layer data reuse to minimize the amount of off-chip memory access in memory-constrained accelerators. A mathematical analysis of three inter-layer data reuse methods is first presented. Then, a comprehensive exploration to determine the optimal data reuse strategy from single-layer and inter-layer data reuse approaches is proposed. The result shows that when compared to the existing single-layer-based exploration method, SmartShuttle, the proposed approach can achieve up to 20.5% and 32.5% of off-chip memory access reduction for ResNeXt-50 and DenseNet-121, respectively.

DOI

Scopus
Power-Efficient Deep Convolutional Neural Network Design through Zero-Gating PEs and Partial-Sum Reuse Centric Dataflow

Lin Ye, Jinghao Ye, Masao Yanagisawa, Youhua Shi

IEEE Access 9 17411 - 17420 2021年

　概要を見る

Convolution neural networks (CNNs) have shown great success in many areas such as object detection and pattern recognition at the cost of extreme high computation complexity and significant external memory access, which makes state-of-the-art deep CNNs difficult to be implemented on resource-constrained portable/wearable devices with limited capacity of battery. To address this design challenge, a power-efficient CNN design through zero-gating processing elements (PEs) and partial-sum reuse centric dataflow is proposed in this paper. Unlike the existing works which either only consider the zeros in activation maps or use off-chip training process for on-chip computation reduction, a zero-gating PE design is proposed to avoid unnecessary on-chip computation by taking advantages of the large number of zeros in both the filter's weights of pre-trained models and the activation maps. Furthermore, a partial-sum reuse centric dataflow is also proposed for off-chip DRAM access reduction. The evaluation results show that the overall power consumption of PE arrays with our proposal can be reduced by 37% and 14% at the cost of 8% and 1% area overhead when compared to the baseline PE design and the existing only-activation-gated design (i.e. that in Eyeriss), respectively. Moreover, the proposed method can achieve 35% and 47% DRAM access reduction with the corresponding 14% and 49% energy savings for AlexNet and VGG-16 when compared to that in Eyeriss.

DOI

Scopus

5

被引用数

(Scopus)
A Reconfigurable Area and Energy Efficient Hardware Accelerator of Five High-order Operators for Vision Sensor Based Robot Systems

Qianjin Wang, Yi Zhan, Bingqiang Liu, Jiajun Wu, Youhua Shi, Guoyi Yu, Chao Wang

2021 IEEE International Conference on Integrated Circuits, Technologies and Applications, ICTA 2021 189 - 190 2021年

　概要を見る

This paper proposes a reconfigurable hardware accelerator design of five major high-order operators for vision sensor based robot systems. These five high-order operators include convolution, median filtering, Euclidean distance, vector inner-product and iToF, which are intensively used in robot vision algorithms. In this work, a reconfigurable hardware accelerator design method for multiple high-order operators is proposed. FPGA implementation results show that the proposed design has achieved area efficiency with 17.54% reduced LUTs and 44.02% reduced FFs against the baseline hardware design of the five high-order operators. Case studies of Laplace edge-detection and iToF benchmark demonstrate the energy efficiency of proposed design with 19.70% and 6.2% reduction in energy consumption, respectively.

DOI

Scopus
A High-Performance Symmetric Hybrid Form Design for High-Order FIR Filters

Jinghao Ye, Masao Yanagisawa, Youhua Shi

Proceedings of 2020 IEEE Asia Pacific Conference on Circuits and Systems, APCCAS 2020 121 - 124 2020年12月

　概要を見る

In this paper, a symmetric hybrid form for high performance finite impulse response (FIR) filters with symmetric coefficients is proposed, which can be utilized in both fixed and reconfigurable FIR implementations to solve the driving capacity problem caused by the high fanout signals in the existing symmetric transposed form based FIR architecture. The evaluation results show that, when compared with the existing high speed FIR designs such as the symmetric systolic form in [13] and the hybrid form in [1], the proposed form can achieve significant area and power savings with great ADP and PDP reduction. Moreover, when compared with the symmetric systolic form in [13] the required latency can be approximately reduced by 33.3%, which clearly shows the performance improvement of the proposed method.

DOI

Scopus

1

被引用数

(Scopus)
Faithfully truncated adder-based area-power efficient FIR design with predefined output accuracy

Jinghao Ye, Masao Yanagisawa, Youhua Shi

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E103A ( 9 ) 1063 - 1070 2020年09月

　概要を見る

To solve the area and power problems in Finite Impulse Response (FIR) implementations, a faithfully truncated adder-based FIR design is presented in this paper for significant area and power savings while the predefined output accuracy can still be obtained. As a solution to the accuracy loss caused by truncated adders, a static error analysis on the utilization of truncated adders in FIRs was performed. According to the mathematical analysis, we show that, with a given accuracy constraint, the optimal truncated adder configuration for an area-power efficient FIR design can be effortlessly determined. Evaluation results on various FIR implementations by using the proposed faithfully truncated adder designs showed that up to 35.4% and 27.9% savings in area and power consumption can be achieved with less than 1 ulp accuracy loss for uniformly distributed random inputs. Moreover, as a case study for normally distributed signals, a fixed 6-tap FIR is implemented for electrocardiogram (ECG) signal filtering was implemented, in which even with the increased truncated bits up to 10, the mean absolute error (E) can be guaranteed to be less than 1 ulp while up to 29.7% and 25.3% savings in area and power can be obtained.

DOI

Scopus

1

被引用数

(Scopus)
A Power-Efficient Soft Error Hardened Latch Design with In-Situ Error Detection Capability

Saki Tajima, Masao Yanagisawa, Youhua Shi

Asia Pacific Conference on Postgraduate Research in Microelectronics and Electronics 2019-November 53 - 56 2019年11月

　概要を見る

A power-efficient single node upset hardened latch design with in-situ error detection capability, EDSL, is proposed in this paper for reliability improvement against soft errors. Our simulation results show that the proposed EDSL design can not only recover from any incurred single node upset, but also provide in-situ error detection capability when the latch output is upset. When compared with state-of-the-art error-detection-based and SNU resilient designs, the proposed EDSL latch can achieve up to 72.25% and 79.74% reduction of power-delay-product respectively, which clearly shows the effectiveness of the proposed method.

DOI

Scopus
A Zero-Gating Processing Element Design for Low-Power Deep Convolutional Neural Networks

Lin Ye, Jinghao Ye, Masao Yanagisawa, Youhua Shi

Proceedings - APCCAS 2019: 2019 IEEE Asia Pacific Conference on Circuits and Systems: Innovative CAS Towards Sustainable Energy and Technology Disruption 317 - 320 2019年11月

　概要を見る

Convolution neural networks (CNNs) have shown great success in many areas such as object detection and pattern recognition. However, the high computational complexity of state-of-the-art deep CNNs makes them extreme difficult to be run on resource-constrained mobile and wearable devices. To address this design challenge, in this paper we first analyzed the filters' weights of pre-trained models from four state-of-the-art CNNs. We found that in all the CNNs that we analyzed, from about 20% (AlexNet) to 43% (VGG-19) of the weights are zeros, which lead to redundant large amounts of computation. Then, based on this observation, a zero-gating processing element (PE) design was proposed for low-power deep CNNs, in which the vast number of zeros in both activation maps and filter weights are explored to eliminate redundant computation for power reduction. We implemented our proposal with VGG-16 using ImageNet dataset. Experiments were conducted for evaluations of area and total power consumption. Compared with the baseline PE design without zero-gating, overall the proposed zero-gating PE can achieve 37% power saving while the corresponding area overhead is less than 8%.

DOI

Scopus

5

被引用数

(Scopus)
A Bit-Segmented Adder Chain based Symmetric Transpose Two-Block FIR Design for High-Speed Signal Processing

Jinghao Ye, Masao Yanagisawa, Youhua Shi

Proceedings - APCCAS 2019: 2019 IEEE Asia Pacific Conference on Circuits and Systems: Innovative CAS Towards Sustainable Energy and Technology Disruption 29 - 32 2019年11月

　概要を見る

A high-speed FIR filter structure is proposed in this paper by utilizing bit-segmentation adders and symmetric transpose 2-block FIR structure. First, a bit-segmented adder chain-based design is proposed with bit-segmentation adders. Second, a basic unit design of symmetric transpose block FIR is proposed to reduce the critical path delay. The evaluation results show that, when compared with state-of-the-art high-speed CSD multiplier-based FIR filter design, the proposed design requires 14.1% less area while provides 7.9% frequency improvement, 10.2% reduction of power consumption, 22.8% reduction of energy-delay-product and 20.4% reduction of area-delay-product, which shows the effectiveness of the proposed method.

DOI

Scopus

1

被引用数

(Scopus)
An adder-segmentation-based FIR for high speed signal processing

Jinghao Ye, Masao Yanagisawa, Youhua Shi

Proceedings of International Conference on ASIC 2019年10月

　概要を見る

An advanced adder-segmentation-based FIR filter design for high speed signal processing is proposed in this paper. In the proposed method, the critical path delay is shortened through adder segmentation. An analysis for the optimization of adder segmentation is also proposed, which can be used for critical path delay balance to maximize the performance of FIR filters. The evaluation results show that the proposed design can achieve up to 30.7% and 22.8% reduction in area-delay-product (ADP) and energy-delay-product (EDP) when compared with the existing FIR filters.

DOI

Scopus

1

被引用数

(Scopus)
Static error analysis and optimization of faithfully truncated adders for area-power efficient FIR designs

Jinghao Ye, Nozomu Togawa, Masao Yanagisawa, Youhua Shi

Proceedings - IEEE International Symposium on Circuits and Systems 2019-May 2019年

　概要を見る

Faithfully truncated adders are used for low cost FIR implementations in this paper, which improves state-of-the-art CSD-based FIR filter designs for further area and power reduction while meeting the accuracy requirement. As a solution to the accuracy loss caused by truncated adders, this paper performed a static error analysis of truncated adders. Furthermore, based upon our mathematical analysis, we show that, with a given accuracy constraint, an optimal truncated adder configuration can be effortlessly determined for area-power efficient FIR designs. Evaluation results on various FIR designs showed that 16.8%~35.4% reduction in area and 11.8%~27.9% in power saving can be achieved with the proposed optimal truncated adder designs within an average error of 1 ulp.

DOI

Scopus

6

被引用数

(Scopus)
Hardware Trojan Detection Utilizing Machine Learning Approaches

Kento Hasegawa, Youhua Shi, Nozomu Togawa

Proceedings - 17th IEEE International Conference on Trust, Security and Privacy in Computing and Communications and 12th IEEE International Conference on Big Data Science and Engineering, Trustcom/BigDataSE 2018 1891 - 1896 2018年09月

　概要を見る

Hardware security has become a serious concern in recent years. Due to the outsourcing in hardware production, malicious circuits (or hardware Trojans) can be easily inserted into hardware products by attackers. Since hardware Trojans are tiny and stealthy, their detection is difficult. Under the circumstances, numerous hardware-Trojan detection methods have been proposed. In this paper, we elaborate the overview of hardware-Trojan detection and review the hardware-Trojan detection methods using machine learning which is one of the state-of-the-art approaches.

DOI

Scopus

42

被引用数

(Scopus)
Extension and performance/accuracy formulation for optimal GeAr-based approximate adder designs

Ken Hayamizu, Nozomu Togawa, Masao Yanagisawa, Youhua Shi

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E101A ( 7 ) 1014 - 1024 2018年07月

　概要を見る

Approximate computing is a promising solution for future energy-efficient designs because it can provide great improvements in performance, area and/or energy consumption over traditional exact-computing designs for non-critical error-tolerant applications. However, the most challenging issue in designing approximate circuits is how to guarantee the pre-specified computation accuracy while achieving energy reduction and performance improvement. To address this problem, this paper starts from the state-of-the-art general approximate adder model (GeAr) and extends it for more possible approximate design candidates by relaxing the design restrictions. And then a maximum-error-distance-based performance/accuracy formulation, which can be used to select the performance/energy-accuracy optimal design from the extended design space, is proposed. Our evaluation results show the effectiveness of the proposed method in terms of area overhead, performance, energy consumption, and computation accuracy.

DOI

Scopus

2

被引用数

(Scopus)
A low power soft error hardened latch with schmitt-trigger-based C-Element

Saki Tajima, Nozomu Togawa, Masao Yanagisawa, Youhua Shi

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E101A ( 7 ) 1025 - 1034 2018年07月

　概要を見る

To deal with the reliability issue caused by soft errors, this paper proposed a low power soft error hardened latch (SHC) design using a novel Schmitt-Trigger-based C-element for reliable low power applications. Unlike state-of-the-art soft error tolerant latches that are usually based on hardware redundancy with large area overhead and high power consumption, the proposed SHC latch is implemented through double-sampling and node-checking using a novel Schmitt-Trigger-based C-element, which can help to reduce the area overhead and the corresponding power consumption as well. The evaluation results show that the total number of transistors of the proposed SHC latch is only increased by 2 when compared to the conventional unhardened C2MOS latch, while up to 20.35% and 82.96% power reduction can be achieved when compared to the conventional un-hardened C2MOS latch and the existing soft error tolerant HiPeR design, respectively.

DOI

Scopus

7

被引用数

(Scopus)
A low cost and high speed CSD-based symmetric transpose block FIR implementation

Jinghao Ye, Youhua Shi, Nozomu Togawa, Masao Yanagisawa

Proceedings of International Conference on ASIC 2017-October 311 - 314 2017年07月

　概要を見る

In this paper, a low cost and high speed CSD-based symmetric transpose block FIR design was proposed for low cost digital signal processing. First, the existing area-efficient CSD-based multiplier was optimized by considering the reusability and the symmetry of coefficients for area reduction. Second, the position of the input register was changed for high speed transpose block FIR processing in which half of the number of required multipliers can be saved. When compared with the existing block FIR designs, the proposed FIR design can increase the data rate from 238.66 MHz to 373.13 MHz while saving 10.89% area and 21.30% energy consumption as well.

DOI

Scopus

8

被引用数

(Scopus)
Soft error tolerant latch designs with low power consumption (invited paper)

Saki Tajima, Nozomu Togawa, Masao Yanagisawa, Youhua Shi

Proceedings of International Conference on ASIC 2017-October 52 - 55 2017年07月

　概要を見る

As semiconductor technology continues scaling down, the reliability issue has become much more critical than ever before. Unlike traditional hard-errors caused by permanent physical damage which can't be recovered in field, soft errors are caused by radiation or voltage/current fluctuations that lead to transient changes on internal node states, thus they can be viewed as temporary errors. However, due to the unpredictable occurrence of soft errors, it is desirable to develop soft error tolerant designs. For this reason, soft error tolerant design techniques have gained great research interest. In this paper, we will explain the soft error mechanism and then review the existing soft error tolerant design techniques with particular emphasis on SEH family because they can achieve low power consumption and small performance overhead as well.

DOI

Scopus

2

被引用数

(Scopus)
Improved monitoring-path selection algorithm for suspicious timing error prediction based timing speculation

Shinnosuke Yoshida, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings - 2015 IEEE 11th International Conference on ASIC, ASICON 2015 2016年07月

　概要を見る

As process technology is scaling down, timing speculation techniques such as Razor and STEP are emerged as alternative solutions to reduce required margins due to various variation effects. Unlike Razor, STEP is a prediction-based timing speculation method to predict suspicious timing errors before they really appear, and thus it can result in more performance improvement. Therefore, an improved monitoring-path selection algorithm for STEP-based timing speculation is proposed in this paper, in which candidate monitoring-paths are selected based on short path removement and path length estimation. Experimental results show that the proposed algorithm realizes an average of 1.71X overclocking compared with worst-case based designs.

DOI

Scopus
A delay variation and floorplan aware high-level synthesis algorithm with body biasing

Koki Igawa, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings - International Symposium on Quality Electronic Design, ISQED 2016-May 75 - 80 2016年05月

　概要を見る

In this paper, we propose a delay variation and floorplan aware high-level synthesis algorithm with body biasing, which minimizes the average leakage energy of manufactured chips. To realize a floorplan-oriented high-level synthesis, we utilize a huddle-based distributed register architecture (HDR architecture), one of the DR architectures. HDR architecture divides the chip area into small partitions called a huddle and we can control a body bias voltage for every huddle. During high-level synthesis, we iteratively obtain expected leakage energy for every huddle when applying a body bias voltage. A huddle with smaller expected leakage energy contributes to reducing expected leakage energy of the entire circuit but can increase the latency. We assign CDFG nodes in critical paths to the huddles with larger expected leakage energy and those in non-critical paths to the huddles with smaller expected leakage energy. We expect to minimize the entire leakage energy in a manufactured chip without increasing its latency. Experimental results show that our algorithm reduces the average leakage energy by up to 38.9% without latency and yield degradation compared with typical-case design with body biasing.

DOI

Scopus

1

被引用数

(Scopus)
In-situ Trojan authentication for invalidating hardware-Trojan functions

Masaru Oya, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings - International Symposium on Quality Electronic Design, ISQED 2016-May 152 - 157 2016年05月

　概要を見る

Due to the fact that we do not know who will create hardware Trojans (HTs), and when and where they would be inserted, it is very difficult to correctly and completely detect all the real HTs in untrusted ICs, and thus it is desired to incorporate in-situ HT invalidating functions into untrusted ICs as a countermeasure against HTs. This paper proposes an in-situ Trojan authentication technique for gate-level netlists to avoid security leakage. In the proposed approach, an untrusted IC operates in authentication mode and normal mode. In the authentication mode, an embedded Trojan authentication circuit monitors the bit-flipping count of a suspicious Trojan net within the pre-defined constant clock cycles and identify whether it is a real Trojan or not. If the authentication condition is satisfied, the suspicious Trojan net is validated. Otherwise, it is invalidated and HT functions are masked. By doing this, even untrusted netlists with HTs can still be used in the normal mode without security leakage. By setting the appropriate authentication condition using training sets from Trust-HUB gate-level benchmarks, the proposed technique invalidates successfully only HTs in the training sets. Furthermore, by embedding the in-situ Trojan authentication circuit into a Trojan-inserted AES crypto netlist, it can run securely and correctly even if HTs exist where its area overhead is just 1.5% with no delay overhead.

DOI

Scopus

6

被引用数

(Scopus)
Timing monitoring paths selection for wide voltage IC

Weiwei Shan, Wentao Dai, Youhua Shi, Peng Cao, Xiaoyan Xiang

IEICE Electronics Express 13 ( 8 ) 2016年03月

　概要を見る

Wide voltage range circuit has got widespread attention where in-situ timing monitoring based adaptive voltage scaling (AVS) becomes necessary to reduce the design margin. However, the severe PVT variations across near-threshold to super-threshold cause too many critical paths to be monitored. Here activation oriented monitoring paths selection method is proposed to reduce the monitored paths for wide voltage IC. The minimum delay value of the longest activated path is found by dynamic timing analysis and set as the selection threshold. Those paths longer than this threshold by STA analysis are selected to be monitored. Applied on a 40 nm AVS Systemon-Chip, it reduces the monitoring paths to only 22% of all critical paths with remarkable power gains under 0.6 V–1.1 V.

DOI

Scopus

6

被引用数

(Scopus)
A process-variation-aware multi-scenario high-level synthesis algorithm for distributed-register architectures

Koki Igawa, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

International System on Chip Conference 2016-February 7 - 12 2016年02月

　概要を見る

In order to tackle a process-variation problem, we can define several scenarios, each of which corresponds to a particular LSI behavior, such as a typical-case scenario and a worst-case scenario. By designing a single LSI chip which realizes multiple scenarios simultaneously, we can have a process-variation-tolerant LSI chip. In this paper, we propose a process-variation-aware low-latency and multi-scenario high-level synthesis algorithm targeting new distributed-register architectures, called HDR architectures. We assume two scenarios, a typical-case scenario and a worst-case scenario, and realize them onto a single chip. We first schedule/bind each of the scenarios independently. After that, we commonize the scheduling/binding results for the typical-case and worst-case scenarios and thus generate a commonized area-minimized floorplan result. Experimental results show that our algorithm reduces the latency of the typical-case scenario by up to 50% without increasing the latency of the worst-case scenario, compared with several existing methods.

DOI

Scopus

2

被引用数

(Scopus)
A hardware-trojans identifying method based on trojan net scoring at gate-level netlists

Masaru Oya, Youhua Shi, Noritaka Yamashita, Toshihiko Okamura, Yukiyasu Tsunoo, Satoshi Goto, Masao Yanagisawa, Nozomu Togawa

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E98A ( 12 ) 2537 - 2546 2015年12月

　概要を見る

Outsourcing IC design and fabrication is one of the effective solutions to reduce design cost but it may cause severe security risks. Particularly, malicious outside vendors may implement Hardware Trojans (HTs) on ICs. When we focus on IC design phase, we cannot assume an HT-free netlist or a Golden netlist and it is too difficult to identify whether a given netlist is HT-free or not. In this paper, we propose a score-based hardware-trojans identifying method at gate-level netlists without using a Golden netlist. Our proposed method does not directly detect HTs themselves in a gate-level netlist but it detects a net included in HTs, which is called Trojan net, instead. Firstly, we observe Trojan nets from several HT-inserted benchmarks and extract several their features. Secondly, we give scores to extracted Trojan net features and sum up them for each net in benchmarks. Then we can find out a score threshold to classify HTfree and HT-inserted netlists. Based on these scores, we can successfully classify HT-free and HT-inserted netlists in all the Trust-HUB gate-level benchmarks and ISCAS85 benchmarks as well as HT-free and HT-inserted AES gate-level netlists. Experimental results demonstrate that our method successfully identify all the HT-inserted gate-level benchmarks to be "HTinserted" and all the HT-free gate-level benchmarks to be "HT-free" in approximately three hours for each benchmark.

DOI

Scopus

12

被引用数

(Scopus)
A floorplan-aware high-level synthesis technique with delay-variation tolerance

Kazushi Kawamura, Yuta Hagio, Youhua Shi, Nozomu Togawa

Proceedings of the 2015 IEEE International Conference on Electron Devices and Solid-State Circuits, EDSSC 2015 122 - 125 2015年09月

　概要を見る

For realizing better trade-off between performance and yield rate in recent LSI designs, it is required to deal with increasing the ratios of interconnect delay as well as delay variation. In this paper, a novel floorplan-aware high-level synthesis technique with delay-variation tolerance is proposed. By utilizing floorplan-driven architectures, interconnect delays can be estimated and then handled even in high-level synthesis. Applying our technique enables to realize two scheduling/binding results (one is a non-delayed result and the other is a delayed result) simultaneously on a chip with small area/performance overhead, and either one of them can be selected according to the post-silicon delay variation. Experimental results demonstrate that our technique can reduce delayed scheduling/binding latency by up to 32.3% compared with conventional approaches.

DOI

Scopus

2

被引用数

(Scopus)
A universal delay line circuit for variation resilient IC with self-calibrated time-to-digital converter

Shuai Shao, Youhua Shi, Wentao Dai, Jianyi Meng, Weiwei Shan

Proceedings of the 2015 IEEE International Conference on Electron Devices and Solid-State Circuits, EDSSC 2015 126 - 129 2015年09月

　概要を見る

A universal delay monitor used to imitate the real critical paths is developed for variation resilient integrated circuit. This monitor is constructed based on the different proportion of logic cells and interconnects. The delay of the monitor is detected by a time-to-digital converter which keeps the sampling results precise. To reduce the deviation of the sampling results caused by PVT, a novel time-to-digital converter with self-calibration mechanism is developed. This variation resilient method based adaptive voltage scaling is applied on an ARM7 based System on a Chip on 0.18 μm CMOS process with a 112M signoff frequency and an area of 1.3∗1.3 mm2. The simulation results show that it has a 43.42% gain of power consumption under FF corner, -25°C compared to the fixed 1.8 V traditional design.

DOI

Scopus
FPGA-based SHA-3 acceleration on a 32-bit processor via instruction set extension

Yi Wang, Youhua Shi, Chao Wang, Yajun Ha

Proceedings of the 2015 IEEE International Conference on Electron Devices and Solid-State Circuits, EDSSC 2015 305 - 308 2015年09月

　概要を見る

As embedded systems play more and more important roles Internet of Things (IoT), the integration of cryptographic functionalities is an urgent demand to ensure data and information security. Recently, Keccak was declared as the winner of the third generation of Secure Hashing Algorithm (SHA-3). However, implementing SHA-3 on a specific 32-bit processor failed to meet the performance requirement. On the other hand, implementing it as a cryptographic coprocessor consumes a lot of extra area and requires customized driver program. Although implementing Keccak on a 64-bit platform is more efficient, this platform is not suitable for embedded implementation. In this paper, we propose a novel SHA-3 implementation using instruction set extension based on a 32-bit LEON3 processor (an open source processor), with the goals of reducing execution cycles and code size. Experimental results show that the proposed design reduces around 87% execution cycles and 10.5% code size as compared to reference designs. Our design takes up only 9.44% extra area with negligible speed overhead compared to the standard LEON3 processor. Compared to the existing hardware accelerators, our proposed design occupies only half of area resources and does not require extra driver programs to be developed when integrated into the overall system.

DOI

Scopus

13

被引用数

(Scopus)
Scan-based side-channel attack against symmetric key ciphers using scan signatures

Mika Fujishiro, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings of the 2015 IEEE International Conference on Electron Devices and Solid-State Circuits, EDSSC 2015 309 - 312 2015年09月

　概要を見る

There are a number of studies on a side-channel attack which uses information exploited from the physical implementation of a cryptosystem. A scan-based side-channel attack utilizes scan chains, one of design-for-test techniques and retrieves the secret information inside the cryptosystem. In this paper, scan-based side-channel attack methods against symmetric key ciphers such as block ciphers and stream ciphers using scan signatures are presented to show the risk of scan-based attacks.

DOI

Scopus

1

被引用数

(Scopus)
An energy-efficient floorplan driven high-level synthesis algorithm for multiple clock domains design

Shin Ya Abe, Youhua Shi, Kimiyoshi Usami, Masao Yanagisawa, Nozomu Togawa

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E98A ( 7 ) 1376 - 1391 2015年07月

　概要を見る

In this paper, we first propose an HDR-mcd architecture, which integrates periodically all-in-phase based multiple clock domains and multi-cycle interconnect communication into high-level synthesis. In HDR-mcd, an entire chip is divided into several huddles. Huddles can realize synchronization between different clock domains in which interconnection delay should be considered during high-level synthesis. Next, we propose a high-level synthesis algorithm for HDR-mcd, which can reduce energy consumption by optimizing configuration and placement of huddles. Experimental results show that the proposed method achieves 32.5% energy-saving compared with the existing single clock domain based methods

DOI

Scopus

1

被引用数

(Scopus)
An effective suspicious timing-error prediction circuit insertion algorithm minimizing area overhead

Shinnosuke Yoshida, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E98A ( 7 ) 1406 - 1418 2015年07月

　概要を見る

As process technologies advance, timing-error correction techniques have become important as well. A suspicious timing-error prediction (STEP) technique has been proposed recently, which predicts timing errors by monitoring themiddle points, or check points of several speedpaths in a circuit. However, if we insert STEP circuits (STEPCs) in the middle points of all the paths from primary inputs to primary outputs, we need many STEPCs and thus require too much area overhead. How to determine these check points is very important. In this paper, we propose an effective STEPC insertion algorithm minimizing area overhead. Our proposed algorithm moves the STEPC insertion positions to minimize inserted STEPC counts. We apply a max-flow and min-cut approach to determine the optimal positions of inserted STEPCs and reduce the required number of STEPCs to 1/10-1/80 and their area to 1/5-1/8 compared with a naive algorithm. Furthermore, our algorithm realizes 1.12X-1.5X overclocking compared with just inserting STEPCs into several speed-paths.

DOI

Scopus

3

被引用数

(Scopus)
A score-based classification method for identifying Hardware-Trojans at gate-level netlists

Masaru Oya, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings -Design, Automation and Test in Europe, DATE 2015-April 465 - 470 2015年04月

　概要を見る

Recently, digital ICs are often designed by outside vendors to reduce design costs in semiconductor industry, which may introduce severe risks that malicious attackers implement Hardware Trojans (HTs) on them. Since IC design phase generates only a single design result, an RT-level or gate-level netlist for example, we cannot assume an HT-free netlist or a Golden netlist and then it is too difficult to identify whether a generated netlist is HT-free or HT-inserted. In this paper, we propose a score-based classification method for identifying HT-free or HT-inserted gate-level netlists without using a Golden netlist. Our proposed method does not directly detect HTs themselves in a gate-level netlist but a net included in HTs, which is called Trojan net, instead. Firstly, we observe Trojan nets from several HT-inserted benchmarks and extract several their features. Secondly, we give scores to extracted Trojan net features and sum up them for each net in benchmarks. Then we can find out a score threshold to classify HT-free and HT-inserted netlists. Based on these scores, we can successfully classify HT-free and HT-inserted netlists in all the Trust-HUB gate-level benchmarks. Experimental results demonstrate that our method successfully identify all the HT-inserted gate-level benchmarks to be 'HT-inserted' and all the HT-free gate-level benchmarks to be 'HT-free' in approximately three hours for each benchmark.

DOI

Scopus

121

被引用数

(Scopus)
Secure scan design using improved random order and its evaluations

Masaru Oya, Yuta Atobe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS 2015-February ( February ) 555 - 558 2015年02月

　概要を見る

Scan test using scan chains is one of the most important DFT techniques. However, scan-based attacks are reported which can retrieve the secret key in crypto circuits by using scan chains. Secure scan architecture is strongly required to protect scan chains from scan-based attacks. This paper proposes an improved version of random order as a secure scan architecture. In improved random order, a scan chain is partitioned into multiple sub-chains. The structure of the scan chain changes dynamically by selecting a subchain to scan out. Testability and security of the proposed improved random order are also discussed in the paper, and the implementation results demonstrate the effectiveness of the proposed method.

DOI

Scopus

8

被引用数

(Scopus)
In-situ timing monitoring methods for variation-resilient designs

Youhua Shi, Nozomu Togawa

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS 2015-February ( February ) 735 - 738 2015年02月

　概要を見る

With technology scaling, process, voltage, and temperature (PVT) variations pose great challenges on integrated circuit designs. Conventionally, LSI circuits are designed by adding pessimistic timing margin to guarantee 'always correct' operations even under worst-case conditions. However, due to the increasing PVT variations, unacceptable larger design guard band should be reserved to avoid timing errors on critical paths of circuits, which will therefore lead to very inefficient designs in terms of power and performance. For this reason, in-situ timing monitoring technique has gained great research interest. In this paper, we will review existing variation-resilient design techniques with particular emphasis on in-situ timing monitoring techniques including both detection and prediction-based methods. The effectiveness of in-situ timing monitoring techniques will be discussed. Finally, we show an example of in-situ timing monitoring technique called STEP with applications to general pipeline designs.

DOI

Scopus
An area-overhead-oriented monitoring-path selection algorithm for suspicious timing error prediction

Shinnosuke Yoshida, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS 2015-February ( February ) 300 - 303 2015年02月

　概要を見る

As process technologies advance, the importance of timing error correction techniques is increasing as well. In this paper, We propose an area-overhead-oriented monitoring-path selection algorithm for suspicious timing error prediction circuits (STEPCs). STEPC predicts timing errors by monitoring the middle points of several speed-paths in a circuit. However, we need many STEPCs with a high area overhead to predict timing errors in an overall circuit. Our proposed method moves the STEPC insertion positions to minimize the number of inserted STEPCs. We apply a max-flow and min-cut approach to determine the optimal positions of inserted STEPCs. Our proposed algorithm reduces the required number of STEPCs to 1/19 and their area to 1/5 compared with a naive algorithm. Furthermore, our algorithm realizes 2.25X overclocking compared with just inserting STEPCs into several speed-paths.

DOI

Scopus

1

被引用数

(Scopus)
Throughput driven check point selection in suspicious timing error prediction based designs

Hiroaki Igarashi, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

2014 IEEE 5th Latin American Symposium on Circuits and Systems, LASCAS 2014 - Conference Proceedings 2014年

　概要を見る

In this paper, a throughput-driven design technique is proposed, in which a suspicious timing error prediction circuit is inserted to monitor the signal transitions at some selected check points. Unlike previous works where timing errors are detected after their occurrence, the proposed method tries to use the real intermediate signal transitions for timing error prediction. The check point selection will affect both the maximal operation frequency and the suspicious timing error overestimation rate, both of which have an effect on the overall throughput, thus an analysis on the check point selection is also given. In our work, the circuit can be overclocked by a factor of 2 or more with ignorable area overhead while guarantees the always-correct output. © 2014 IEEE.

DOI

Scopus
Floorplan driven architecture and high-level synthesis algorithm for dynamic multiple supply voltages

Shin Ya Abe, Youhua Shi, Kimiyoshi Usami, Masao Yanagisawa, Nozomu Togawa

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E96-A ( 12 ) 2597 - 2611 2013年12月

　概要を見る

In this paper, we propose an adaptive voltage huddle-based distributed-register architecture (AVHDR architecture), which integrates dynamic multiple supply voltages and interconnection delay into high-level synthesis. In AVHDR architecture, voltages can be dynamically assigned for energy reduction. In other words, low supply voltages are assigned to non-critical operations, and leakage power is cut off by turning off the power supply to the sleeping functional units. Next, an AVHDR-based high-level synthesis algorithm is proposed. Our algorithm is based on iterative improvement of scheduling/binding and floorplanning. In the iteration process, the modules in each huddle can be placed close to each other and the corresponding AVHDR architecture can be generated and optimized with floorplanning information. Experimental results show that on average our algorithm achieves 43.9% energy-saving compared with conventional algorithms.Copyright © 2013 The Institute of Electronics, Information and Communication Engineers.

DOI

Scopus

2

被引用数

(Scopus)
Secure scan design with dynamically configurable connection

Yuta Atobe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings of IEEE Pacific Rim International Symposium on Dependable Computing, PRDC 256 - 262 2013年

　概要を見る

Scan test is a powerful test technique which can control and observe the internal states of the circuit under test through scan chains. However, it has been reported that it's possible to retrieve secret keys from cryptographic LSIs through scan chains. Therefore new secure test methods are required to satisfy both testability and security requirements. In this paper, a secure scan design is proposed to achieve adequate security requirement as a countermeasure against scan-based attacks, while still maintain high testability like normal scan testing. In our method, the internal scan chain is divided into several sub chains, and the connection order of sub chains can be dynamically changed. In addition, how to decide the connection order of those sub chains so that it can't be identified by an attacker is also proposed in this paper. The proposed method is implemented on an AES circuit to show its effectiveness, and a security analysis is also given to show how the proposed approach can be used as a countermeasure against those known scan-based attacks. © 2013 IEEE.

DOI

Scopus

35

被引用数

(Scopus)
Suspicious timing error prediction with in-cycle clock gating

Youhua Shi, Hiroaki Igarashi, Nozomu Togawa, Masao Yanagisawa

Proceedings - International Symposium on Quality Electronic Design, ISQED 335 - 340 2013年

　概要を見る

Conventionally, circuits are designed to add pessimistic timing margin to solve delay variation problems, which guarantees 'always correct' operations. However, due to the fact that such a worst-case condition occurs rarely, the traditional pessimistic design method is therefore becoming one of the main obstacles for designers to achieve higher performance and/or ultra-low power consumption. By monitoring timing error occurrence during circuit operation, adaptive timing error detection and recovery methods have gained wide interests recently as a promising solution. As an extension of existing research, in this paper, we propose a suspicious timing error prediction method for performance or energy efficiency improvement in pipeline designs. Experimental results show that with when compared with typical margin designs, the proposed method can 1) achieve up to 1.41X throughput improvement with in-situ timing error prediction ability; and 2) allow the design to be overclocked by up to 1.88X with 'always correct' outputs. © 2013 IEEE.

DOI

Scopus

17

被引用数

(Scopus)
An energy-efficient high-level synthesis algorithm incorporating interconnection delays and dynamic multiple supply voltages

Shin Ya Abe, Youhua Shi, Kimiyoshi Usami, Masao Yanagisawa, Nozomu Togawa

2013 International Symposium on VLSI Design, Automation, and Test, VLSI-DAT 2013 2013年

　概要を見る

In this paper, we propose an adaptive voltage huddle-based distributed-register architecture (AVHDR architecture) that integrates dynamic multiple supply voltages and interconnection delays into high-level synthesis. Next, we propose a high-level synthesis algorithm for AVHDR architectures. Our algorithm is based on iterative improvement of scheduling/binding and floorplanning. In the iteration process, huddles, each of which abstracts modules placed close to each other, are naturally generated using floorplanning. Low-supply voltages are assigned to non-critical operations, and leakage power is cut off by turning off the power supply to the sleeping functional units. Experimental results show that our algorithm achieves 50% energy-saving compared with conventional algorithms. © 2013 IEEE.

DOI

Scopus
Concurrent faulty clock detection for crypto circuits against clock glitch based DFA

Hiroaki Igarashi, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings - IEEE International Symposium on Circuits and Systems 1432 - 1435 2013年

　概要を見る

In this paper, a concurrent faulty clock detection method is proposed for crypto circuits against clock glitch based differential fault analysis (DFA). In the proposed method, a nonlogic buffer-based delay chain is inserted, and then by monitoring the delay along the delay chain, a possible clock glitch based DFA can be detected. Experimental results on an AES circuit show that the proposed method can successfully detect clock glitch based attacks, and the required area overhead is only 0.47% that is much smaller than previous works. © 2013 IEEE.

DOI

Scopus

19

被引用数

(Scopus)
Scan-based attack on AES through round registers and its countermeasure

Youhua Shi, Nozomu Togawa, Masao Yanagisawa

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E95-A ( 12 ) 2338 - 2346 2012年12月

　概要を見る

Scan-based side channel attack on hardware implementations of cryptographic algorithms has shown its great security threat. Unlike existing scan-based attacks, in our work we observed that instead of the secret-related-registers, some non-secret registers also carry the potential of being misused to help a hacker to retrieve secret keys. In this paper, we first present a scan-based side channel attack method on AES by making use of the round counter registers, which are not paid attention to in previous works, to show the potential security threat in designs with scan chains. And then we discussed the issues of secure DFT requirements and proposed a secure scan scheme to preserve all the advantages and simplicities of traditional scan test, while significantly improve the security with ignorable design overhead, for crypto hardware implementations. Copyright © 2012 The Institute of Electronics, Information and Communication Engineers.

DOI

Scopus

1

被引用数

(Scopus)
MH ⁴: Multiple-supplyvoltages aware high-level synthesis for highintegrated and highfrequency circuits for HDR architectures

Shin Ya Abe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

IEICE Electronics Express 9 ( 17 ) 1414 - 1422 2012年

　概要を見る

In this paper, we propose multiple-supply-voltages aware high-level synthesis algorithm for HDR architectures which realizes high-speed and high-efficient circuits. We propose three new techniques: virtual area estimation, virtual area adaptation, and floorplanning- directed huddling, and integrate them into our HDR architecture synthesis algorithm. Virtual area estimation/adaptation effectively estimates a huddle area by gradually reducing it during iterations, which improves the convergence of our algorithm. Floorplanningdirected huddling determines huddle composition very effectively by performing floorplanning and functional unit assignment inside huddles simultaneously. Experimental results show that our algorithm achieves about 29% run-time-saving compared with the conventional algorithms, and obtains a solution which cannot be obtained by our original algorithm even if a very tight clock constraint is given. © IEICE 2012.

DOI

Scopus

14

被引用数

(Scopus)
Dynamically changeable secure scan architecture against scan-based side channel attack

Yuta Atobe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

ISOCC 2012 - 2012 International SoC Design Conference 155 - 158 2012年

　概要を見る

Scan test which is one of the useful design for testability techniques is effective for LSIs including cryptographic circuit. It can observe and control the internal states of the circuit under test by using scan chain. However, scan chain presents a significant security risk of information leakage for scan-based attacks which retrieves secret keys of cryptographic LSIs. In this paper, a secure scan architecture against scan-based attack which still has high testability is proposed. In our method, scan data is dynamically changed by adding the latch to any FFs in the scan chain. We show that by using proposed method, neither the secret key nor the testability of an RSA circuit implementation is compromised, and the effectiveness of the proposed method. © 2012 IEEE.

DOI

Scopus

42

被引用数

(Scopus)
State dependent scan flip-flop with key-based configuration against scan-based side channel attack on RSA circuit

Yuta Atobe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS 607 - 610 2012年

　概要を見る

Scan test is one of the useful design for testability techniques, which can detect circuit failure efficiently. However, it has been reported that it's possible to retrieve secret keys from cryptographic LSIs through scan chains. Therefore testability and security contradicted to each other, and there is a need to an efficient design for testability circuit so as to satisfy both testability and security requirement. In this paper, a secure scan architecture against scan-based attack is proposed to achieve high security without compromising the testability. In our method, scan structure is dynamically changed by adding the latch to any FFs in the scan chain. We made an analysis on an RSA circuit implementation to show the effectiveness of the proposed method and discussed how our approach is resistant to scan-based attack. © 2012 IEEE.

DOI

Scopus

19

被引用数

(Scopus)
State-dependent changeable scan architecture against scan-based side channel attacks

Ryuta Nara, Hiroshi Atobe, Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

ISCAS 2010 - 2010 IEEE International Symposium on Circuits and Systems: Nano-Bio Circuit Fabrics and Systems 1867 - 1870 2010年

　概要を見る

Scan test is a powerful and popular test technique because it can control and observe the internal states of the circuit under test. However, scan path would be used to discover the internals of crypto hardware, which presents a significant security risk of information leakage. An interesting design-for-test technique by inserting inverters into the internal scan path to complicate the scan structure has been recently presented. Unfortunately, it still carries the potential of being attacked through statistical analysis of the information scanned out from chips. Therefore, in this paper we propose secure scan architecture, called dynamic variable secure scan, against scan-based side channel attack. The modified scan flip-flops are state-dependent, which could cause the output of each State-dependent Scan FF to be inverted or not so as to make it more difficult to discover the internal scan architecture. ©2010 IEEE.

DOI

Scopus

10

被引用数

(Scopus)
VLSI implementation of a fast intra prediction algorithm for H.264/AVC encoding

Youhua Shi, Kenta Tokumitsu, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS 1139 - 1142 2010年

　概要を見る

Intra-frame coding is one of the most important technologies in H.264/AVC, which made significant contributions to the enhancement of coding efficiency of H.264/AVC at the cost of computation complexity. To address this problem, in this paper we present an efficient VLSI implementation of a computation efficient intra prediction algorithm for H.264/AVC encoding. Unlike most of existing fast intra-mode selection techniques, in the proposed method the directional differences are computed using a few selected original pixels to obtain the candidate modes with the minimal direction cost. The proposed method is hardware-friendly and provides more processing parallelism for H.264 intra-frame encoding with less overhead and less power consumption, which is expected to be utilized as a favourable accelerator hardware module in a real-time HDTV (1920×1080p) H.264 encoder. © 2010 IEEE.

DOI

Scopus

2

被引用数

(Scopus)
Design-for-secure-test for crypto cores

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings - International Test Conference 2009年12月

　概要を見る

Scan technology carries the potential of being misused as a "side channel" to leak out the secret information of crypto cores. To address such a design challenge, this paper proposes a design-for-secure-test (DFST) solution for crypto cores by adding a stimuli-launched flip-flop into the traditional scan flip-flop to maintain the high test quality without compromising the security. © 2009 IEEE.

DOI

Scopus

7

被引用数

(Scopus)
X-handling for current X-tolerant compactors with more unknowns and maximal compaction

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E92-A ( 12 ) 3119 - 3127 2009年12月

　概要を見る

This paper presents a novel X-handling technique, which removes the effect of unknowns on compacted test response with maximal compaction ratio. The proposed method combines with the current X-tolerant compactors and inserts masking cells on scan paths to selectively mask X's. By doing this, the number of unknown responses in each scan-out cycle could be reduced to a reasonable level such that the target X-tolerant compactor would tolerate with guaranteed possible error detection. It guarantees no test loss due to the effect of X's, and achieves the maximal compaction that the target response compactor could provide as well. Moreover, because the masking cells are only inserted on the scan paths, it has no performance degradation of the designs. Experimental results demonstrate the effectiveness of the proposed method. Copyright © 2009 The Institute of Electronics, Information and Communication Engineers.

DOI

Scopus
Unified dual-radix architecture for scalable montgomery multiplications in GF(P) and GF(2ⁿ)

Kazuyuki Tanimura, Ryuta Nara, Shunitsu Kohara, Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E92-A ( 9 ) 2304 - 2317 2009年09月

　概要を見る

Modular multiplication is the most dominant arithmetic operation in elliptic curve cryptography (ECC), that is a type of publickey cryptography. Montgomery multiplier is commonly used to compute the modular multiplications and requires scalability because the bit length of operands varies depending on its security level. In addition, ECC is performed in GF(P) or GF(2n), and unified architecture for multipliers in GF(P) and GF(2n) is required. However, in previous works, changing frequency is necessary to deal with delay-time difference between GF ( P) and GF(2n) multipliers because the critical path of the GF(P) multiplier is longer. This paper proposes unified dual-radix architecture for scalable Montgomery multiplications in GF(P) and GF(2n). This proposed architecture unifies four parallel radix-216 multipliers in GF(P) and a radix-264 multiplier in GF(2n) into a single unit. Applying lower radix to GF(P) multiplier shortens its critical path and makes it possible to compute the operands in the two fields using the same multiplier at the same frequency so that clock dividers to deal with the delay-time difference are not required. Moreover, parallel architecture in GF(P) reduces the clock cycles increased by dual-radix approach. Consequently, the proposed architecture achieves to compute a GF(P) 256-bit Montgomery multiplication in 0.28 μs. The implementation result shows that the area of the proposal is almost the same as that of previous works: 39 kgates. Copyright © 2009 The Institute of Electronics, Information and Communication Engineers.

DOI

Scopus
A secure test technique for pipelined advanced encryption standard

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEICE Transactions on Information and Systems E91-D ( 3 ) 776 - 780 2008年03月

　概要を見る

In this paper, we presented a Design-for-Secure-Test (DFST) technique for pipelined AES to guarantee both the security and the test quality during testing. Unlike previous works, the proposed method can keep all the secrets inside and provide high test quality and fault diagnosis ability as well. Furthermore, the proposed DFST technique can significantly reduce test application time, test data volume, and test generation effort as additional benefits. Copyright © 2008 The Institute of Electronics, Information and Communication Engineers.

DOI

Scopus

3

被引用数

(Scopus)
A unified test compression technique for scan stimulus and unknown masking data with no test loss

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E91-A ( 12 ) 3514 - 3523 2008年

　概要を見る

This paper presents a unified test compression technique for scan stimulus and unknown masking data with seamless integration of test generation, test compression and all unknown response masking for high quality manufacturing test cost reduction. Unlike prior test compression methods, the proposed approach considers the unknown responses during test pattern generation procedure, and then selectively encodes the less specified bits (either Is or Os) in each scan slice for compression while at the same time masks the unknown responses before sending them to the response compactor. The proposed test scheme could dramatically reduce test data volume as well as the number of required test channels by using only c tester channels to drive N internal scan chains, where c = [10g2 N + 2- In addition, because all the unknown responses could be exactly masked before entering into the response compactor, test loss due to unknown responses would be eliminated. Experimental results on both benchmark circuits and larger designs indicated the effectiveness of the proposed technique. Copyright © 2008 The Institute of Electronics, Information and Communication Engineers.

DOI

Scopus
Scalable unified dual-radix architecture for Montgomery multiplication in GF{P) and GF(2ⁿ)

Kazuyuki Tanimura, Ryuta Nara, Shunitsu Kohara, Kazunori Shimizu, Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC 697 - 702 2008年

　概要を見る

Modular multiplication is the most dominant arithmetic operation in elliptic curve cryptography (ECC), which is a type of public-key cryptography. Montgomery multiplication is commonly used as a technique for the modular multiplication and required scalability since the bit length of operands varies depending on the security levels. Also, ECC is performed in GF(P) or GF(2 n), and unified architectures for GF(P) and GF(2n) multiplier are needed. However, in previous works, changing frequency or dual-radix architecture is necessary to deal with delay-time difference between GF(P) and GF(2n) circuits of the multiplier because the critical path of GF(P) circuit is longer. This paper proposes a scalable unified dual-radix architecture for Montgomery multiplication in GF(P) and GF(2n). The proposed architecture unifies 4 parallel radix-216 multipliers in GF(P) and a radix-264 multiplier in GF(2n) into a single unit. Applying lower radix to GF(P) multiplier shortens its critical path and makes it possible to compute the operands in the two fields using the same multiplier at the same frequency so that clock dividers to deal with the delay-time difference are not required. Moreover, parallel architecture in GF(P) reduces the clock cycles increased by dual-radix approach. Consequently, the proposed architecture achieves to compute GF(P) 256-bit Montgomery multiplication in 0.23μs. ©2008 IEEE.

DOI

Scopus

4

被引用数

(Scopus)
GECOM: Test data compression combined with all unknown response masking

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC 577 - 582 2008年

　概要を見る

This paper introduces GECOM technology, a novel test compression method with seamless integration of test GEneration, test COmpression (i.e. integrated compression on scan stimulus and masking bits) and all unknown scan responses Masking for manufacturing test cost reduction. Unlike most of prior methods, the proposed method considers the unknown responses during ATPG procedure and selectively encodes the specified 1 or 0 bits (either 1s or 0s) in scan slices for compression while at the same time masks the unknown responses before sending them to the response compactor. The proposed GECOM technology consists of GECOM architecture and GECOM ATPG technique. In the GECOM architecture, for a circuit with N internal scan chains, only c tester channels, where c = [log2 N] +2, are required. GECOM ATPG generates test patterns for the GECOM architecture thus not only the scan inputs could be efficiently compressed but also all the unknown responses would be masked. Experimental results on both benchmark circuits and real industrial designs indicated the effectiveness of the proposed GECOM technique. ©2008 IEEE.

DOI

Scopus

5

被引用数

(Scopus)
Unknown response masking with minimized observable response loss and mask data

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS 1779 - 1781 2008年

　概要を見る

This paper presents a new unknown response masking technique to minimize the effect on test loss due to over-masking. Unlike previous works where the scan responses are masked before entering the response compactor, the proposed method could mask the Xs when they are transformed on the scan path. Meanwhile, the masking cells are inserted along the scan paths, thus they would have no degradation on the performance of the designs. In addition, the test data required to mask unknown responses is only one bit for each test pattern. Experimental results show the effectiveness of the proposed method. © 2008 IEEE.

DOI

Scopus
Design for secure test - A case study on pipelined advanced encryption standard

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings - IEEE International Symposium on Circuits and Systems 149 - 152 2007年

　概要を見る

Cryptography plays an important role in the security of data transmission. To ensure the correctness of crypto hardware, we should conduct testing at fabrication and infield. However, the state-of-the-art scan-based test techniques, to achieve high test qualities, need to increase the testability of the circuit under test, which carries a potential of being misused to reveal the secret information of the crypto hardware. Thus, to develop efficient test strategies for crypto hardware to achieve high test quality without compromising security becomes an important task. In this paper we discuss the development of a Design-forSecure-Test (DFST) technique for pipelined AES to overcome the above contradiction between security and test quality in testing crypto hardware. Unlike previous works, the proposed method can keep all the secrets inside and provide high test quality and fault diagnosis ability as well. Furthermore, the proposed DFST technique can significantly reduce test application time, test data volume, and test generation effort as additional benefits. © 2007 IEEE.

DOI

Scopus

3

被引用数

(Scopus)
Selective low-care coding: A means for test data compression in circuits with multiple scan chains

Youhua Shi, Nozomu Togawa, Shinji Kimura, Masao Yanagisawa, Tatsuo Ohtsuki

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E89-A ( 4 ) 996 - 1003 2006年04月

　概要を見る

This paper presents a test input data compression technique, Selective Low-Care Coding (SLC), which can he used to significantly reduce input test data volume as well as the external test channel requirement for multiscan-based designs. In the proposed SLC scheme, we explored the linear dependencies of the internal scan chains, and instead of encoding all the specified bits in test cubes, only a smaller amount of specified bits are selected for encoding, thus greater compression can be expected. Experiments on the larger benchmark circuits show drastic reduction in test data volume with corresponding savings on test application time can be indeed achieved even for the well-compacted test set. Copyright © 2006 The Institute of Electronics, Information and Communication Engineers.

DOI

Scopus

2

被引用数

(Scopus)
FCSCAN: An efficient multiscan-based test compression technique for test cost reduction

Youhua Shi, Nozomu Togawa, Shinji Kimura, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC 2006 653 - 658 2006年

　概要を見る

This paper proposes a new multiscan-based test input data compression technique by employing a Fan-out Compression Scan Architecture (FCSCAN) for test cost reduction. The basic idea of FCSCAN is to target the minority specified 1 or 0 bits (either 1 or 0) in scan slices for compression. Due to the low specified bit density in test cube set, FCSCAN can significantly reduce input test data volume and the number of required test channels so as to reduce test cost. The FCSCAN technique is easy to be implemented with small hardware overhead and does not need any special ATPG for test generation. In addition, based on the theoretical compression efficiency analysis, improved procedures are also proposed for the FCSCAN to achieve further compression. Experimental results on both benchmark circuits and one real industrial design indicate that drastic reduction in test cost can be indeed achieved. © 2006 IEEE.
Low-cost IP core test using multiple-mode loading scan chain and scan chain clusters

Gang Zeng, Youhua Shi, Toshinori Takabatake, Masao Yanagisawa, Hideo Ito

Proceedings - IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems 136 - 144 2006年

　概要を見る

A fixing-shifting encoding (FSE) method is proposed to reduce test cost of IP cores. The FSE method reduces test cost by supporting multiple-mode loading test data, i.e., parallel loading, left-direction, and right-direction serial loading for each test slice data. Furthermore, the FSE that utilizes only two test channels can support a large number of internal scan chains and achieve further reduction in test cost by combining with scan chain clustering method. As a non-intrusive and automatic test pattern generation (ATPG) independent solution, the approach is applicable to IP core testing because it requires neither redesign of the core under test (CUT) nor running any additional ATPG for the encoding procedure. In addition, the decoder has low hardware overhead, and its design is independent of the CUT. Experimental results for some large ISCAS 89 benchmarks and an industry ASIC design have proven the efficiency of the proposed approach. © 2006 IEEE.

DOI

Scopus

2

被引用数

(Scopus)
Low power test compression technique for designs with multiple scan chains

Youhua Shi, Nozomu Togawa, Shinji Kimura, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asian Test Symposium 2005 386 - 389 2005年

　概要を見る

This paper presents a new DFT technique that can significantly reduce test data volume as well as scan-in power consumption for multiscan-based designs. It can also help to reduce test time and tester channel requirements with small hardware overhead. In the proposed approach, we start with apre-computed test cube set and fill the don't-cares with proper values for joint reduction of test data volume and scan power consumption. In addition we explore the linear dependencies of the scan chains to construct a fanout structure only with inverters to achieve further compression. Experimental results for the larger ISCAS'89 benchmarks show the efficiency of the proposed technique. © 2005 IEEE.

DOI

Scopus

17

被引用数

(Scopus)
A hybrid dictionary test data compression for multiscan-based designs

Youhua Shi, Shinji Kimura, Masao Yanagisawa, Tatsuo Ohtsuki

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E87-A ( 12 ) 3193 - 3199 2004年12月

　概要を見る

In this paper, we present a test data compression technique to reduce test data volume for multiscan-based designs. In our method the internal scan chains are divided into equal sized groups and two dictionaries were build to encode either an entire slice or a subset of the slice. Depending on the codeword, the decompressor may load all scan chains or may load only a group of the scan chains, which can enhance the effectiveness of dictionary-based compression. In contrast to previous dictionary coding techniques, even for the CUT with a large number of scan chains, the proposed approach can achieve satisfied reduction in test data volume with a reasonable smaller dictionary. Experimental results showed the proposed test scheme works particularly well for the large ISCAS'89 benchmarks.
A selective scan chain reconfiguration through run-length coding for test data compression and scan power reduction

Youhua Shi, Shinji Kimura, Masao Yanagisawa, Tatsuo Ohtsuki

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E87-A ( 12 ) 3208 - 3214 2004年12月

　概要を見る

Test data volume and power consumption for scan-based designs are two major concerns in system-on-a-chip testing. However, test set compaction by filling the don't-cares will invariably increase the scan-in power dissipation for scan testing, then the goals of test data reduction and low-power scan testing appear to be conflicted. Therefore, in this paper we present a selective scan chain reconfiguration method for test data compression and scan-in power reduction. The proposed method analyzes the compatibility of the internal scan cells for a given test set and then divides the scan cells into compatible classes. After the scan chain reconfiguration a dictionary is built to indicate the run-length of each compatible class and only the scan-in data for each class should be transferred from the ATE to the CUT so as to reduce test data volume. Experimental results for the larger ISCAS'89 benchmarks show that the proposed approach overcomes the limitations of traditional run-length coding techniques, and leads to highly reduced test data volume with significant power savings during scan testing in all cases.
Reducing test data volume for multiscan-based designs through single/sequence mixed encoding

Youhua Shi, Shinji Kimura, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Midwest Symposium on Circuits and Systems 2 2004年

　概要を見る

This paper presents a new test data compression technique for multiscan-based designs through dictionary-based encoding on the single or sequences scan-inputs. In spite of its simplicity, it achieves significant reduction in test data volume. Unlike some previous approaches on test data compression, our approach eliminates the need for additional synchronization and handshaking between the CUT and the ATE, so it is especially suitable to be integrated in a low cost test scheme for SoC test In addition in contrast to previous dictionary-based coding techniques, even for the CUT with a small number of scan chains, the proposed approach can achieve satisfied reduction in test data volume. Experimental results showed the proposed test scheme works particularly well for the large ISCAS'89 benchmarks.
Alternative run-length coding through scan chain reconfiguration for joint minimization of test data volume and power consumption in scan test

Youhua Shi, Shinji Kimura, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asian Test Symposium 432 - 437 2004年

　概要を見る

Test data volume and scan power are two major concerns in SoC test. In this paper we present an alternative run-length coding method through scan chain reconfiguration to reduce both test data volume and scan-in power consumption. The proposed method analyzes the compatibility of the internal scan cells for a given test set and then divides the scan cells into compatible classes. To extract the compatible scan cells we apply a heuristic algorithm by solving the graph coloring problem; and then a simple greedy algorithm is used to configure the scan chain for the minimization of scan power. Experimental results for the larger IS-CAS'89 benchmarks show that the proposed approach leads to highly reduced test data volume with significant power savings during scan test.

DOI

Scopus

2

被引用数

(Scopus)
A Built-in Reseeding Technique for LFSR-Based Test Pattern Generation

Youhua Shi, Zhe Zhang, Shinji Kimura, Masao Yanagisawa, Tatsuo Ohtsuki

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E86-A ( 12 ) 3056 - 3062 2003年12月

　概要を見る

Reseeding technique is proposed to improve the fault coverage in pseudo-random testing. However most of previous works on reseeding is based on storing the seeds in an external tester or in a ROM. In this paper we present a built-in reseeding technique for LFSR-based test pattern generation. The proposed structure can run both in pseudorandom mode and in reseeding mode. Besides, our method requires no storage for the seeds since in reseeding mode the seeds can be generated automatically in hardware. In this paper we also propose an efficient grouping algorithm based on simulated annealing to optimize test vector grouping. Experimental results for benchmark circuits indicate the superiority of our technique against other reseeding methods with respect to test length and area overhead. Moreover, since the theoretical properties of LFSRs are preserved, our method could be beneficially used in conjunction with any other techniques proposed so far.
Multiple test set generation method for LFSR-based BIST

Youhua Shi, Zhe Zhang

Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC 2003-January 863 - 868 2003年

　概要を見る

In this paper we propose a new reseeding method for LFSR-based test pattern generation suitable for circuits with random pattern resistant faults. The character of our method is that the proposed test pattern generator (TPG) can work both in normal LFSR mode, to generate pseudorandom test vectors, and in jumping mode to make the TPG jump from a state to the required state (seed of next group). Experimental results indicate that its superiority against other known reseeding techniques with respect to the length of the test sequence and the required area overhead.

DOI

Scopus

9

被引用数

(Scopus)
New low power BIST methodology by altering the structure of linear feedback shift registers

Rui Li, Chen Hu, Jun Yang, Zhe Zhang, Youhua Shi

Dianzi Qijian/Journal of Electron Devices 25 ( 3 ) 245 2002年09月
Simulated annealing algorithm applied in low power BIST scheme

Chen Hu, Zhe Zhang, Youhua Shi, Jun Yang, Longxing Shi

Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition) 32 ( 2 ) 177 - 180 2002年03月

　概要を見る

An approach to approximately optimal group test vectors in a certain length of test patterns is proposed to decrease the number of test vectors based on simulated annealing algorithm. By the scheme of reseeding, this approach makes linear feedback shift register (LFSR) generate optimized groups of vectors, so as to reduce the power consumption without any loss of fault coverage. The experiment result shows that more than 70% power consumption can be reduced while keeping the fault coverage invariable. In addition, the test time is greatly shortened with decreased number of test vectors, which is important in real time device.
A new software for test logic optimization in DFT

Zhe Zhang, Chen Hu, Rui Li, Youhua Shi, Longxing Shi

International Conference on ASIC, Proceedings 654 - 657 2001年

　概要を見る

This paper presents a new software named ASIC2000TA developed for design for test (DFT) aiming at optimizing test logic. This software consists of two modules: Test analysis module and DFT module. Test analysis module can examine circuit's testability, generate test vectors and perform fault simulation, in which some algorithms are described. DFT module automatically inserts test logic in gate-level netlist, including full scan and partial scan, in which a greedy search algorithm is discussed. Electronic design intermediate format (EDIF) acts as an interface between ASIC2000TA and Cadence. An experiment of ASIC2000TA is presented at last.
A new self-test structure for at-speed test of crosstalk in SoC busses

Jun Yang, Chen Hu, Youhua Shi, Zhe Zhang, Longxing Shi

International Conference on ASIC, Proceedings 633 - 636 2001年

　概要を見る

The use of deep submicron process technologies increases the probability of crosstalk faults in the bus of system-on-a-chip (SoC). Though a self-testing methodology based on MA fault model has been developed, its area overhead of test logic is excessive. This paper proposed a new Error Detector (ED) and new test patterns whose overhead is decreased down to only approximate 50% of the old methodology on the average. A behavior fault simulation is used to validate the self-testing structure described in this paper.
A new low power BIST methodology by altering the structure of linear feedback shift registers

Rui Li, Chen Hu, Jun Yang, Zhe Zhang, Youhua Shi, Longxing Shi

International Conference on ASIC, Proceedings 646 - 649 2001年

　概要を見る

In this paper a new low power BIST methodology by altering the structure of linear feedback shift register (LFSR) is proposed. In pseudo-random test mode, the efficiency of the vectors decreases sharply as the test progresses. For low power consumption during test mode, the proposed approach ignores the non-detecting vectors by altering the structure of LFSR. Note that altering the structure of LFSR is efficient, and its has no impact on the fault coverage.

▼全件表示

講演・口頭発表等

CNNに対する概算加算器の適用と評価

井上雄太, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2018年05月
リーク削減による低消費電力SRAMの設計

伊藤卓, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2018年05月
低周波圧電エネルギーハーベスティングにおけるMOSs SP-SSHI手法

杉山貴紀, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2018年05月
Soft error tolerant latch designs with low power consumption (invited paper)

Saki Tajima, Nozomu Togawa, Masao Yanagisawa, Youhua Shi

Proceedings of International Conference on ASIC

発表年月： 2018年01月

　概要を見る

© 2017 IEEE. As semiconductor technology continues scaling down, the reliability issue has become much more critical than ever before. Unlike traditional hard-errors caused by permanent physical damage which can't be recovered in field, soft errors are caused by radiation or voltage/current fluctuations that lead to transient changes on internal node states, thus they can be viewed as temporary errors. However, due to the unpredictable occurrence of soft errors, it is desirable to develop soft error tolerant designs. For this reason, soft error tolerant design techniques have gained great research interest. In this paper, we will explain the soft error mechanism and then review the existing soft error tolerant design techniques with particular emphasis on SEH family because they can achieve low power consumption and small performance overhead as well.
A low cost and high speed CSD-based symmetric transpose block FIR implementation

Jinghao Ye, Youhua Shi, Nozomu Togawa, Masao Yanagisawa

Proceedings of International Conference on ASIC

発表年月： 2018年01月

　概要を見る

© 2017 IEEE. In this paper, a low cost and high speed CSD-based symmetric transpose block FIR design was proposed for low cost digital signal processing. First, the existing area-efficient CSD-based multiplier was optimized by considering the reusability and the symmetry of coefficients for area reduction. Second, the position of the input register was changed for high speed transpose block FIR processing in which half of the number of required multipliers can be saved. When compared with the existing block FIR designs, the proposed FIR design can increase the data rate from 238.66 MHz to 373.13 MHz while saving 10.89% area and 21.30% energy consumption as well.
内部ノードを利用したソフトエラー検出ラッチの設計

中垣直道, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2017年05月
C-elementを用いたソフトエラー耐性をもつSHCラッチの設計

田島咲季, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2017年05月
最大エラー距離に基づくGeAr回路の最適化

早水謙, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2017年05月
自己動力型スイッチング磁気変圧回路を用いたエネルギーハーベスティングシステム

川合洋平, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2017年05月
Improved monitoring-path selection algorithm for suspicious timing error prediction based timing speculation

Shinnosuke Yoshida, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings - 2015 IEEE 11th International Conference on ASIC, ASICON 2015

発表年月： 2016年07月

　概要を見る

© 2015 IEEE. As process technology is scaling down, timing speculation techniques such as Razor and STEP are emerged as alternative solutions to reduce required margins due to various variation effects. Unlike Razor, STEP is a prediction-based timing speculation method to predict suspicious timing errors before they really appear, and thus it can result in more performance improvement. Therefore, an improved monitoring-path selection algorithm for STEP-based timing speculation is proposed in this paper, in which candidate monitoring-paths are selected based on short path removement and path length estimation. Experimental results show that the proposed algorithm realizes an average of 1.71X overclocking compared with worst-case based designs.
A low-power soft error tolerant latch scheme

Saki Tajima, Youhua Shi, Nozomu Togawa, Masao Yanagisawa

Proceedings - 2015 IEEE 11th International Conference on ASIC, ASICON 2015

発表年月： 2016年07月

　概要を見る

© 2015 IEEE. As process technology continues scaling, low power and reliability of integrated circuits are becoming more critical than ever before. Particularly, due to the reduction of node capacitance and operating voltage for low power consumption, it makes the circuits more sensitive to high-energy particles induced soft errors. In this paper, a soft-error tolerant latch called TSPC-SEH is proposed for soft error tolerance with low power consumption. The simulation results show that the proposed TSPC-SEH latch can achieve up to 42% power consumption reduction and 54% delay improvement compared to the existing soft error tolerant SEH and DICE designs.
In-situ Trojan authentication for invalidating hardware-Trojan functions

Masaru Oya, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings - International Symposium on Quality Electronic Design, ISQED

発表年月： 2016年05月

　概要を見る

© 2016 IEEE. Due to the fact that we do not know who will create hardware Trojans (HTs), and when and where they would be inserted, it is very difficult to correctly and completely detect all the real HTs in untrusted ICs, and thus it is desired to incorporate in-situ HT invalidating functions into untrusted ICs as a countermeasure against HTs. This paper proposes an in-situ Trojan authentication technique for gate-level netlists to avoid security leakage. In the proposed approach, an untrusted IC operates in authentication mode and normal mode. In the authentication mode, an embedded Trojan authentication circuit monitors the bit-flipping count of a suspicious Trojan net within the pre-defined constant clock cycles and identify whether it is a real Trojan or not. If the authentication condition is satisfied, the suspicious Trojan net is validated. Otherwise, it is invalidated and HT functions are masked. By doing this, even untrusted netlists with HTs can still be used in the normal mode without security leakage. By setting the appropriate authentication condition using training sets from Trust-HUB gate-level benchmarks, the proposed technique invalidates successfully only HTs in the training sets. Furthermore, by embedding the in-situ Trojan authentication circuit into a Trojan-inserted AES crypto netlist, it can run securely and correctly even if HTs exist where its area overhead is just 1.5% with no delay overhead.
A delay variation and floorplan aware high-level synthesis algorithm with body biasing

Koki Igawa, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings - International Symposium on Quality Electronic Design, ISQED

発表年月： 2016年05月

　概要を見る

© 2016 IEEE. In this paper, we propose a delay variation and floorplan aware high-level synthesis algorithm with body biasing, which minimizes the average leakage energy of manufactured chips. To realize a floorplan-oriented high-level synthesis, we utilize a huddle-based distributed register architecture (HDR architecture), one of the DR architectures. HDR architecture divides the chip area into small partitions called a huddle and we can control a body bias voltage for every huddle. During high-level synthesis, we iteratively obtain expected leakage energy for every huddle when applying a body bias voltage. A huddle with smaller expected leakage energy contributes to reducing expected leakage energy of the entire circuit but can increase the latency. We assign CDFG nodes in critical paths to the huddles with larger expected leakage energy and those in non-critical paths to the huddles with smaller expected leakage energy. We expect to minimize the entire leakage energy in a manufactured chip without increasing its latency. Experimental results show that our algorithm reduces the average leakage energy by up to 38.9% without latency and yield degradation compared with typical-case design with body biasing.
高速かつ低電力なソフトエラー耐性をもつFast-SEHラッチの設計

田島咲季, 史又華, 戸川望, 柳澤政生

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2016年05月
A process-variation-aware multi-scenario high-level synthesis algorithm for distributed-register architectures

Koki Igawa, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

International System on Chip Conference

発表年月： 2016年02月

　概要を見る

© 2015 IEEE. In order to tackle a process-variation problem, we can define several scenarios, each of which corresponds to a particular LSI behavior, such as a typical-case scenario and a worst-case scenario. By designing a single LSI chip which realizes multiple scenarios simultaneously, we can have a process-variation-tolerant LSI chip. In this paper, we propose a process-variation-aware low-latency and multi-scenario high-level synthesis algorithm targeting new distributed-register architectures, called HDR architectures. We assume two scenarios, a typical-case scenario and a worst-case scenario, and realize them onto a single chip. We first schedule/bind each of the scenarios independently. After that, we commonize the scheduling/binding results for the typical-case and worst-case scenarios and thus generate a commonized area-minimized floorplan result. Experimental results show that our algorithm reduces the latency of the typical-case scenario by up to 50% without increasing the latency of the worst-case scenario, compared with several existing methods.
Scan-based side-channel attack against symmetric key ciphers using scan signatures

Mika Fujishiro, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings of the 2015 IEEE International Conference on Electron Devices and Solid-State Circuits, EDSSC 2015

発表年月： 2015年09月

　概要を見る

© 2015 IEEE. There are a number of studies on a side-channel attack which uses information exploited from the physical implementation of a cryptosystem. A scan-based side-channel attack utilizes scan chains, one of design-for-test techniques and retrieves the secret information inside the cryptosystem. In this paper, scan-based side-channel attack methods against symmetric key ciphers such as block ciphers and stream ciphers using scan signatures are presented to show the risk of scan-based attacks.
FPGA-based SHA-3 acceleration on a 32-bit processor via instruction set extension

Yi Wang, Youhua Shi, Chao Wang, Yajun Ha

Proceedings of the 2015 IEEE International Conference on Electron Devices and Solid-State Circuits, EDSSC 2015

発表年月： 2015年09月

　概要を見る

© 2015 IEEE. As embedded systems play more and more important roles Internet of Things (IoT), the integration of cryptographic functionalities is an urgent demand to ensure data and information security. Recently, Keccak was declared as the winner of the third generation of Secure Hashing Algorithm (SHA-3). However, implementing SHA-3 on a specific 32-bit processor failed to meet the performance requirement. On the other hand, implementing it as a cryptographic coprocessor consumes a lot of extra area and requires customized driver program. Although implementing Keccak on a 64-bit platform is more efficient, this platform is not suitable for embedded implementation. In this paper, we propose a novel SHA-3 implementation using instruction set extension based on a 32-bit LEON3 processor (an open source processor), with the goals of reducing execution cycles and code size. Experimental results show that the proposed design reduces around 87% execution cycles and 10.5% code size as compared to reference designs. Our design takes up only 9.44% extra area with negligible speed overhead compared to the standard LEON3 processor. Compared to the existing hardware accelerators, our proposed design occupies only half of area resources and does not require extra driver programs to be developed when integrated into the overall system.
A floorplan-aware high-level synthesis technique with delay-variation tolerance

Kazushi Kawamura, Yuta Hagio, Youhua Shi, Nozomu Togawa

Proceedings of the 2015 IEEE International Conference on Electron Devices and Solid-State Circuits, EDSSC 2015

発表年月： 2015年09月

　概要を見る

© 2015 IEEE. For realizing better trade-off between performance and yield rate in recent LSI designs, it is required to deal with increasing the ratios of interconnect delay as well as delay variation. In this paper, a novel floorplan-aware high-level synthesis technique with delay-variation tolerance is proposed. By utilizing floorplan-driven architectures, interconnect delays can be estimated and then handled even in high-level synthesis. Applying our technique enables to realize two scheduling/binding results (one is a non-delayed result and the other is a delayed result) simultaneously on a chip with small area/performance overhead, and either one of them can be selected according to the post-silicon delay variation. Experimental results demonstrate that our technique can reduce delayed scheduling/binding latency by up to 32.3% compared with conventional approaches.
A universal delay line circuit for variation resilient IC with self-calibrated time-to-digital converter

Shuai Shao, Youhua Shi, Wentao Dai, Jianyi Meng, Weiwei Shan

Proceedings of the 2015 IEEE International Conference on Electron Devices and Solid-State Circuits, EDSSC 2015

発表年月： 2015年09月

　概要を見る

© 2015 IEEE. A universal delay monitor used to imitate the real critical paths is developed for variation resilient integrated circuit. This monitor is constructed based on the different proportion of logic cells and interconnects. The delay of the monitor is detected by a time-to-digital converter which keeps the sampling results precise. To reduce the deviation of the sampling results caused by PVT, a novel time-to-digital converter with self-calibration mechanism is developed. This variation resilient method based adaptive voltage scaling is applied on an ARM7 based System on a Chip on 0.18 μm CMOS process with a 112M signoff frequency and an area of 1.3∗1.3 mm2. The simulation results show that it has a 43.42% gain of power consumption under FF corner, -25°C compared to the fixed 1.8 V traditional design.
A Score-Based Classification Method for Identifying Hardware-Trojans Inserted/Free Gate-Level Netlists

発表年月： 2015年03月
A score-based classification method for identifying Hardware-Trojans at gate-level netlists

Masaru Oya, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings -Design, Automation and Test in Europe, DATE

発表年月： 2015年01月

　概要を見る

© 2015 EDAA. Recently, digital ICs are often designed by outside vendors to reduce design costs in semiconductor industry, which may introduce severe risks that malicious attackers implement Hardware Trojans (HTs) on them. Since IC design phase generates only a single design result, an RT-level or gate-level netlist for example, we cannot assume an HT-free netlist or a Golden netlist and then it is too difficult to identify whether a generated netlist is HT-free or HT-inserted. In this paper, we propose a score-based classification method for identifying HT-free or HT-inserted gate-level netlists without using a Golden netlist. Our proposed method does not directly detect HTs themselves in a gate-level netlist but a net included in HTs, which is called Trojan net, instead. Firstly, we observe Trojan nets from several HT-inserted benchmarks and extract several their features. Secondly, we give scores to extracted Trojan net features and sum up them for each net in benchmarks. Then we can find out a score threshold to classify HT-free and HT-inserted netlists. Based on these scores, we can successfully classify HT-free and HT-inserted netlists in all the Trust-HUB gate-level benchmarks. Experimental results demonstrate that our method successfully identify all the HT-inserted gate-level benchmarks to be 'HT-inserted' and all the HT-free gate-level benchmarks to be 'HT-free' in approximately three hours for each benchmark.
In-situ timing monitoring methods for variation-resilient designs

Youhua Shi, Nozomu Togawa

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS

発表年月： 2015年01月

　概要を見る

© 2014 IEEE. With technology scaling, process, voltage, and temperature (PVT) variations pose great challenges on integrated circuit designs. Conventionally, LSI circuits are designed by adding pessimistic timing margin to guarantee 'always correct' operations even under worst-case conditions. However, due to the increasing PVT variations, unacceptable larger design guard band should be reserved to avoid timing errors on critical paths of circuits, which will therefore lead to very inefficient designs in terms of power and performance. For this reason, in-situ timing monitoring technique has gained great research interest. In this paper, we will review existing variation-resilient design techniques with particular emphasis on in-situ timing monitoring techniques including both detection and prediction-based methods. The effectiveness of in-situ timing monitoring techniques will be discussed. Finally, we show an example of in-situ timing monitoring technique called STEP with applications to general pipeline designs.
An area-overhead-oriented monitoring-path selection algorithm for suspicious timing error prediction

Shinnosuke Yoshida, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS

発表年月： 2015年01月

　概要を見る

© 2014 IEEE. As process technologies advance, the importance of timing error correction techniques is increasing as well. In this paper, We propose an area-overhead-oriented monitoring-path selection algorithm for suspicious timing error prediction circuits (STEPCs). STEPC predicts timing errors by monitoring the middle points of several speed-paths in a circuit. However, we need many STEPCs with a high area overhead to predict timing errors in an overall circuit. Our proposed method moves the STEPC insertion positions to minimize the number of inserted STEPCs. We apply a max-flow and min-cut approach to determine the optimal positions of inserted STEPCs. Our proposed algorithm reduces the required number of STEPCs to 1/19 and their area to 1/5 compared with a naive algorithm. Furthermore, our algorithm realizes 2.25X overclocking compared with just inserting STEPCs into several speed-paths.
Secure scan design using improved random order and its evaluations

Masaru Oya, Yuta Atobe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS

発表年月： 2015年01月

　概要を見る

© 2014 IEEE. Scan test using scan chains is one of the most important DFT techniques. However, scan-based attacks are reported which can retrieve the secret key in crypto circuits by using scan chains. Secure scan architecture is strongly required to protect scan chains from scan-based attacks. This paper proposes an improved version of random order as a secure scan architecture. In improved random order, a scan chain is partitioned into multiple sub-chains. The structure of the scan chain changes dynamically by selecting a subchain to scan out. Testability and security of the proposed improved random order are also discussed in the paper, and the implementation results demonstrate the effectiveness of the proposed method.
In-situ Timing Monitoring Methods for Variation-Resilient Designs

発表年月： 2014年11月
An Area-Overhead-Oriented Monitoring-Path Selection Algorithm for Suspicious Timing Error Prediction

発表年月： 2014年11月
Secure Scan Design Using Improved Random Order and its Evaluations

発表年月： 2014年11月
回路面積を考慮したSuspicious Timing Error Prediction回路の挿入位置決定手法

吉田慎之介, 史又華, 柳澤政生

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2014年08月
InTimeTune: A Throughput Driven Timing Speculation Architecture for Overscaled Designs

発表年月： 2014年06月
Throughput Driven Check Point Selection in Suspicious Timing Error Prediction based Designs

発表年月： 2014年02月
Throughput driven check point selection in suspicious timing error prediction based designs

Hiroaki Igarashi, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

2014 IEEE 5th Latin American Symposium on Circuits and Systems, LASCAS 2014 - Conference Proceedings

発表年月： 2014年01月

　概要を見る

In this paper, a throughput-driven design technique is proposed, in which a suspicious timing error prediction circuit is inserted to monitor the signal transitions at some selected check points. Unlike previous works where timing errors are detected after their occurrence, the proposed method tries to use the real intermediate signal transitions for timing error prediction. The check point selection will affect both the maximal operation frequency and the suspicious timing error overestimation rate, both of which have an effect on the overall throughput, thus an analysis on the check point selection is also given. In our work, the circuit can be overclocked by a factor of 2 or more with ignorable area overhead while guarantees the always-correct output. © 2014 IEEE.
Secure Scan Design with Dynamically Configurable Connection

発表年月： 2013年12月
Predication based Timing Speculation Technique for Throughput Improvement

発表年月： 2013年11月
Concurrent faulty clock detection for crypto circuits against clock glitch based DFA

Hiroaki Igarashi, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings - IEEE International Symposium on Circuits and Systems

発表年月： 2013年09月

　概要を見る

In this paper, a concurrent faulty clock detection method is proposed for crypto circuits against clock glitch based differential fault analysis (DFA). In the proposed method, a nonlogic buffer-based delay chain is inserted, and then by monitoring the delay along the delay chain, a possible clock glitch based DFA can be detected. Experimental results on an AES circuit show that the proposed method can successfully detect clock glitch based attacks, and the required area overhead is only 0.47% that is much smaller than previous works. © 2013 IEEE.
An energy-efficient high-level synthesis algorithm incorporating interconnection delays and dynamic multiple supply voltages

Shin Ya Abe, Youhua Shi, Kimiyoshi Usami, Kimiyoshi Usami, Kimiyoshi Usami, Masao Yanagisawa, Masao Yanagisawa, Nozomu Togawa

2013 International Symposium on VLSI Design, Automation, and Test, VLSI-DAT 2013

発表年月： 2013年08月

　概要を見る

In this paper, we propose an adaptive voltage huddle-based distributed-register architecture (AVHDR architecture) that integrates dynamic multiple supply voltages and interconnection delays into high-level synthesis. Next, we propose a high-level synthesis algorithm for AVHDR architectures. Our algorithm is based on iterative improvement of scheduling/binding and floorplanning. In the iteration process, huddles, each of which abstracts modules placed close to each other, are naturally generated using floorplanning. Low-supply voltages are assigned to non-critical operations, and leakage power is cut off by turning off the power supply to the sleeping functional units. Experimental results show that our algorithm achieves 50% energy-saving compared with conventional algorithms. © 2013 IEEE.
ランダムオーダースキャンによるセキュアスキャン設計

跡部悠太, 史又華, 柳澤政生

回路とシステムワークショップ論文集 Workshop on Circuits and Systems

発表年月： 2013年07月
Suspicious timing error prediction with in-cycle clock gating

Youhua Shi, Hiroaki Igarashi, Nozomu Togawa, Masao Yanagisawa

Proceedings - International Symposium on Quality Electronic Design, ISQED

発表年月： 2013年07月

　概要を見る

Conventionally, circuits are designed to add pessimistic timing margin to solve delay variation problems, which guarantees 'always correct' operations. However, due to the fact that such a worst-case condition occurs rarely, the traditional pessimistic design method is therefore becoming one of the main obstacles for designers to achieve higher performance and/or ultra-low power consumption. By monitoring timing error occurrence during circuit operation, adaptive timing error detection and recovery methods have gained wide interests recently as a promising solution. As an extension of existing research, in this paper, we propose a suspicious timing error prediction method for performance or energy efficiency improvement in pipeline designs. Experimental results show that with when compared with typical margin designs, the proposed method can 1) achieve up to 1.41X throughput improvement with in-situ timing error prediction ability; and 2) allow the design to be overclocked by up to 1.88X with 'always correct' outputs. © 2013 IEEE.
Floorplan Driven Architectures and High-level Synthesis Algorithm for Dynamic Multiple Supply Voltages

発表年月： 2013年06月
Concurrent Faulty Clock Detection for Crypto Circuits Against Clock Glitch Based DFA

発表年月： 2013年05月
DR24 An Energy-efficient High-level Synthesis Algorithm Incorporating Interconnection Delays and Dynamic Multiple Supply Voltages

発表年月： 2013年04月
Suspicious Timing Error Detection and Recovery with In-Cycle Clock Gating

発表年月： 2013年03月
Secure scan design with dynamically configurable connection

Yuta Atobe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

Proceedings of IEEE Pacific Rim International Symposium on Dependable Computing, PRDC

発表年月： 2013年01月

　概要を見る

Scan test is a powerful test technique which can control and observe the internal states of the circuit under test through scan chains. However, it has been reported that it's possible to retrieve secret keys from cryptographic LSIs through scan chains. Therefore new secure test methods are required to satisfy both testability and security requirements. In this paper, a secure scan design is proposed to achieve adequate security requirement as a countermeasure against scan-based attacks, while still maintain high testability like normal scan testing. In our method, the internal scan chain is divided into several sub chains, and the connection order of sub chains can be dynamically changed. In addition, how to decide the connection order of those sub chains so that it can't be identified by an attacker is also proposed in this paper. The proposed method is implemented on an AES circuit to show its effectiveness, and a security analysis is also given to show how the proposed approach can be used as a countermeasure against those known scan-based attacks. © 2013 IEEE.
State Dependent Scan Flip-Flop with Key-Based Configuration against Scan-Based Side Channel Attack on RSA Circuit

発表年月： 2012年12月
State dependent scan flip-flop with key-based configuration against scan-based side channel attack on RSA circuit

Yuta Atobe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS

発表年月： 2012年12月

　概要を見る

Scan test is one of the useful design for testability techniques, which can detect circuit failure efficiently. However, it has been reported that it's possible to retrieve secret keys from cryptographic LSIs through scan chains. Therefore testability and security contradicted to each other, and there is a need to an efficient design for testability circuit so as to satisfy both testability and security requirement. In this paper, a secure scan architecture against scan-based attack is proposed to achieve high security without compromising the testability. In our method, scan structure is dynamically changed by adding the latch to any FFs in the scan chain. We made an analysis on an RSA circuit implementation to show the effectiveness of the proposed method and discussed how our approach is resistant to scan-based attack. © 2012 IEEE.
Dynamically changeable secure scan architecture against scan-based side channel attack

Yuta Atobe, Youhua Shi, Masao Yanagisawa, Nozomu Togawa

ISOCC 2012 - 2012 International SoC Design Conference

発表年月： 2012年12月

　概要を見る

Scan test which is one of the useful design for testability techniques is effective for LSIs including cryptographic circuit. It can observe and control the internal states of the circuit under test by using scan chain. However, scan chain presents a significant security risk of information leakage for scan-based attacks which retrieves secret keys of cryptographic LSIs. In this paper, a secure scan architecture against scan-based attack which still has high testability is proposed. In our method, scan data is dynamically changed by adding the latch to any FFs in the scan chain. We show that by using proposed method, neither the secret key nor the testability of an RSA circuit implementation is compromised, and the effectiveness of the proposed method. © 2012 IEEE.
Dynamically Changeable Architecture against Scan-Based Side Channel, Attack Using State Dependent Scan Flip-Flop on RSA Circuit

発表年月： 2012年11月
VLSI implementation of a fast intra prediction algorithm for H.264/AVC encoding

Youhua Shi, Kenta Tokumitsu, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS

発表年月： 2010年12月

　概要を見る

Intra-frame coding is one of the most important technologies in H.264/AVC, which made significant contributions to the enhancement of coding efficiency of H.264/AVC at the cost of computation complexity. To address this problem, in this paper we present an efficient VLSI implementation of a computation efficient intra prediction algorithm for H.264/AVC encoding. Unlike most of existing fast intra-mode selection techniques, in the proposed method the directional differences are computed using a few selected original pixels to obtain the candidate modes with the minimal direction cost. The proposed method is hardware-friendly and provides more processing parallelism for H.264 intra-frame encoding with less overhead and less power consumption, which is expected to be utilized as a favourable accelerator hardware module in a real-time HDTV (1920×1080p) H.264 encoder. © 2010 IEEE.
State-dependent changeable scan architecture against scan-based side channel attacks

Ryuta Nara, Hiroshi Atobe, Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

ISCAS 2010 - 2010 IEEE International Symposium on Circuits and Systems: Nano-Bio Circuit Fabrics and Systems

発表年月： 2010年08月

　概要を見る

Scan test is a powerful and popular test technique because it can control and observe the internal states of the circuit under test. However, scan path would be used to discover the internals of crypto hardware, which presents a significant security risk of information leakage. An interesting design-for-test technique by inserting inverters into the internal scan path to complicate the scan structure has been recently presented. Unfortunately, it still carries the potential of being attacked through statistical analysis of the information scanned out from chips. Therefore, in this paper we propose secure scan architecture, called dynamic variable secure scan, against scan-based side channel attack. The modified scan flip-flops are state-dependent, which could cause the output of each State-dependent Scan FF to be inverted or not so as to make it more difficult to discover the internal scan architecture. ©2010 IEEE.
Design-for-secure-test for crypto cores

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings - International Test Conference

発表年月： 2009年12月

　概要を見る

Scan technology carries the potential of being misused as a "side channel" to leak out the secret information of crypto cores. To address such a design challenge, this paper proposes a design-for-secure-test (DFST) solution for crypto cores by adding a stimuli-launched flip-flop into the traditional scan flip-flop to maintain the high test quality without compromising the security. © 2009 IEEE.
Unknown response masking with minimized observable response loss and mask data

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

IEEE Asia-Pacific Conference on Circuits and Systems, Proceedings, APCCAS

発表年月： 2008年12月

　概要を見る

This paper presents a new unknown response masking technique to minimize the effect on test loss due to over-masking. Unlike previous works where the scan responses are masked before entering the response compactor, the proposed method could mask the Xs when they are transformed on the scan path. Meanwhile, the masking cells are inserted along the scan paths, thus they would have no degradation on the performance of the designs. In addition, the test data required to mask unknown responses is only one bit for each test pattern. Experimental results show the effectiveness of the proposed method. © 2008 IEEE.
GECOM: Test data compression combined with all unknown response masking

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC

発表年月： 2008年08月

　概要を見る

This paper introduces GECOM technology, a novel test compression method with seamless integration of test GEneration, test COmpression (i.e. integrated compression on scan stimulus and masking bits) and all unknown scan responses Masking for manufacturing test cost reduction. Unlike most of prior methods, the proposed method considers the unknown responses during ATPG procedure and selectively encodes the specified 1 or 0 bits (either 1s or 0s) in scan slices for compression while at the same time masks the unknown responses before sending them to the response compactor. The proposed GECOM technology consists of GECOM architecture and GECOM ATPG technique. In the GECOM architecture, for a circuit with N internal scan chains, only c tester channels, where c = [log2 N] +2, are required. GECOM ATPG generates test patterns for the GECOM architecture thus not only the scan inputs could be efficiently compressed but also all the unknown responses would be masked. Experimental results on both benchmark circuits and real industrial designs indicated the effectiveness of the proposed GECOM technique. ©2008 IEEE.
Scalable unified dual-radix architecture for Montgomery multiplication in GF{P) and GF(2ⁿ)

Kazuyuki Tanimura, Ryuta Nara, Shunitsu Kohara, Kazunori Shimizu, Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC

発表年月： 2008年08月

　概要を見る

Modular multiplication is the most dominant arithmetic operation in elliptic curve cryptography (ECC), which is a type of public-key cryptography. Montgomery multiplication is commonly used as a technique for the modular multiplication and required scalability since the bit length of operands varies depending on the security levels. Also, ECC is performed in GF(P) or GF(2n), and unified architectures for GF(P) and GF(2n) multiplier are needed. However, in previous works, changing frequency or dual-radix architecture is necessary to deal with delay-time difference between GF(P) and GF(2n) circuits of the multiplier because the critical path of GF(P) circuit is longer. This paper proposes a scalable unified dual-radix architecture for Montgomery multiplication in GF(P) and GF(2n). The proposed architecture unifies 4 parallel radix-216multipliers in GF(P) and a radix-264multiplier in GF(2n) into a single unit. Applying lower radix to GF(P) multiplier shortens its critical path and makes it possible to compute the operands in the two fields using the same multiplier at the same frequency so that clock dividers to deal with the delay-time difference are not required. Moreover, parallel architecture in GF(P) reduces the clock cycles increased by dual-radix approach. Consequently, the proposed architecture achieves to compute GF(P) 256-bit Montgomery multiplication in 0.23μs. ©2008 IEEE.
Design for secure test - A case study on pipelined advanced encryption standard

Youhua Shi, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings - IEEE International Symposium on Circuits and Systems

発表年月： 2007年09月

　概要を見る

Cryptography plays an important role in the security of data transmission. To ensure the correctness of crypto hardware, we should conduct testing at fabrication and infield. However, the state-of-the-art scan-based test techniques, to achieve high test qualities, need to increase the testability of the circuit under test, which carries a potential of being misused to reveal the secret information of the crypto hardware. Thus, to develop efficient test strategies for crypto hardware to achieve high test quality without compromising security becomes an important task. In this paper we discuss the development of a Design-forSecure-Test (DFST) technique for pipelined AES to overcome the above contradiction between security and test quality in testing crypto hardware. Unlike previous works, the proposed method can keep all the secrets inside and provide high test quality and fault diagnosis ability as well. Furthermore, the proposed DFST technique can significantly reduce test application time, test data volume, and test generation effort as additional benefits. © 2007 IEEE.
Low-cost IP core test using multiple-mode loading scan chain and scan chain clusters

Gang Zeng, Youhua Shi, Toshinori Takabatake, Masao Yanagisawa, Hideo Ito

Proceedings - IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems

発表年月： 2006年12月

　概要を見る

A fixing-shifting encoding (FSE) method is proposed to reduce test cost of IP cores. The FSE method reduces test cost by supporting multiple-mode loading test data, i.e., parallel loading, left-direction, and right-direction serial loading for each test slice data. Furthermore, the FSE that utilizes only two test channels can support a large number of internal scan chains and achieve further reduction in test cost by combining with scan chain clustering method. As a non-intrusive and automatic test pattern generation (ATPG) independent solution, the approach is applicable to IP core testing because it requires neither redesign of the core under test (CUT) nor running any additional ATPG for the encoding procedure. In addition, the decoder has low hardware overhead, and its design is independent of the CUT. Experimental results for some large ISCAS 89 benchmarks and an industry ASIC design have proven the efficiency of the proposed approach. © 2006 IEEE.
FCSCAN: An efficient multiscan-based test compression technique for test cost reduction

Youhua Shi, Nozomu Togawa, Shinji Kimura, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC

発表年月： 2006年09月

　概要を見る

This paper proposes a new multiscan-based test input data compression technique by employing a Fan-out Compression Scan Architecture (FCSCAN) for test cost reduction. The basic idea of FCSCAN is to target the minority specified 1 or 0 bits (either 1 or 0) in scan slices for compression. Due to the low specified bit density in test cube set, FCSCAN can significantly reduce input test data volume and the number of required test channels so as to reduce test cost. The FCSCAN technique is easy to be implemented with small hardware overhead and does not need any special ATPG for test generation. In addition, based on the theoretical compression efficiency analysis, improved procedures are also proposed for the FCSCAN to achieve further compression. Experimental results on both benchmark circuits and one real industrial design indicate that drastic reduction in test cost can be indeed achieved. © 2006 IEEE.
Low power test compression technique for designs with multiple scan chains

Youhua Shi, Nozomu Togawa, Shinji Kimura, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asian Test Symposium

発表年月： 2005年12月

　概要を見る

This paper presents a new DFT technique that can significantly reduce test data volume as well as scan-in power consumption for multiscan-based designs. It can also help to reduce test time and tester channel requirements with small hardware overhead. In the proposed approach, we start with apre-computed test cube set and fill the don't-cares with proper values for joint reduction of test data volume and scan power consumption. In addition we explore the linear dependencies of the scan chains to construct a fanout structure only with inverters to achieve further compression. Experimental results for the larger ISCAS'89 benchmarks show the efficiency of the proposed technique. © 2005 IEEE.
Alternative run-length coding through scan chain reconfiguration for joint minimization of test data volume and power consumption in scan test

Youhua Shi, Shinji Kimura, Nozomu Togawa, Masao Yanagisawa, Tatsuo Ohtsuki

Proceedings of the Asian Test Symposium

発表年月： 2004年12月

　概要を見る

Test data volume and scan power are two major concerns in SoC test. In this paper we present an alternative run-length coding method through scan chain reconfiguration to reduce both test data volume and scan-in power consumption. The proposed method analyzes the compatibility of the internal scan cells for a given test set and then divides the scan cells into compatible classes. To extract the compatible scan cells we apply a heuristic algorithm by solving the graph coloring problem; and then a simple greedy algorithm is used to configure the scan chain for the minimization of scan power. Experimental results for the larger IS-CAS'89 benchmarks show that the proposed approach leads to highly reduced test data volume with significant power savings during scan test.
Reducing test data volume for multiscan-based designs through single/sequence mixed encoding

Youhua Shi, Youhua Shi, Shinji Kimura, Nozomu Togawa, Nozomu Togawa, Masao Yanagisawa, Masao Yanagisawa, Tatsuo Ohtsuki, Tatsuo Ohtsuki

Midwest Symposium on Circuits and Systems

発表年月： 2004年12月

　概要を見る

This paper presents a new test data compression technique for multiscan-based designs through dictionary-based encoding on the single or sequences scan-inputs. In spite of its simplicity, it achieves significant reduction in test data volume. Unlike some previous approaches on test data compression, our approach eliminates the need for additional synchronization and handshaking between the CUT and the ATE, so it is especially suitable to be integrated in a low cost test scheme for SoC test In addition in contrast to previous dictionary-based coding techniques, even for the CUT with a small number of scan chains, the proposed approach can achieve satisfied reduction in test data volume. Experimental results showed the proposed test scheme works particularly well for the large ISCAS'89 benchmarks.
Multiple test set generation method for LFSR-based BIST

Youhua Shi, Zhe Zhang

Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC

発表年月： 2003年01月

　概要を見る

© 2003 IEEE. In this paper we propose a new reseeding method for LFSR-based test pattern generation suitable for circuits with random pattern resistant faults. The character of our method is that the proposed test pattern generator (TPG) can work both in normal LFSR mode, to generate pseudorandom test vectors, and in jumping mode to make the TPG jump from a state to the required state (seed of next group). Experimental results indicate that its superiority against other known reseeding techniques with respect to the length of the test sequence and the required area overhead.

▼全件表示

共同研究・競争的資金等の研究課題

高効率な自立電源エネルギーハーベスティング回路の研究開発

日本学術振興会科学研究費助成事業

研究期間:

2018年04月

-

2021年03月

史又華

　概要を見る

人間がエネルギー源（例えば、踏む、歩く、漕ぐ、押すなどの動作によって発電させようというもの）として圧電素子を用いた高効率なエネルギーハーベスティング（EH）インターフェース回路設計の研究開発を行った。特に、(1)多段反転E-SECE回路と複数の圧電素子を用いたEH回路の最適化による自己駆動型スイッチング制御回路の設計、(2)スイッチの導通タイミングに位相差を挿入による広帯域化設計手法の提案、および(3)振動発電を利用したバッテリーフリー無線送信可能なウェアラブルデバイスの実現などの成果を得た。
超低消費電力設計における遅延テスト設計技術に関する研究

科学研究費助成事業(早稲田大学) 科学研究費助成事業(若手研究(B))

研究期間:

2011年

-

2013年

史又華

　概要を見る

近年、情報処理LSI回路の消費電力を下げるために、様々な研究が行われてきた。一方、既存の超低消費電力LSI回路における高信頼化設計手法は、電源電圧ごとの遅延テストまたは電源間の遅延テストを行うために、非常に複雑になる。そのため、超低消費電力LSI回路に対して高い信頼性を保つ設計技術の確立が強く求められると考えられる。そこで、本研究は、上記の問題点を解決するために、１）サブスレッショルド回路における高信頼化設計、２）配線遅延を考慮した低消費電力化設計、及び３）ロジック回路遅延予測によるタイミングエラー検出可能なLSI設計技術について低消費電力回路における高信頼化設計の研究開発を行ってきた。
暗号処理向け組み込みLSIとそのテスト設計環境の構築

科学研究費助成事業(早稲田大学) 科学研究費助成事業(基盤研究(C))

研究期間:

2009年

-

2011年

柳澤政生, 奈良竜太, 史又華

　概要を見る

通信ネットワークの普及やディジタル回路技術の発達に伴って,情報の安全性確保や不正処理の防止のために暗号処理LSIが使われている.通常, LSIでは製造時の検査や動作テストを行うために,スキャンチェインと呼ばれるテスト用回路と信号線が用意されるが,暗号回路ではこれが不正な情報取得の糸口となる可能性がある.そこで本研究では,テスト目的での利用を可能としながら内部情報の解析を防ぐSD-SFF(State Dependent ScanFlip Flop :状態依存スキャンフリップフロップ)という機構を提案する.この提案テスト回路を利用することで,製造時のコスト増加を抑えながら機密性の高い暗号通信を可能とする.
フォールパス自動検出および過剰テスト緩和の合成システムに関する研究

科学研究費助成事業(早稲田大学) 科学研究費助成事業(若手研究(B))

研究期間:

2007年

-

2009年

史又華

　概要を見る

システムLSIの多機能化、高速化及び低消費電力化につれ、新たな問題点が明らかになってきた。その一つは、フォールスパスの遅延に関する解析・テスト合成設計である。本研究では、システムLSI設計時間の短縮とテスト設計コストの削減のため、設計解析合成システムを構築し、その環境を活用して、次世代システムLSI向けの解析・テスト技術の開発を行ってきた。特に、フォールスパス自動検出システムの開発及び過剰テストを考慮した遅延故障テスト技術に関する研究を行ってきた。
タイミングエラー予測によるばらつき耐性を有するLSI設計技術に関する研究

科学研究費助成事業(早稲田大学) 科学研究費助成事業(基盤研究(C))

Misc

Energy-efficient and Real-time FPGA-based YOLOv6 accelerator for Object Detection

情報処理学会 DAシンポジウム 129 - 134 2023年08月 [査読有り]

担当区分：最終著者, 責任著者

研究発表ペーパー・要旨（全国大会，その他学術会議）
Optimizing Hardware-Friendly Object Detection Network for Edge Devices

情報処理学会 DAシンポジウム 124 - 128 2023年08月 [査読有り]

担当区分：最終著者, 責任著者

研究発表ペーパー・要旨（全国大会，その他学術会議）
エッジデバイス搭載可能なAttention Moduleを用いた動的手話認識システム

孟悦捷, 柳澤政生, 史又華

人工知能学会第37回全国大会 2023年07月 [査読有り]

担当区分：最終著者, 責任著者

研究発表ペーパー・要旨（全国大会，その他学術会議）
Attention Mask によるディープフェイク動画像の検出

小野尚紀, 史又華

人工知能学会第37回全国大会 2023年07月 [査読有り]

担当区分：最終著者, 責任著者

研究発表ペーパー・要旨（全国大会，その他学術会議）
TFNNを用いた音声感情認識システムに関する考察

新崎正人, 柳澤政生, 史又華

人工知能学会, 第121回人工知能基本問題研究会 2022年09月

機関テクニカルレポート，技術報告書，プレプリント等
自立駆動可能な摩擦帯電エネルギーハーベスティング回路の設計

山本圭乃, 蘇怡瑞, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 35 53 - 58 2022年08月 [査読有り]

研究発表ペーパー・要旨（全国大会，その他学術会議）
人の動作によるエネルギーハーベスティングのための圧電素子の実機実験

山口航, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 35 263 - 268 2022年08月 [査読有り]

研究発表ペーパー・要旨（全国大会，その他学術会議）
リーク削減による低消費電力SRAMの設計—A low power SRAM design with leakage power reduction

伊藤卓, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 31 197 - 202 2018年05月 [査読有り]

CiNii
低周波圧電エネルギーハーベスティングにおけるMOSs SP-SSHI手法—MOSs SP-SSHI for low frequency piezoelectric energy harvesting

杉山貴紀, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 31 86 - 91 2018年05月 [査読有り]

CiNii
CNNに対する概算加算器の適用と評価—Application and evaluation of CNN with approximate adders

井上雄太, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 31 191 - 196 2018年05月 [査読有り]

CiNii
C-elementを用いたソフトエラー耐性をもつSHCラッチの設計

田島咲季, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 30 214 - 219 2017年05月 [査読有り]

CiNii
内部ノードを利用したソフトエラー検出ラッチの設計

中垣直道, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 30 220 - 225 2017年05月 [査読有り]

CiNii
最大エラー距離に基づくGeAr回路の最適化

早水謙, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 30 7 - 12 2017年05月 [査読有り]

CiNii
自己動力型スイッチング磁気変圧回路を用いたエネルギーハーベスティングシステム

川合洋平, 戸川望, 柳澤政生, 史又華

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 30 1 - 6 2017年05月 [査読有り]

CiNii
高速かつ低電力なソフトエラー耐性をもつFast-SEHラッチの設計

田島咲季, 史又華, 戸川望, 柳澤政生

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 29 220 - 224 2016年05月 [査読有り]

CiNii
タイミングエラー耐性を持つAES暗号回路の設計 (VLSI設計技術)

吉田慎之介, 史又華, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 115 ( 465 ) 73 - 78 2016年02月

CiNii
悪意ある機能を無効化する内部ハードウェアトロイ認証 (VLSI設計技術)

大屋優, 史又華, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 115 ( 465 ) 79 - 84 2016年02月

CiNii
ゲートレベルネットリストの脆弱性を表現する指標 (ディペンダブルコンピューティング)

大屋優, 史又華, 山下哲孝

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 115 ( 339 ) 141 - 146 2015年12月

CiNii
ゲートレベルネットリストの脆弱性を表現する指標 (VLSI設計技術)

大屋優, 史又華, 山下哲孝

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 115 ( 338 ) 141 - 146 2015年12月

CiNii
15nmプロセスにおける低電力な耐ソフトエラーラッチの設計 (VLSI設計技術)

田島咲季, 史又華, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 115 ( 338 ) 123 - 127 2015年12月

CiNii
15nmプロセスにおける低電力な耐ソフトエラーラッチの設計 (ディペンダブルコンピューティング)

田島咲季, 史又華, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 115 ( 339 ) 123 - 127 2015年12月

CiNii
A-9-2 低電力なソフトエラー耐性をもつNew-SEHラッチの設計(A-9.信頼性,一般セッション)

田島咲季, 史又華, 戸川望, 柳澤政生

電子情報通信学会基礎・境界ソサイエティ/NOLTAソサイエティ大会講演論文集 2015 106 - 106 2015年08月

CiNii
基板バイアス制御による遅延ばらつき補償および配線遅延を考慮した低エネルギーオーバーヘッド指向の高位合成手法

井川昂輝, 史又華, 柳澤政生, 戸川望

DAシンポジウム2015論文集 ( 2015 ) 23 - 28 2015年08月

CiNii
クロックグリッチに基づく故障解析に耐性を持つAES暗号回路 (VLSI設計技術)

平野大輔, 史又華, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 115 ( 21 ) 51 - 55 2015年05月

CiNii
クロックグリッチに基づく故障解析に耐性を持つAES暗号回路

平野大輔, 史又華, 戸川望, 柳澤政生

情報処理学会研究報告. SLDM, [システムLSI設計技術] 2015 ( 10 ) 1 - 5 2015年05月

　概要を見る

近年,暗号回路への攻撃手法として,故障解析が脅威となっている.回路への故障の発生方法には,レーザー照射や電圧変動,クロックグリッチなどの方法があるが,実装や制御の容易性からクロックグリッチが注目されている.対策手法として,回路を三重化して比較する空間冗長化手法や,同じ処理を 2 回行って比較する時間冗長化手法が存在する.しかし,これらの手法は面積オーバーヘッド或いは時間オーバーヘッドが大きいという問題点がある.本稿では,故障解析の誘因となるクロックグリッチを高速に検出可能で,面積オーバーヘッドを 4.9% に抑えた AES 暗号回路を提案する.

CiNii
低電力耐ソフトエラーラッチの設計 (VLSI設計技術)

田島咲季, 史又華, 戸川望, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 476 ) 55 - 60 2015年03月

　概要を見る

近年のLSIの微細化に伴い,ソフトエラーによる信頼性の低下が問題視されている.フリップフロップの多重化等の様々なソフトエラー対策が提案されてきたが,多重化による面積・電力の増大が問題である.そこで,本稿では既存のSEHラッチに低電力化技術であるTSPC (True Single Phase Clock)を取り入れた,低電力耐ソフトエラーラッチを提案する.レイアウトを設計し,HSPICEシミュレーションによりTSPC-SEHラッチと従来のSEHラッチ,DICEラッチと比較し,ソフトエラー耐性を損なわずに電力を最大42%削減し,54%の動作速度向上を達成した.

DOI CiNii
ゲートレベルネットリストを対象としたスコアに基づくハードウェアトロイ識別手法 (VLSI設計技術)

大屋優, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 476 ) 165 - 170 2015年03月

　概要を見る

近年,企業はデジタルICの製造コストを削減するために,チップの製造をサードパーティに外注するようになった.サードパーティが製造に関わるようになり,ハードウェアトロイ(HTs)の挿入が問題視されるようになった.設計段階ではRegister Transfer Level (RTL)やゲートレベルのネットリストが1つだけ生成されるため,Goldenネットリストを仮定することはできない.以上の背景から生成されたネットリストにHTsが挿入されているか否かを識別するのは極めて困難である.本稿では,Goldenネットリストを使わずにゲートレベルのネットリストに対してスコアに基づいたHTsの有無を識別する手法を提案する.提案手法は,HTsに含まれるネット(トロイネットと呼ぶ)の特徴に注目し,トロイネットを検出することでHTsを検出する.提案手法はTrust-HUBのAbstraction Gate Levelで公開されている全てのゲートレベルのネットリストに対してHTsの有無を分類することに成功した.提案手法にかかる時間は高々数時間程度である.

CiNii
トロイネットの特徴に基づくハードウェアトロイ検出手法 (VLSI設計技術)

大屋優, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 426 ) 157 - 162 2015年01月

　概要を見る

近年,企業はチップの製造コストを削減するために,チップの製造をサードパーティに外注するようになった.サードパーティが製造に関わるようになり,ハードウェアトロイの挿入が問題視されるようになった.特に設計段階では容易にハードウェアトロイを挿入することができる,ゲートレベルのネットリストに対してハードウェアトロイ検出手法を適用する場合,Goldenネットリストを持っておらず,挿入されているハードウェアトロイを活性化させないという条件下でハードウェアトロイを検出できる手法は存在しない。本稿では,Goldenネットリストが無く,ハードウェアトロイを活性化させなくても,ハードウェアトロイを検出する手法を提案する.提案手法は,ハードウェアトロイに含まれるネット(トロイネットと呼ぶ)の特徴に注目し,トロイネットを検出することでハードウェアトロイを検出する.トロイネットの特徴は9個あり,これらの特徴に一致するネットに重みづけを行うことで,トロイネットを検出する.提案手法はTrust-HUBのAbstraction Gate Levelで公開されているハードウェアトロイの挿入されている全てのゲートレベルのネットリストに対してトロイネットを検出した.加えて,2個のネットリストを除いて,誤検出なくトロイネットのみを検出することに成功した.提案手法にかかる時間は高々数十分程度である.

CiNii
トロイネットの特徴に基づくハードウェアトロイ検出手法 (コンピュータシステム)

大屋優, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 427 ) 157 - 162 2015年01月

　概要を見る

近年,企業はチップの製造コストを削減するために,チップの製造をサードパーティに外注するようになった.サードパーティが製造に関わるようになり,ハードウェアトロイの挿入が問題視されるようになった.特に設計段階では容易にハードウェアトロイを挿入することができる,ゲートレベルのネットリストに対してハードウェアトロイ検出手法を適用する場合,Goldenネットリストを持っておらず,挿入されているハードウェアトロイを活性化させないという条件下でハードウェアトロイを検出できる手法は存在しない。本稿では,Goldenネットリストが無く,ハードウェアトロイを活性化させなくても,ハードウェアトロイを検出する手法を提案する.提案手法は,ハードウェアトロイに含まれるネット(トロイネットと呼ぶ)の特徴に注目し,トロイネットを検出することでハードウェアトロイを検出する.トロイネットの特徴は9個あり,これらの特徴に一致するネットに重みづけを行うことで,トロイネットを検出する.提案手法はTrust-HUBのAbstraction Gate Levelで公開されているハードウェアトロイの挿入されている全てのゲートレベルのネットリストに対してトロイネットを検出した.加えて,2個のネットリストを除いて,誤検出なくトロイネットのみを検出することに成功した.提案手法にかかる時間は高々数十分程度である.

CiNii
トロイネットの特徴に基づくハードウェアトロイ検出手法 (リコンフィギャラブルシステム)

大屋優, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 428 ) 157 - 162 2015年01月

　概要を見る

近年,企業はチップの製造コストを削減するために,チップの製造をサードパーティに外注するようになった.サードパーティが製造に関わるようになり,ハードウェアトロイの挿入が問題視されるようになった.特に設計段階では容易にハードウェアトロイを挿入することができる,ゲートレベルのネットリストに対してハードウェアトロイ検出手法を適用する場合,Goldenネットリストを持っておらず,挿入されているハードウェアトロイを活性化させないという条件下でハードウェアトロイを検出できる手法は存在しない。本稿では,Goldenネットリストが無く,ハードウェアトロイを活性化させなくても,ハードウェアトロイを検出する手法を提案する.提案手法は,ハードウェアトロイに含まれるネット(トロイネットと呼ぶ)の特徴に注目し,トロイネットを検出することでハードウェアトロイを検出する.トロイネットの特徴は9個あり,これらの特徴に一致するネットに重みづけを行うことで,トロイネットを検出する.提案手法はTrust-HUBのAbstraction Gate Levelで公開されているハードウェアトロイの挿入されている全てのゲートレベルのネットリストに対してトロイネットを検出した.加えて,2個のネットリストを除いて,誤検出なくトロイネットのみを検出することに成功した.提案手法にかかる時間は高々数十分程度である.

CiNii
トロイネットの特徴に基づくハードウェアトロイ検出手法

大屋優, 史又華, 柳澤政生, 戸川望

情報処理学会研究報告. SLDM, [システムLSI設計技術] 2015 ( 28 ) 1 - 6 2015年01月

　概要を見る

近年,企業はチップの製造コストを削減するために,チップの製造をサードパーテイに外注するようになった.サードパーテイが製造に関わるようになり,ハードウェアトロイの挿入が問題視されるようになった.特に設計段階では容易にハードウェアトロイを挿入することができる.ゲートレベルのネットリストに対してハードウェアトロイ検出手法を適用する場合,Golden ネットリストを持っておらず,挿入されているハードウェアトロイを活性化させないという条件下でハードウェアトロイを検出できる手法は存在しない本稿では,Golden ネットリストが無く,ハードウェアトロイを活性化させなくても,ハードウェアトロイを検出する手法を提案する.提案手法は,ハードウェアトロイに含まれるネット (トロイネットと呼ぶ) の特徴に注目し,トロイネットを検出することでハードウェアトロイを検出する.トロイネットの特徴は 9 個あり,これらの特徴に一致するネットに重みづけを行うことで,トロイネットを検出する.提案手法は Trust-HUB の Abstraction Gate Level で公開されているハードウェアトロイの挿入されている全てのゲートレベルのネットリストに対してトロイネットを検出した.加えて,2 個のネットリストを除いて,誤検出なくトロイネットのみを検出することに成功した.提案手法にかかる時間は高々数十分程度である.

CiNii
タイミングエラーへの耐性を持つフリップフロップ設計 (VLSI設計技術) -- (デザインガイア2014 : VLSI設計の新しい大地)

鈴木大渡, 史又華, 戸川望, 宇佐美公良, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 328 ) 45 - 50 2014年11月

　概要を見る

集積回路の微細化の影響により,回路のばらつきが大きくなっており,設計に必要な電源電圧やクロック周波数のマージンが増大している.マージンの緩和のため,タイミングエラーへの耐性を持つ回路の構造が盛んに研究されている.本稿では,フリップフロップの動作とラッチの動作を動的に切り替えることによりタイミングエラー耐性を実現するTime Borrowing Flip-Flop(TBFF)のトランジスタレベルの構造を2通り提案した.また,HSPICEシミュレーションによる評価を行い,従来手法と比較して消費エネルギーを最大20.6%削減できることを示した.

CiNii
タイミングエラー予測回路による再構成可能デバイス上でのデータ依存最適化回路設計 (VLSI設計技術) -- (デザインガイア2014 : VLSI設計の新しい大地)

川村一志, 阿部晋矢, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 328 ) 51 - 56 2014年11月

　概要を見る

LSI内部の各パス遅延は入力データに応じて様々に変動する.この性質を利用することで,計算精度をわずかに落としながらも高速に動作するLSIの設計が可能になる.本稿では,入力データ群にもとづき特定された最適化すべきパスをリコンフィギュレーションし最適化する,新たな回路設計アルゴリズムを提案する.提案アルゴリズムは最適化対象の回路にタイミングエラー予測回路を挿入し動作させることで被最適化パスを特定,動的に再構成し与えられたエラー制約内で動作クロック周期の最小化を図る.本アルゴリズムを加算器に対して適用した結果,通常のクリティカルパス最小化の設計と比較し,2.1%以下のエラーを許容する制約下で最大18.5%の高速化に成功した.

CiNii
回路面積を考慮したSuspicious Timing Error Prediction回路の挿入位置決定手法の改良と評価 (VLSI設計技術) -- (デザインガイア2014 : VLSI設計の新しい大地)

吉田慎之介, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 328 ) 57 - 62 2014年11月

　概要を見る

近年,半導体技術の進展に伴いタイミングエラー発生の危険性が増加している.STEPはタイミングエラーを事前に予測できる手法であるが,STEP回路を挿入する位置が重要である.このような背景から、回路面積を考慮したSTEP回路の挿入位置決定手法を提案した.本手法ではSTEP回路の個数を削減するために短いパスを無視するが,長いパスまで無視する可能性があった.また,短いパスに合わせて位置ラベルを付けるため,STEP回路の挿入位置がパスの後半に偏る可能性があった.本稿ではSTEP回路の挿入位置決定手法で用いる,短いパスの探索方法とラベル付けの方法を改良する.パスの長さを推定することで短いパスのみを無視できるため,これまでSTEP回路を挿入しなかった長いパスで発生するタイミングエラーが予測できる.また,任意の長さのパスに合わせたラベル付けもできるため,チェックポイントがパスの後半となることを防ぐ.改良した手法を複数の回路に対して適用し,最大動作周波数の向上を図る.実験結果よりSTEP回路を入れない場合と比較して,最大動作周波数を平均1.71倍に向上させることができた.改良前の手法と比較すると,最大動作周波数を平均1.15倍に向上させることができた.

CiNii
DTMOSを用いたサブスレッショルド回路の高速化設計 (VLSI設計技術) -- (デザインガイア2014 : VLSI設計の新しい大地)

福留祐治, 史又華, 戸川望, 宇佐美公良, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 328 ) 117 - 121 2014年11月

　概要を見る

サブスレッショルド領域で回路を動作させることで低電力化は実現されるが,同時に速度が劣化するトレードオフの関係にある.本稿ではサブスレッショルド領域において低電力で高速化を実現するため,DTMOSを用いたサブスレッショルド回路の高速化設計を行い,トランジスタレベルのシミュレーションの結果,30〜45%高速化し,V_<dd>=0.2V,0.3Vにおいて平均15%低エネルギー化したことを示す.

CiNii
ハードウェアトロイに含まれるネットに着目したハードウェアトロイ検出手法 (VLSI設計技術) -- (デザインガイア2014 : VLSI設計の新しい大地)

大屋優, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 328 ) 135 - 140 2014年11月

　概要を見る

近年チップの製造をサードパーティに外注するようになり,ハードウェアトロイが挿入される可能性が高まってきた.特に設計段階では簡単にハードウェアトロイを挿入することができる.ゲートレベルのネットリストに対してハードウェアトロイ検出手法を適用する場合,我々はGoldenネットリストを持っておらず,挿入されているハードウェアトロイを活性化するという条件下でハードウェアトロイを検出する手法が存在するのみである.本稿では,Goldenネットリストが無く,ハードウェアトロイを活性化させなくてもハードウェアトロイを検出する手法として,低スイッチング確率のネット(LSLGネットと呼ぶ)の検出を通じてハードウェアトロイを検出する手法を提案する.LSLGネットはネットリストに含まれるネットの数%であるにも関わらず,Trust-HUBのAbstraction Gate Levelで公開されているハードウェアトロイが挿入されている全てのゲートレベルのネットリストに対して,ハードウェアトロイの一部を検出することに成功した.提案手法にかかる時間は高々十数分程度である.

CiNii
HDR-mcvを対象とした複数クロックドメインおよび複数電源電圧による低電力化高位合成手法 (VLSI設計技術) -- (デザインガイア2014 : VLSI設計の新しい大地)

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 328 ) 203 - 208 2014年11月

　概要を見る

低電力かつ高速なLSIの設計へ向け,配線遅延を考慮しながら複数クロックドメイン,複数電源電圧を同時に適用可能なHDR-mcvおよび高位合成手法が提案された.従来手法はクロックおよび電圧をハドルと呼ぶ区画毎に割り当てるが,クロックツリー数の増加による消費エネルギーのオーバヘッドが無視できない.提案手法はクロックに同期する論理,および演算回路に対し独立に電圧を割り当てることで,クロックツリー数を増加せずに複数クロックドメインと複数電源電圧を同時適用する.計算機実験結果により,提案手法は従来のHDR-mcvアーキテクチャを対象とした高位合成アルゴリズムと比較し50%程度消費エネルギーを削減し,最終的に従来のレジスタ分散型アーキテクチャと比較し提案手法は60%程度消費エネルギーを削減できることを確認した.

CiNii
タイミングエラーへの耐性を持つフリップフロップ設計 (ディペンダブルコンピューティング) -- (デザインガイア2014 : VLSI設計の新しい大地)

鈴木大渡, 史又華, 戸川望, 宇佐美公良, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 329 ) 45 - 50 2014年11月

　概要を見る

集積回路の微細化の影響により,回路のばらつきが大きくなっており,設計に必要な電源電圧やクロック周波数のマージンが増大している.マージンの緩和のため,タイミングエラーへの耐性を持つ回路の構造が盛んに研究されている.本稿では,フリップフロップの動作とラッチの動作を動的に切り替えることによりタイミングエラー耐性を実現するTime Borrowing Flip-Flop(TBFF)のトランジスタレベルの構造を2通り提案した.また,HSPICEシミュレーションによる評価を行い,従来手法と比較して消費エネルギーを最大20.6%削減できることを示した.

CiNii
タイミングエラー予測回路による再構成可能デバイス上でのデータ依存最適化回路設計 (ディペンダブルコンピューティング) -- (デザインガイア2014 : VLSI設計の新しい大地)

川村一志, 阿部晋矢, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 329 ) 51 - 56 2014年11月

　概要を見る

LSI内部の各パス遅延は入力データに応じて様々に変動する.この性質を利用することで,計算精度をわずかに落としながらも高速に動作するLSIの設計が可能になる.本稿では,入力データ群にもとづき特定された最適化すべきパスをリコンフィギュレーションし最適化する,新たな回路設計アルゴリズムを提案する.提案アルゴリズムは最適化対象の回路にタイミングエラー予測回路を挿入し動作させることで被最適化パスを特定,動的に再構成し与えられたエラー制約内で動作クロック周期の最小化を図る.本アルゴリズムを加算器に対して適用した結果,通常のクリティカルパス最小化の設計と比較し,2.1%以下のエラーを許容する制約下で最大18.5%の高速化に成功した.

CiNii
回路面積を考慮したSuspicious Timing Error Prediction回路の挿入位置決定手法の改良と評価 (ディペンダブルコンピューティング) -- (デザインガイア2014 : VLSI設計の新しい大地)

吉田慎之介, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 329 ) 57 - 62 2014年11月

　概要を見る

近年,半導体技術の進展に伴いタイミングエラー発生の危険性が増加している.STEPはタイミングエラーを事前に予測できる手法であるが,STEP回路を挿入する位置が重要である.このような背景から,回路面積を考慮したSTEP回路の挿入位置決定手法を提案した.本手法ではSTEP回路の個数を削減するために短いパスを無視するが,長いパスまで無視する可能性があった.また,短いパスに合わせて位置ラベルを付けるため,STEP回路の挿入位置がパスの後半に偏る可能性があった.本稿ではSTEP回路の挿入位置決定手法で用いる,短いパスの探索方法とラベル付けの方法を改良する.パスの長さを推定することで短いパスのみを無視できるため,これまでSTEP回路を挿入しなかった長いパスで発生するタイミングエラーが予測できる.また,任意の長さのパスに合わせたラベル付けもできるため,チェックポイントがパスの後半となることを防ぐ.改良した手法を複数の回路に対して適用し,最大動作周波数の向上を図る.実験結果よりSTEP回路を入れない場合と比較して,最大動作周波数を平均1.71倍に向上させることができた.改良前の手法と比較すると,最大動作周波数を平均1.15倍に向上させることができた.

CiNii
DTMOSを用いたサブスレッショルド回路の高速化設計 (ディペンダブルコンピューティング) -- (デザインガイア2014 : VLSI設計の新しい大地)

福留祐治, 史又華, 戸川望, 宇佐美公良, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 329 ) 117 - 121 2014年11月

　概要を見る

サブスレッショルド領域で回路を動作させることで低電力化は実現されるが,同時に速度が劣化するトレードオフの関係にある.本稿ではサブスレッショルド領域において低電力で高速化を実現するため,DTMOSを用いたサブスレッショルド回路の高速化設計を行い,トランジスタレベルのシミュレーションの結果,30〜45%高速化し,V_<dd>=0.2V,0.3Vにおいて平均15%低エネルギー化したことを示す.

CiNii
ハードウェアトロイに含まれるネットに着目したハードウェアトロイ検出手法 (ディペンダブルコンピューティング) -- (デザインガイア2014 : VLSI設計の新しい大地)

大屋優, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 329 ) 135 - 140 2014年11月

　概要を見る

近年チップの製造をサードパーティに外注するようになり,ハードウェアトロイが挿入される可能性が高まってきた.特に設計段階では簡単にハードウェアトロイを挿入することができる.ゲートレベルのネットリストに対してハードウェアトロイ検出手法を適用する場合,我々はGoldenネットリストを持っておらず,挿入されているハードウェアトロイを活性化するという条件下でハードウェアトロイを検出する手法が存在するのみである.本稿では,Goldenネットリストが無く,ハードウェアトロイを活性化させなくてもハードウェアトロイを検出する手法として,低スイッチング確率のネット(LSLGネットと呼ぶ)の検出を通じてハードウェアトロイを検出する手法を提案する.LSLGネットはネットリストに含まれるネットの数%であるにも関わらず,Trust-HUBのAbstraction Gate Levelで公開されているハードウェアトロイが挿入されている全てのゲートレベルのネットリストに対して,ハードウェアトロイの一部を検出することに成功した.提案手法にかかる時間は高々十数分程度である.

CiNii
HDR-mcvを対象とした複数クロックドメインおよび複数電源電圧による低電力化高位合成手法 (ディペンダブルコンピューティング) -- (デザインガイア2014 : VLSI設計の新しい大地)

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 329 ) 203 - 208 2014年11月

　概要を見る

低電力かつ高速なLSIの設計へ向け,配線遅延を考慮しながら複数クロックドメイン,複数電源電圧を同時に適用可能なHDR-mcvおよび高位合成手法が提案された.従来手法はクロックおよび電圧をハドルと呼ぶ区画毎に割り当てるが,クロックツリー数の増加による消費エネルギーのオーバヘッドが無視できない.提案手法はクロックに同期する論理,および演算回路に対し独立に電圧を割り当てることで,クロックツリー数を増加せずに複数クロックドメインと複数電源電圧を同時適用する.計算機実験結果により,提案手法は従来のHDR-mcvアーキテクチャを対象とした高位合成アルゴリズムと比較し50%程度消費エネルギーを削減し,最終的に従来のレジスタ分散型アーキテクチャと比較し提案手法は60%程度消費エネルギーを削減できることを確認した.

CiNii
タイミングエラー予測回路による再構成可能デバイス上でのデータ依存最適化回路設計

川村一志, 阿部晋矢, 史又華, 柳澤政生, 戸川望

研究報告システムとLSIの設計技術（SLDM） 2014 ( 2 ) 1 - 6 2014年11月

　概要を見る

LSI 内部の各パス遅延は入力データに応じて様々に変動する．この性質を利用することで，計算精度をわずかに落としながらも高速に動作する LSI の設計が可能になる．本稿では，入力データ群にもとづき特定された最適化すべきパスをリコンフィギュレーションし最適化する，新たな回路設計アルゴリズムを提案する．提案アルゴリズムは最適化対象の回路にタイミングエラー予測回路を挿入し動作させることで被最適化パスを特定，動的に再構成し与えられたエラー制約内で動作クロック周期の最小化を図る．本アルゴリズムを加算器に対して適用した結果，通常のクリティカルパス最小化の設計と比較し，2.1 ％以下のエラーを許容する制約下で最大 18.5％の高速化に成功した．The propagation delay along each path inside an LSI widely varies depending on input data, and this property can be exploited to design high-performance approximation circuit with a negligible error rate. In this paper, we propose a novel approximation circuit design algorithm, which identifies paths to be optimized based on input data and reconfigures these paths. Our algorithm first identifies the optimized paths by incorporating timing error prediction circuits into a target circuit and running them in practice. These paths are then dynamically reconfigured within an accuracy constraint with the objective of maximizing its performance. Experimental results targeting a set of basic adders show that our algorithm can achieve performance increase by up to 18.5% within acceptable error of 2.1% compared with conventional design techniques.

CiNii
タイミングエラーへの耐性を持つフリップフロップ設計

鈴木大渡, 史又華, 戸川望, 宇佐美公良, 柳澤政生

研究報告システムとLSIの設計技術（SLDM） 2014 ( 1 ) 1 - 6 2014年11月

　概要を見る

集積回路の微細化の影響により，回路のばらつきが大きくなっており，設計に必要な電源電圧やクロック周波数のマージンが増大している．マージンの緩和のため，タイミングエラーへの耐性を持つ回路の構造が盛んに研究されている．本稿では，フリップフロップの動作とラッチの動作を動的に切り替えることによりタイミングエラー耐性を実現する Time Borrowing Flip-Flop(TBFF) のトランジスタレベルの構造を 2 通り提案したまた，HSPICE シミュレーションによる評価を行い，従来手法と比較して消費エネルギーを最大 20.6%削減できることを示した．Under the influence of the miniaturization of the integrated circuit, the variation of the operation condition of the circuit becomes bigger, and margins of the supply voltage and the clock frequency necessary for a design increase. For the mitigation of the margin, the structure of the circuit with the timing error tolerance is studied flourishingly. In this paper, we propose two new Time Borrowing Flip-Flops (TBFF) in transistor level to realize timing error tolerance by switching from flip-flop to latch dynamically. HSPICE simulation results show that the proposed TBFF can achieve up to 28.1% power reduction when compared with existing works.

CiNii
回路面積を考慮したSuspicious Timing Error Prediction 回路の挿入位置決定手法の改良と評価

吉田慎之介, 史又華, 柳澤政生, 戸川望

研究報告システムとLSIの設計技術（SLDM） 2014 ( 3 ) 1 - 6 2014年11月

　概要を見る

近年，半導体技術の進展に伴いタイミングエラー発生の危険性が増加している．STEP はタイミングエラーを事前に予測できる手法であるが，STEP 回路を挿入する位置が重要である．このような背景から、回路面積を考慮した STEP 回路の挿入位置決定手法を提案した．本手法では STEP 回路の個数を削減するために短いパスを無視するが，長いパスまで無視する可能性があった．また，短いパスに合わせて位置ラベルを付けるため，STEP 回路の挿入位置がパスの後半に偏る可能性があった．本稿では STEP 回路の挿入位置決定手法で用いる，短いパスの探索方法とラベル付けの方法を改良する．パスの長さを推定することで短いパスのみを無視できるため，これまで STEP 回路を挿入しなかった長いパスで発生するタイミングエラーが予測できる．また，任意の長さのパスに合わせたラベル付けもできるため，チェックポイントがバスの後半となることを防ぐ．改良した手法を複数の回路に対して適用し，最大動作周波数の向上を図る．実験結果より STEP 回路を入れない場合と比較して，最大動作周波数を平均 1.71 倍に向上させることができた．改良前の手法と比較すると，最大動作周波数を平均 1.15 倍に向上させることができた．As process technologies advance, process and delay variation causes a complex timing design and in-situ timing error correction techniques are strongly required. Suspicious timing error prediction (STEP) predicts timing errors by monitoring checkpoints by STEP circuits (STEPCs) and how to insert checkpoints is very important. We have proposed a network-flow-based checkpoint insertion algorithm for STEP. However, our algorithm may ignore long paths and insert checkpoints near the output. In this paper, we improve how to ignore short paths and set labels by estimating path lengths. Then, we can ignore only short paths and insert checkpoints into near the center of all long paths. We evaluate our algorithm by applying it to four benchmark circuits. Experimental results show that our proposed algorithm realizes an average of 1.71X overclocking compared with just inserting no STEPC. Furthermore, our improved algorithm realizes an average of 1.15X overclocking compared with our original algorithm.

CiNii
DTMOSを用いたサブスレッショルド回路の高速化設計

福留祐治, 史又華, 戸川望, 宇佐美公良, 柳澤政生

研究報告システムとLSIの設計技術（SLDM） 2014 ( 21 ) 1 - 5 2014年11月

　概要を見る

サブスレッショルド領域で回路を動作させることで低電力化は実現されるが，同時に速度が劣化するトレードオフの関係にある．本稿ではサブスレッショルド領域において低電力で高速化を実現するため，DTMOS を用いたサブスレッシヨルド回路の高速化設計を行い，トランジスタレベルのシミュレーションの結果，30～45％高速化し，Vdd＝0.2Ｖ, 0.3Ｖにおいて平均 15％低エネルギー化したことを示す．Low power consumption is achieved by operating circuits in sub-threshold region. However, in sub-threshold region, the operating speed becomes slow, and the tradeoff between power and speed should be considered carefully. In this work, we present DTMOS implementations to realize high speed and low power in subthreshold region. Transistor level simulation results show that the operating speed can be improved by 30 %-45 %, and on average 15 % energy reduction can be achieved when Vdd ranges 0.2-0.3V.

CiNii
ハードウェアトロイに含まれるネットに着目したハードウェアトロイ検出手法

大屋優, 史又華, 柳澤政生, 戸川望

研究報告システムとLSIの設計技術（SLDM） 2014 ( 24 ) 1 - 6 2014年11月

　概要を見る

近年チップの製造をサードパーティに外注するようになり，ハードウェアトロイが挿入される可能性が高まってきた．特に設計段階では簡単にハードウェアトロイを挿入することができる．ゲートレベルのネットリストに対してハードウェアトロイ検出手法を適用する場合，我々は Golden ネットリストを持っておらず，挿入されているハードウェアトロイを活性化するという条件下でハードウェアトロイを検出する手法が存在するのみである．本稿では，Golden ネットリストが無く，ハードウェアトロイを活性化させなくてもハードウェアトロイを検出する手法として，低スイッチング確率のネット（LSLG ネットと呼ぶ）の検出を通じてハードウェアトロイを検出する手法を提案する LSLG ネットはネットリストに含まれるネットの数％であるにも関わらず，Trｕｓｔ-HUB の Abstraction Gate Level で公開されているハードウェアトロイが挿入されている全てのゲートレベルのネットリストに対して，ハードウェアトロイの一部を検出することに成功した．提案手法にかかる時間は高々十数分程度である．Recently, digital ICs are designed by outside vendors to reduce design costs in semiconductor industry. This circumstance introduces risks that malicious attackers implement Hardware Trojans (HTs) into ICs. HTs are easily inserted in particular during design phase, but HTs detection is too difficult during this phase. This is why we have to assume Golden Netlists and activation of HTs in previous researches. This paper proposes an HT detection method through detecting LSLG nets, which have low switching probabilities. Our approach does not assume Golden netlists nor activation of HTs. We succesfully find out that all HT-inserted gate-level netlists from Trust-HUB benchmarks include a small number of LSLG nets. It takes approximately ten minutes to detect LSLG nets in each benchmark.

CiNii
HDR-mcvを対象とした複数クロックドメインおよび複数電源電圧による低電力化高位合成手法

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

研究報告システムとLSIの設計技術（SLDM） 2014 ( 40 ) 1 - 6 2014年11月

　概要を見る

低電力かつ高速な LSI の設計へ向け，配線遅延を考慮しながら複数クロックドメイン，複数電源電圧を同時に適用可能な HDR-mcv および高位合成手法が提案された．従来手法はクロックおよび電圧をハドルと呼ぶ区画毎に割り当てるが，クロックツリー数の増加による消費エネルギーのオーバヘッドが無視できない．提案手法はクロックに同期する論理，および演算回路に対し独立に電圧を割り当てることで，クロックツリー数を増加せずに複数クロックドメインと複数電源電圧を同時適用する．計算機実験結果により，提案手法は従来の HDR-mcv アーキテクチャを対象とした高位合成アルゴリズムと比較し 50％程度消費エネルギーを削減し，最終的に従来のレジスタ分散型アーキテクチャと比較し提案手法は 60％程度消費エネルギーを削減できることを確認した．An HDR-mcv architecture, which integrates multiple supply voltages and multiple clock domains into high-level synthesis and enables us to estimate interconnection delay effects during high-level synthesis, has been proposed with the corresponding synthesis algorithm. They assign voltages and clock frequencies to huddles which are the partitions for interconnection delay estimation during high-level synthesis. However, the voltage and clock assignment may have some energy overheads due to the increased clock trees. In this paper, we propose a new HDR-mcv architecture in which supply voltages are assigned to functional logics and clock synchronization logics separately. Next, we propose a high-level synthesis algorithm for the architecture, which can assign clock frequen cies and supply voltages on the bases of the placement and energy informations. Experimental results show that the proposed method achieves 50% energy-saving compared with the conventional HDR-mcv architecture and 60% energy-saving compared with the existing high-level synthesis methods.

CiNii
可変パイプラインのローカルなパルス生成による低消費エネルギー化手法 (VLSI設計技術)

新井孝将, 史又華, 戸川望, 宇佐美公良, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 231 ) 7 - 12 2014年10月

　概要を見る

モバイル端末において性能向上による消費エネルギーの増加が問題となっており,様々な低消費エネルギー化手法が提案されている.その一つである可変パイプライン段数(Variable Stages Pipeline:VSP)では,LDS-cell(Latch D-FF Selector cell)という特殊なセルを用いてグリッチを緩和することができる.しかし,クロックがLowのときに発生するグリッチに対しては緩和できないという問題があった.本稿では既存の可変パイプライン段数手法に対し,LE(Low Energy)モード時にクロックゲーティングを適用し,ローカルなパルス生成によりデータパス上のグリッチを更に抑制し,消費エネルギーを削減する手法を提案する.実際に乗算器に提案手法を実装し,従来のVSPと比較して3.08%消費エネルギーを削減することができた.

CiNii
可変パイプラインのローカルなパルス生成による低消費エネルギー化手法 (画像工学)

新井孝将, 史又華, 戸川望, 宇佐美公良, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 233 ) 7 - 12 2014年10月

　概要を見る

モバイル端末において性能向上による消費エネルギーの増加が問題となっており,様々な低消費エネルギー化手法が提案されている.その一つである可変パイプライン段数(Variable Stages Pipeline:VSP)では,LDS-cell(Latch D-FF Selector cell)という特殊なセルを用いてグリッチを緩和することができる.しかし,クロックがLowのときに発生するグリッチに対しては緩和できないという問題があった.本稿では既存の可変パイプライン段数手法に対し,LE(Low Energy)モード時にクロックゲーティングを適用し,ローカルなパルス生成によりデータパス上のグリッチを更に抑制し,消費エネルギーを削減する手法を提案する.実際に乗算器に提案手法を実装し,従来のVSPと比較して3.08%消費エネルギーを削減することができた.

CiNii
可変パイプラインのローカルなパルス生成による低消費エネルギー化手法 (集積回路)

新井孝将, 史又華, 戸川望, 宇佐美公良, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 114 ( 232 ) 7 - 12 2014年10月

　概要を見る

モバイル端末において性能向上による消費エネルギーの増加が問題となっており,様々な低消費エネルギー化手法が提案されている.その一つである可変パイプライン段数(Variable Stages Pipeline:VSP)では,LDS-cell(Latch D-FF Selector cell)という特殊なセルを用いてグリッチを緩和することができる.しかし,クロックがLowのときに発生するグリッチに対しては緩和できないという問題があった.本稿では既存の可変パイプライン段数手法に対し,LE(Low Energy)モード時にクロックゲーティングを適用し,ローカルなパルス生成によりデータパス上のグリッチを更に抑制し,消費エネルギーを削減する手法を提案する.実際に乗算器に提案手法を実装し,従来のVSPと比較して3.08%消費エネルギーを削減することができた.

CiNii
可変パイプラインのローカルなパルス生成による低消費エネルギー化手法

新井孝将, 史又華, 戸川望, 宇佐美公良, 柳澤政生

研究報告システムとLSIの設計技術（SLDM） 2014 ( 2 ) 1 - 6 2014年09月

　概要を見る

モバイル端末において性能向上による消費エネルギーの増加が問題となっており，様々な低消費エネルギー化手法が提案されている．その一つである可変パイプライン段数 (Variable Stages Pipeline:VSP) では，LDS-cell (Latch D-FF Selector cell) という特殊なセルを用いてグリッチを緩和することができる．しかし，クロックが Low のときに発生するグリッチに対しては緩和できないという問題があった．本稿では既存の可変パイプライン段数手法に対し，LE(Low Energy) モード時にクロックゲーティングを適用し，ローカルなパルス生成によりデータパス上のグリッチを更に抑制し，消費エネルギーを削減する手法を提案する．実際に乗算器に提案手法を実装し，従来の VSP と比較して 3.08％消費エネルギーを削減することができた．The increase of energy consumption due to improved performance has become a problem in the mobile terminal, and various low energy design techniques have been proposed. Variable Stages Pipeline(VSP) technique is one of them, which can reduce glitches by using a special LDS-cell(Latch D-FF selector-cell). However, glitches that occur during the low clock phase will still be propagated to next stages. In this paper, we propose a method for variable stages pipeline designs by applying local pulse generation and clock gating in LE mode for further energy reduction. We implemented the proposed method to a multiplier and experimental results show that the energy is reduced by 3.08% when compared to conventional VSP.

CiNii
Suspicious Timing Error Prediction を用いた回路全体の遅延ばらつきに対するロバスト設計

吉田慎之介, 史又華, 柳澤政生, 戸川望

DAシンポジウム2014論文集 2014 61 - 66 2014年08月

CiNii
回路面積を考慮したSuspicious Timing Error Prediction回路の挿入位置決定手法

吉田慎之介, 史又華, 柳澤政生

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 27 416 - 421 2014年08月

CiNii
故障解析に耐性を持つラッチを利用したAES暗号回路 (VLSI設計技術)

史又華, 谷口寛彰, 戸川望, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 113 ( 454 ) 37 - 42 2014年03月

　概要を見る

暗号技術は複雑な数学的理論を安全性の根拠としているため安全性が高いとされている.しかし近年,暗号アルゴリズムに対してではなく,暗号回路そのものに攻撃を仕掛ける故障解析が脅威となってきている.故障解析にはレーザーや異常電圧やクロックグリッチが使用されるが,攻撃の容易さからクロックグリッチによる攻撃が注目されている.クロックグリッチによる故障を検出するためにはラウンド間での故障を検出する必要があるが,そのための実装方法として,回路を三重化して比較する空間冗長化や,同じ処理を2回行って比較する時間冗長化が存在する.前者は3倍以上の面積オーバーヘッドが存在し,後者は最大で2倍の時間オーバーヘッドが存在するという問題点がある.本稿ではラッチを用いたAES暗号回路を提案する.提案手法では小面積,高速でクロックグリッチによる故障解析に耐性を持たせることを可能にした.提案手法は,攻撃者にとって意味があるクロックグリッチにおいて,データレジスタにおける故障の検出率を100%とするとともに,データレジスタに一度故障が起きた場合でも最終的な暗号処理結果を100%正しくすることを可能にした.

CiNii
改良ランダムオーダースキャンによるセキュアスキャン設計とその評価 (VLSI設計技術)

大屋優, 跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 113 ( 454 ) 43 - 48 2014年03月

　概要を見る

大規模集積回路のテスト容易化設計の1つであるスキャンチェインを利用したスキャンテストが一般的に行われる.反面スキャンチェインを利用して,暗号回路の秘密鍵が解読されるなどのスキャンベース攻撃が問題となっている.スキャンチェインをスキャンベース攻撃から保護するために,セキュアスキャンアーキテクチャの必要性が高まってきた.セキュアスキャンアーキテクチャは,テスト性を保証すると共にスキャンベース攻撃からスキャンチェインを保護する必要がある.本稿では,セキュアスキャンアーキテクチャとして改良ランダムオーダースキャンを提案する.改良ランダムオーダースキャンは,ランダムオーダースキャンを改良したものであり,スキャンチェインの構造を動的に変化させ,スキャンベース攻撃からスキャンチェインを保護する.スキャンチェインを複数のサブチェインに分割し,イネーブル信号でスキャンアウトさせるサブチェインを次々と選択することで,スキャンチェインの構造が動的に変化する.改良ランダムオーダースキャンの安全性とテスト性を議論し,また計算核実験により面積オーバーヘッドが小さいセキュアスキャンアーキテクチャであることを示す.

CiNii
サブスレッショルド回路における遅延・エネルギーの温度依存性に関する実験および考察 (VLSI設計技術)

櫛田浩樹, 史又華, 戸川望, 宇佐美公良, 柳澤政生

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 113 ( 454 ) 147 - 151 2014年03月

　概要を見る

バッテリ一式の無線ネットワーク機器では,消費エネルギーの削減を重視するため,供給電圧を下げる設計手法が広く用いられる.サブスレッショルド回路においては,しきい値下の電圧で制御することで大幅なエネルギー削減を達成できるが,性能の低下や環境変動による遅延ばらつきの問題が生じる.本稿ではスーパーパイプラインを用いたサブスレッショルド乗算器を実装し,性能向上とリークエネルギー削減による全体の消費エネルギー削減を確認した.さらに,温度変動による回路の遅延・エネルギーの温度依存性について実験し,考察を行った.

CiNii
チェックポイント観測によるタイミングエラー予測手法 (ディペンダブルコンピューティングデザインガイア2013 : VLSI設計の新しい大地)

五十嵐博昭, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 113 ( 321 ) 39 - 44 2013年11月

　概要を見る

プロセス技術の微細化によりLSIのタイミング設計が難しくなっており,タイミングエラー対策手法の重要性が高まっている.既存のタイミングエラー検出手法はエラー訂正に再実行が必要であったり,複雑な構造を持つためタイミング設計が難しい.我々はより訂正コストが小さく簡単な構造を持つタイミングエラー対策手法としてSTEPを提案している.STEPではチェックポイントと呼ばれるパス中の観測点をチェックすることでタイミングエラー発生の可能性を検出する.STEPはタイミングエラー予測手法であるため誤検出が発生し,誤検出の削減が大きな課題である.本稿ではチェックポイントの最適化により誤検出を削減する手法を提案する.実験結果より,動作可能周波数が最大で2.4倍となり,スループットは最大で約45%向上した.

CiNii
HDR-mcdを対象としたクロックエネルギー優位な高位合成と実験評価 (ディペンダブルコンピューティングデザインガイア2013 : VLSI設計の新しい大地)

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 113 ( 321 ) 263 - 268 2013年11月

　概要を見る

LSI全体に占めるクロック信号によるエネルギー消費の割合は大きく,マルチクロックドメイン,クロックゲーティングなどが提案された.本稿では,マルチクロックドメイン指向HDR-mcdアーキテクチャを対象としたクロックエネルギー削減に向けた高位合成手法を提案する.提案手法は1クロック内の通信が保障されるハドルと呼ぶ区画を利用し,配線遅延の影響を予測,異なるクロック間の同期を考慮した高位合成を実現する.クロックはハドル毎に割り当て,資源制約と時間制約を満たす範囲で低い周波数のクロックを割り当てることで低電力化する.計算機実験により提案手法はクロックゲーティングのみを考慮した従来手法と比較し,クロックツリーのエネルギーを30%程度削減でき,全体のエネルギーを25%程度削減できることを確認した.

CiNii
チェックポイント観測によるタイミングエラー予測手法 (VLSI設計技術デザインガイア2013 : VLSI設計の新しい大地)

五十嵐博昭, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 113 ( 320 ) 39 - 44 2013年11月

　概要を見る

プロセス技術の微細化によりLSIのタイミング設計が難しくなっており,タイミングエラー対策手法の重要性が高まっている.既存のタイミングエラー検出手法はエラー訂正に再実行が必要であったり,複雑な構造を持つためタイミング設計が難しい.我々はより訂正コストが小さく簡単な構造を持つタイミングエラー対策手法としてSTEPを提案している.STEPではチェックポイントと呼ばれるパス中の観測点をチェックすることでタイミングエラー発生の可能性を検出する.STEPはタイミングエラー予測手法であるため誤検出が発生し,誤検出の削減が大きな課題である.本稿ではチェックポイントの最適化により誤検出を削減する手法を提案する.実験結果より,動作可能周波数が最大で2.4倍となり,スループットは最大で約45%向上した.

CiNii
HDR-mcdを対象としたクロックエネルギー優位な高位合成と実験評価 (VLSI設計技術デザインガイア2013 : VLSI設計の新しい大地)

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 113 ( 320 ) 263 - 268 2013年11月

　概要を見る

LSI全体に占めるクロック信号によるエネルギー消費の割合は大きく,マルチクロックドメイン,クロックゲーティングなどが提案された.本稿では,マルチクロックドメイン指向HDR-mcdアーキテクチャを対象としたクロックエネルギー削減に向けた高位合成手法を提案する.提案手法は1クロック内の通信が保障されるハドルと呼ぶ区画を利用し,配線遅延の影響を予測,異なるクロック間の同期を考慮した高位合成を実現する.クロックはハドル毎に割り当て,資源制約と時間制約を満たす範囲で低い周波数のクロックを割り当てることで低電力化する.計算機実験により提案手法はクロックゲーティングのみを考慮した従来手法と比較し,クロックツリーのエネルギーを30%程度削減でき,全体のエネルギーを25%程度削減できることを確認した.

CiNii
チェックポイント観測によるタイミングエラー予測手法

五十嵐博昭, 史又華, 柳澤政生, 戸川望

研究報告システムLSI設計技術（SLDM） 2013 ( 8 ) 1 - 6 2013年11月

　概要を見る

プロセス技術の微細化により LSI のタイミング設計が難しくなっており，タイミングエラー対策手法の重要性が高まっている．既存のタイミングエラー検出手法はエラー訂正に再実行が必要であったり，複雑な構造を持つためタイミング設計が難しい我々はより訂正コストが小さく簡単な構造を持つタイミングエラー対策手法として STEP を提案している．STEP ではチェックポイントと呼ばれるパス中の観測点をチェックすることでタイミングエラー発生の可能性を検出する．STEP はタイミングエラー予測手法であるため誤検出が発生し，誤検出の削減が大きな課題である．本稿ではチェックポイントの最適化により誤検出を削減する手法を提案する．実験結果より，動作可能周波数が最大で 24 倍となり，スループットは最大で約 45％向上した．Due to advance process technologies, timing design of LSIs has become more difficult and the importance of timing error countermeasure techniques is increasing as well. Existing timing error detection/correction methods have difficulties in timing design since they have complex structure. Furthermore, their error correction is realized by re-run operation which results in low throughput. We have proposed a suspicious timing error prediction method (STEP method) which predicts timing error and corrects it with simple structure. STEP is based on checking timing errors by observing several checkpoints on signal paths. Since STEP is a timing error prediction method, we may have false positives and reduction of them is one of the largest problems. In this paper, we propose a method to reduce the false positives to optimize the checkpoints. The experimental results show that an operational frequency is increased by up to 2.4 times and its throughput is improved by up to 45%.

CiNii
HDR-mcdを対象としたクロックエネルギー優位な高位合成と実験評価

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

研究報告システムLSI設計技術（SLDM） 2013 ( 47 ) 1 - 6 2013年11月

　概要を見る

LSI 全体に占めるクロック信号によるエネルギー消費の割合は大きく，マルチクロックドメイン，クロックゲーテイングなどが提案された．本稿では，マルチクロックドメイン指向 HDR-mcd アーキテクチャを対象としたクロックエネルギー削減に向けた高位合成手法を提案する．提案手法は 1 クロック内の通信が保障されるハドルと呼ぶ区画を利用し，配線遅延の影響を予測，異なるクロック間の同期を考慮した高位合成を実現する．クロックはハドル毎に割り当て，資源制約と時間制約を満たす範囲で低い周波数のクロックを割り当てることで低電力化する．計算機実験により提案手法はクロックゲーテイングのみを考慮した従来手法と比較し，クロックツリーのエネルギーを 30％程度削減でき，全体のエネルギーを 25％程度削減できることを確認した．In this paper, we propose a clock energy-efficient high-level synthesis algorithm for HDR-mcd architecture. In HDR-mcd, an entire chip is divided into several huddles. Huddles can realize synchronization between different clock domains in which interconnection delay is required and should be considered during high-level synthesis. In our iterative improvement based algorithm, low-frequency clocks are assigned to non-critical huddlesunder resource and latency constraints for energy efficiency improvement. Experimental results show that the proposed method achieves 20% clock energy-saving and 10% total energy-saving compared with the existing methods considering clock gating.

CiNii
A-3-5 トロイパスによるハードウェアトロイ検出の一手法(A-3.VLSI設計技術,一般セッション)

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会ソサイエティ大会講演論文集 2013 48 - 48 2013年09月

CiNii
A-3-6 故障差分解析に耐性を持つデータ修復可能なAES暗号回路(A-3.VLSI設計技術,一般セッション)

谷口寛彰, 史又華, 戸川望, 柳澤政生

電子情報通信学会ソサイエティ大会講演論文集 2013 49 - 49 2013年09月

CiNii
ランダムオーダースキャンによるセキュアスキャン設計

跡部悠太, 史又華, 柳澤政生

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 26 448 - 453 2013年07月

CiNii
HDR-mcdを対象としたマルチクロックドメイン指向の低電力化高位合成手法

阿部晋矢, 史又華, 宇佐美公良

回路とシステムワークショップ論文集 Workshop on Circuits and Systems 26 185 - 190 2013年07月

CiNii
フロアプランを考慮したマルチクロックドメイン指向の低電力化高位合成手法 (コンピュータシステム組込み技術とネットワークに関するワークショップETNET2013)

阿部晋矢, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 : 信学技報 112 ( 481 ) 115 - 120 2013年03月

　概要を見る

本稿では,マルチクロックドメイン適用へ向け,HDRアーキテクチャを拡張したHDR-mcdを提案する.続いてHDR-mcdを対象にマルチクロックドメイン指向の低電力化高位合成を提案する.提案手法はフロアプラン情報をフィードバックし,反復改良する合成フローを取る.その際,1クロック内の通信が保障されるパドルと呼ぶ区画を利用し,配線遅延の影響を予測,異なるクロック間の同期を考慮した高位合成を実現する.クロックはパドル毎に割り当て,資源制約と時間制約を満たす範囲で低い周波数のクロックを割り当てることで低電力化する.計算機実験により提案手法は従来の単一クロックのみを考慮したレジスタ分散型アーキテクチャと比較し25%程度消費エネルギーを削減できることを確認した.

CiNii
フロアプランを考慮したマルチクロックドメイン指向の低電力化高位合成手法

阿部晋矢, 史又華, 柳澤政生, 戸川望

研究報告システムLSI設計技術（SLDM） 2013 ( 20 ) 1 - 6 2013年03月

　概要を見る

本稿では，マルチクロックドメイン適用へ向け，HDRアーキテクチャを拡張したHDR-mcdを提案する．続いてHDR-mcdを対象にマルチクロックドメイン指向の低電力化高位合成を提案する．提案手法はフロアプラン情報をフィードバックし，反復改良する合成フローを取る．その際，1クロック内の通信が保障されるハドルと呼ぶ区画を利用し，配線遅延の影響を予測，異なるクロック間の同期を考慮した高位合成を実現する．クロックはハドル毎に割り当て，資源制約と時間制約を満たす範囲で低い周波数のクロックを割り当てることで低電力化する．計算機実験により提案手法は従来の単一クロックのみを考慮したレジスタ分散型アーキテクチャと比較し25%程度消費エネルギーを削減できることを確認した．

CiNii
フロアプランを考慮したマルチクロックドメイン指向の低電力化高位合成手法

阿部晋矢, 史又華, 柳澤政生, 戸川望

研究報告組込みシステム（EMB） 2013 ( 20 ) 1 - 6 2013年03月

　概要を見る

本稿では，マルチクロックドメイン適用へ向け，HDRアーキテクチャを拡張したHDR-mcdを提案する．続いてHDR-mcdを対象にマルチクロックドメイン指向の低電力化高位合成を提案する．提案手法はフロアプラン情報をフィードバックし，反復改良する合成フローを取る．その際，1クロック内の通信が保障されるハドルと呼ぶ区画を利用し，配線遅延の影響を予測，異なるクロック間の同期を考慮した高位合成を実現する．クロックはハドル毎に割り当て，資源制約と時間制約を満たす範囲で低い周波数のクロックを割り当てることで低電力化する．計算機実験により提案手法は従来の単一クロックのみを考慮したレジスタ分散型アーキテクチャと比較し25%程度消費エネルギーを削減できることを確認した．

CiNii
フロアプランを考慮したマルチクロックドメイン指向の低電力化高位合成手法(動作合成,組込み技術とネットワークに関するワークショップETNET2013)

阿部晋矢, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. CPSY, コンピュータシステム 112 ( 481 ) 115 - 120 2013年03月

　概要を見る

本稿では,マルチクロックドメイン適用へ向け,HDRアーキテクチャを拡張したHDR-mcdを提案する.続いてHDR-mcdを対象にマルチクロックドメイン指向の低電力化高位合成を提案する.提案手法はフロアプラン情報をフィードバックし,反復改良する合成フローを取る.その際,1クロック内の通信が保障されるパドルと呼ぶ区画を利用し,配線遅延の影響を予測,異なるクロック間の同期を考慮した高位合成を実現する.クロックはパドル毎に割り当て,資源制約と時間制約を満たす範囲で低い周波数のクロックを割り当てることで低電力化する.計算機実験により提案手法は従来の単一クロックのみを考慮したレジスタ分散型アーキテクチャと比較し25%程度消費エネルギーを削減できることを確認した.

CiNii
鍵ベース構成のState Dependent Scan Flip-Flopを用いたセキュアスキャンアーキテクチャ

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. VLD, VLSI設計技術 112 ( 320 ) 45 - 50 2012年11月

　概要を見る

暗号LSIは機密操作を行うために使用されるため,それ自体は安全である必要がある.スキャンテストは高い故障検出率を持つテスト容易化手法であり,近年のLSIの大規模化によって重要性が高まっているが,様々な暗号回路へのスキャンベース攻撃手法が報告されている.そこで,テスト容易性を保ちスキャンベース攻撃に対して高い安全性を持つセキュアスキャンアーキテクチャとしてSDSFF(State Dependent Scan Flip-Flop)が提案された.SDSFFでは,スキャンフリップフロップに対して付加するラッチの値を更新するタイミングが重要な問題となる.本稿では,オンラインテストを可能にする更新タイミングを提案する.提案する更新タイミングはスキャンチェイン上の任意のフリップフロップと回路設計時に決定した値との比較結果によって決定される.RSA暗号回路,AES暗号回路及びDES暗号回路に提案手法を実装し,評価を行った.実験結果より,様々な暗号回路において有効であることが示せた.

CiNii
SAAV:AVHDRアーキテクチャを対象として動的複数電源電圧指向の低電力化高位合成手法

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. VLD, VLSI設計技術 112 ( 320 ) 135 - 140 2012年11月

　概要を見る

動的複数電源電圧と配線遅延を高位合成に統合するプラットフォームとして,Adaptive Voltages Huddle-basedDistributed-Registerアーキテクチャ(AVHDR)およびAVHDRアーキテクチャを対象とした高位合成手法が提案された.従来手法はフロアプラン情報をフィードバックし,反復改良する合成フローを取る.従来手法では収束性を改善するため,仮想面積ベースの反復改良を採用している.しかし,仮想面積は面積と配線遅延のオーバヘッドを伴う可能性がある.本稿では反復が進むごとにオーバヘッドを削減する仮想面積調整を提案する.計算機実験結果により,提案手法は従来のAVHDRアーキテクチャを対象とした高位合成アルゴリズムと比較し最大6.2%消費エネルギーを削減し,最終的に従来の高位合成アルゴリズムと比較し最大65.7%消費エネルギーを削減できることを確認した.

CiNii
鍵ベース構成のState Dependent Scan Flip-Flopを用いたセキュアスキャンアーキテクチャ

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. DC, ディペンダブルコンピューティング : IEICE technical report 112 ( 321 ) 45 - 50 2012年11月

　概要を見る

暗号LSIは機密操作を行うために使用されるため,それ自体は安全である必要がある.スキャンテストは高い故障検出率を持つテスト容易化手法であり,近年のLSIの大規模化によって重要性が高まっているが,様々な暗号回路へのスキャンベース攻撃手法が報告されている.そこで,テスト容易性を保ちスキャンベース攻撃に対して高い安全性を持つセキュアスキャンアーキテクチャとしてSDSFF(State Dependent Scan Flip-Flop)が提案された.SDSFFでは,スキャンフリップフロップに対して付加するラッチの値を更新するタイミングが重要な問題となる.本稿では,オンラインテストを可能にする更新タイミングを提案する.提案する更新タイミングはスキャンチェイン上の任意のフリップフロップと回路設計時に決定した値との比較結果によって決定される.RSA暗号回路,AES暗号回路及びDES暗号回路に提案手法を実装し,評価を行った.実験結果より,様々な暗号回路において有効であることが示せた.

CiNii
SAAV:AVHDRアーキテクチャを対象として動的複数電源電圧指向の低電力化高位合成手法

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. DC, ディペンダブルコンピューティング : IEICE technical report 112 ( 321 ) 135 - 140 2012年11月

　概要を見る

動的複数電源電圧と配線遅延を高位合成に統合するプラットフォームとして,Adaptive Voltages Huddle-basedDistributed-Registerアーキテクチャ(AVHDR)およびAVHDRアーキテクチャを対象とした高位合成手法が提案された.従来手法はフロアプラン情報をフィードバックし,反復改良する合成フローを取る.従来手法では収束性を改善するため,仮想面積ベースの反復改良を採用している.しかし,仮想面積は面積と配線遅延のオーバヘッドを伴う可能性がある.本稿では反復が進むごとにオーバヘッドを削減する仮想面積調整を提案する.計算機実験結果により,提案手法は従来のAVHDRアーキテクチャを対象とした高位合成アルゴリズムと比較し最大6.2%消費エネルギーを削減し,最終的に従来の高位合成アルゴリズムと比較し最大65.7%消費エネルギーを削減できることを確認した.

CiNii
鍵ベース構成のState Dependent Scan Flip-Flopを用いたセキュアスキャンアーキテクチャ

跡部悠太, 史又華, 柳澤政生, 戸川望

研究報告システムLSI設計技術（SLDM） 2012 ( 9 ) 1 - 6 2012年11月

　概要を見る

暗号 LSI は機密操作を行うために使用されるため，それ自体は安全である必要があるスキャンテストは高い故障検出率を持つテスト容易化手法であり，近年の LSI の大規模化によって重要性が高まっているが，様々な暗号回路へのスキャンベース攻撃手法が報告されているそこで，テスト容易性を保ちスキャンベース攻撃に対して高い安全性を持つセキュアスキャンアーキテクチャとして SDSFF (State Dependent Scan Flip-Flop) が提案された SDSFF では，スキャンフリップフロップに対して付加するラッチの値を更新するタイミングが重要な問題となる本稿では，オンラインテストを可能にする更新タイミングを提案する提案する更新タイミングはスキャンチェイン上の任意のフリップフロップと回路設計時に決定した値との比較結果によって決定される RSA 暗号回路， AES 暗号回路及び DES 暗号回路に提案手法を実装し，評価を行った実験結果より，様々な暗号回路において有効であることが示せた．Secure cryptographic LSIs is intensively used in order to perform confidential operation. Scan test has become the most widely adopted test technique to ensure the correctness of manufactured LSIs, in which through the scan chains the internal states of the circuit under test (CUT) can be controlled and observed externally. However, scan chains using scan test might carry the risk of being misused for secret information leakage. Therefore a secure scan architecture using SDSFF(State Dependent Scan Flip-Flop) against scan-based attack which achieves high security without compromising the testability is proposed. In SDSFF, there is a problem which is the update timing of the latch which added to the scan FF. In this paper, we propose the update timing to online test without sacrificing the security. In our method, the latches are updated by result which the value of KEY which decided when designed compared with any FFs in a scan chain. We show that by using proposed method, neither the secret key nor the testability of vairous crypto circuits implementation is compromised, and the effectiveness of the proposed method. Experimental results on various crypto implementations show the effectiveness of the proposed method.

CiNii
SAAV:AVHDRアーキテクチャを対象とした動的複数電源電圧指向の低電力化高位合成手法

阿部晋矢, 史又華, 宇佐美公良, 柳澤政生, 戸川望

研究報告システムLSI設計技術（SLDM） 2012 ( 24 ) 1 - 6 2012年11月

　概要を見る

動的複数電源電圧と配線遅延を高位合成に統合するプラットフォームとして， Adaptive Voltages Huddle-based Distributed-Register アーキテクチャ (AVHDR) および AVHDR アーキテクチャを対象とした高位合成手法が提案された．従来手法はフロアプラン情報をフィードバックし，反復改良する合成フローを取る従来手法では収束性を改善するため，仮想面積ベースの反復改良を採用している．しかし，仮想面積は面積と配線遅延のオーバヘッドを伴う可能性がある．本稿では反復が進むごとにオーバヘッドを削減する仮想面積調整を提案する．計算機実験結果により，提案手法は従来の AVHDR アーキテクチャを対象とした高位合成アルゴリズムと比較し最大 6.2％消費エネルギーを削減し，最終的に従来の高位合成アルゴリズムと比較し最大 65.7％消費エネルギーを削減できることを確認した．An adaptive voltage huddle-based distributed-register architecture (AVHDR architecture), which integrates dynamic multiple supply voltages and interconnection delays into high-level synthesis, and a synthesis algorithm for AVHDR architectures have been proposed. This algorithm is based on iterative improvement of scheduling/binding and floorplanning and can converge without oscillation by using virtual-area-based iterative refinement flow. However, virtual areas may have some area and interconnection delay overheads. In this paper, we propose virtual area adaptation which relaxes these overheads as the iteration proceeds. Experimental results show that our algorithm achieves 6.2% energy saving compared with conventional algorithm for AVHDR architectures and 65.7% energy saving compared with conventional algorithms.

CiNii
鍵ベース構成のState Dependent Scan Flip-Flopを用いたセキュアスキャンアーキテクチャのRSA暗号回路への実装 (集積回路)

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告 : 信学技報 112 ( 247 ) 95 - 100 2012年10月

　概要を見る

スキャンテストは高い故障検出率を持ち,一般的に使われるテスト容易化設計技術である.しかし,スキャンテストで用いられるスキャンチェインを通して暗号LSIから秘密鍵が解読できる可能性が指摘されている.そこで,テスト容易性を保ちスキャンベース攻撃に対して高い安全性を持つセキュアスキャンアーキテクチャとしてSDSFF(State Dependent Scan Flip-Flop)が提案された.SDSFFでは,スキャンフリップフロップに対して付加するラッチの値を更新するタイミングが重要な問題となる.本稿では,オンラインテストを可能にする更新タイミングを提案する.提案する更新タイミングはスキャンチェイン上の任意のフリップフロップと回路設計時に決定した値との比較結果によって決定される.RSA暗号回路に提案するセキュアスキャンアーキテクチャを実装し,評価を行った.実験結果より,SDSFFを100個実装した場合面積オーバーヘッドは高々0.555%であり,従来手法よりも小さい面積オーバーヘッドであることがわかった.

CiNii
鍵ベース構成の State Dependent Scan Flip-Flop を用いたセキュアスキャンアーキテクチャのRSA暗号回路への実装

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. ICD, 集積回路 112 ( 247 ) 95 - 100 2012年10月

CiNii
鍵ベース構成の State Dependent Scan Flip-Flop を用いたセキュアスキャンアーキテクチャのRSA暗号回路への実装

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. SIP, 信号処理 : IEICE technical report 112 ( 246 ) 95 - 100 2012年10月

　概要を見る

スキャンテストは高い故障検出率を持ち,一般的に使われるテスト容易化設計技術である.しかし,スキャンテストで用いられるスキャンチェインを通して暗号LSIから秘密鍵が解読できる可能性が指摘されている.そこで,テスト容易性を保ちスキャンベース攻撃に対して高い安全性を持つセキュアスキャンアーキテクチャとしてSDSFF(State Dependent Scan Flip-Flop)が提案された.SDSFFでは,スキャンフリップフロップに対して付加するラッチの値を更新するタイミングが重要な問題となる.本稿では,オンラインテストを可能にする更新タイミングを提案する.提案する更新タイミングはスキャンチェイン上の任意のフリップフロップと回路設計時に決定した値との比較結果によって決定される.RSA暗号回路に提案するセキュアスキャンアーキテクチャを実装し,評価を行った.実験結果より,SDSFFを100個実装した場合面積オーバーヘッドは高々0.555%であり,従来手法よりも小さい面積オーバーヘッドであることがわかった.

CiNii
鍵ベース構成の State Dependent Scan Flip-Flop を用いたセキュアスキャンアーキテクチャのRSA暗号回路への実装

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. VLD, VLSI設計技術 112 ( 245 ) 95 - 100 2012年10月

　概要を見る

スキャンテストは高い故障検出率を持ち,一般的に使われるテスト容易化設計技術である.しかし,スキャンテストで用いられるスキャンチェインを通して暗号LSIから秘密鍵が解読できる可能性が指摘されている.そこで,テスト容易性を保ちスキャンベース攻撃に対して高い安全性を持つセキュアスキャンアーキテクチャとしてSDSFF(State Dependent Scan Flip-Flop)が提案された.SDSFFでは,スキャンフリップフロップに対して付加するラッチの値を更新するタイミングが重要な問題となる.本稿では,オンラインテストを可能にする更新タイミングを提案する.提案する更新タイミングはスキャンチェイン上の任意のフリップフロップと回路設計時に決定した値との比較結果によって決定される.RSA暗号回路に提案するセキュアスキャンアーキテクチャを実装し,評価を行った.実験結果より,SDSFFを100個実装した場合面積オーバーヘッドは高々0.555%であり,従来手法よりも小さい面積オーバーヘッドであることがわかった.

CiNii
鍵ベース構成の State Dependent Scan Flip-Flop を用いたセキュアスキャンアーキテクチャのRSA暗号回路への実装

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. IE, 画像工学 112 ( 248 ) 95 - 100 2012年10月

　概要を見る

スキャンテストは高い故障検出率を持ち,一般的に使われるテスト容易化設計技術である.しかし,スキャンテストで用いられるスキャンチェインを通して暗号LSIから秘密鍵が解読できる可能性が指摘されている.そこで,テスト容易性を保ちスキャンベース攻撃に対して高い安全性を持つセキュアスキャンアーキテクチャとしてSDSFF(State Dependent Scan Flip-Flop)が提案された.SDSFFでは,スキャンフリップフロップに対して付加するラッチの値を更新するタイミングが重要な問題となる.本稿では,オンラインテストを可能にする更新タイミングを提案する.提案する更新タイミングはスキャンチェイン上の任意のフリップフロップと回路設計時に決定した値との比較結果によって決定される.RSA暗号回路に提案するセキュアスキャンアーキテクチャを実装し,評価を行った.実験結果より,SDSFFを100個実装した場合面積オーバーヘッドは高々0.555%であり,従来手法よりも小さい面積オーバーヘッドであることがわかった.

CiNii
鍵ベース構成のState Dependent Scan Flip-Flopを用いたセキュアスキャンアーキテクチャのRSA暗号回路への実装

跡部悠太, 史又華, 柳澤政生, 戸川望

研究報告システムLSI設計技術（SLDM） 2012 ( 18 ) 1 - 6 2012年10月

　概要を見る

スキャンテストは高い故障検出率を持ち，一般的に使われるテスト容易化設計技術である．しかし，スキャンテストで用いられるスキャンチェインを通して暗号 LSI から秘密鍵が解読できる可能性が指摘されている．そこで，テスト容易性を保ちスキャンベース攻撃に対して高い安全性を持つセキュアスキャンアーキテクチャとして SDSFF (State Dependent Scan Flip-Flop) が提案された． SDSFF では，スキャンフリップフロップに対して付加するラッチの値を更新するタイミングが重要な問題となる．本稿では，オンラインテストを可能にする更新タイミングを提案する．提案する更新タイミングはスキャンチェイン上の任意のフリップフロップと回路設計時に決定した値との比較結果によって決定される． RSA 暗号回路に提案するセキュアスキャンアーキテクチャを実装し，評価を行った．実験結果より， SDSFF を 100 個実装した場合面積オーバーヘッドは高々 0.555％であり，従来手法よりも小さい面積オーバーヘッドであることがわかった．Scan test is one of the useful design for testability techniques, which can detect circuit failure efficiently. However, it has been reported that it's possible to retrieve secret keys from cryptographic LSIs through scan chains. Therefore a secure scan architecture using SDSFF (State Dependent Scan Flip-Flop) against scan-based attack which achieves high security without compromising the testability is proposed. In SDSFF, there is a problem which is the update timing of the latch which added to the scan FF. In this paper, we propose the update timing to online test without sacrificing the security. In our method, the latches are updated by result which the value of KEY which decided when designed compared with any FFs in a scan chain. We show that by using proposed method, neither the secret key nor the testability of an RSA circuit implementation is compromised, and the effectiveness of the proposed method According the result, even with 100 SDSFFs, the introduced area overhead is 0.555% which less than the conventional method.

CiNii
A-3-4 クロックの立下りを利用した耐故障攻撃AES暗号回路(A-3.VLSI設計技術,一般セッション)

五十嵐博昭, 史又華, 柳澤政生, 戸川望

電子情報通信学会ソサイエティ大会講演論文集 2012 51 - 51 2012年08月

CiNii
A-3-5 Feedback付きState Dependent Scan Flip-Flopを用いたセキュアスキャンアーキテクチャ(A-3.VLSI設計技術,一般セッション)

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会ソサイエティ大会講演論文集 2012 52 - 52 2012年08月

CiNii
State Dependent Scan Flip Flop を用いたRSA暗号回路へのセキュアスキャンンアーキテクチャの実装

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. SIP, 信号処理 : IEICE technical report 112 ( 115 ) 115 - 120 2012年06月

　概要を見る

代表的なテスト容易化設計であるスキャンテストは,LSI内部のFF(フリップフロップ)を直列に接続し,外部から自由に制御,観測でき,効率よく故障検出をすることができる.一方,スキャンテストで用いられるスキャンチェインを使用し,暗号LSIの秘密鍵を解読するスキャンベース攻撃が注目されている.一般的にテスト容易性とセキュリティは相反する性質であるが,それらを両立させる回路設計が必要である.本稿では,スキャンテストの利点であるテスト容易性を持ち,スキャンベース攻撃に対するセキュアスキャンアーキテクチャを提案する.提案手法では,スキャンチェイン中の任意のFFにラッチを付け加えることで,過去のFFの値を利用し,スキャンデータを攻撃者に解読不可能なデータに変化させる.FFの値が変化することで,スキャンデータを動的に変化させることが可能である.攻撃者には解読不可能なデータであっても,テスト者は拡張回路の構造を知っているため,通常のスキャンテストと同様のテストが可能である.RSA暗号回路に提案するセキュアスキャンアーキテクチャを実装し,評価を行った.

CiNii
State Dependent Scan Flip Flop を用いたRSA暗号回路へのセキュアスキャンンアーキテクチャの実装

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. MSS, システム数理と応用 : IEICE technical report 112 ( 116 ) 115 - 120 2012年06月

　概要を見る

代表的なテスト容易化設計であるスキャンテストは,LSI内部のFF(フリップフロップ)を直列に接続し,外部から自由に制御,観測でき,効率よく故障検出をすることができる.一方,スキャンテストで用いられるスキャンチェインを使用し,暗号LSIの秘密鍵を解読するスキャンベース攻撃が注目されている.一般的にテスト容易性とセキュリティは相反する性質であるが,それらを両立させる回路設計が必要である.本稿では,スキャンテストの利点であるテスト容易性を持ち,スキャンベース攻撃に対するセキュアスキャンアーキテクチャを提案する.提案手法では,スキャンチェイン中の任意のFFにラッチを付け加えることで,過去のFFの値を利用し,スキャンデータを攻撃者に解読不可能なデータに変化させる.FFの値が変化することで,スキャンデータを動的に変化させることが可能である.攻撃者には解読不可能なデータであっても,テスト者は拡張回路の構造を知っているため,通常のスキャンテストと同様のテストが可能である.RSA暗号回路に提案するセキュアスキャンアーキテクチャを実装し,評価を行った.

CiNii
State Dependent Scan Flip Flop を用いたRSA暗号回路へのセキュアスキャンンアーキテクチャの実装

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. CAS, 回路とシステム 112 ( 113 ) 115 - 120 2012年06月

　概要を見る

代表的なテスト容易化設計であるスキャンテストは,LSI内部のFF(フリップフロップ)を直列に接続し,外部から自由に制御,観測でき,効率よく故障検出をすることができる.一方,スキャンテストで用いられるスキャンチェインを使用し,暗号LSIの秘密鍵を解読するスキャンベース攻撃が注目されている.一般的にテスト容易性とセキュリティは相反する性質であるが,それらを両立させる回路設計が必要である.本稿では,スキャンテストの利点であるテスト容易性を持ち,スキャンベース攻撃に対するセキュアスキャンアーキテクチャを提案する.提案手法では,スキャンチェイン中の任意のFFにラッチを付け加えることで,過去のFFの値を利用し,スキャンデータを攻撃者に解読不可能なデータに変化させる.FFの値が変化することで,スキャンデータを動的に変化させることが可能である.攻撃者には解読不可能なデータであっても,テスト者は拡張回路の構造を知っているため,通常のスキャンテストと同様のテストが可能である.RSA暗号回路に提案するセキュアスキャンアーキテクチャを実装し,評価を行った.

CiNii
State Dependent Scan Flip Flop を用いたRSA暗号回路へのセキュアスキャンンアーキテクチャの実装

跡部悠太, 史又華, 柳澤政生, 戸川望

電子情報通信学会技術研究報告. VLD, VLSI設計技術 112 ( 114 ) 115 - 120 2012年06月

　概要を見る

代表的なテスト容易化設計であるスキャンテストは,LSI内部のFF(フリップフロップ)を直列に接続し,外部から自由に制御,観測でき,効率よく故障検出をすることができる.一方,スキャンテストで用いられるスキャンチェインを使用し,暗号LSIの秘密鍵を解読するスキャンベース攻撃が注目されている.一般的にテスト容易性とセキュリティは相反する性質であるが,それらを両立させる回路設計が必要である.本稿では,スキャンテストの利点であるテスト容易性を持ち,スキャンベース攻撃に対するセキュアスキャンアーキテクチャを提案する.提案手法では,スキャンチェイン中の任意のFFにラッチを付け加えることで,過去のFFの値を利用し,スキャンデータを攻撃者に解読不可能なデータに変化させる.FFの値が変化することで,スキャンデータを動的に変化させることが可能である.攻撃者には解読不可能なデータであっても,テスト者は拡張回路の構造を知っているため,通常のスキャンテストと同様のテストが可能である.RSA暗号回路に提案するセキュアスキャンアーキテクチャを実装し,評価を行った.

CiNii
スクラッチパッドメモリとコード配置最適化による低消費エネルギーASIP合成手法

嶋田吉倫, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. VLD, VLSI設計技術 110 ( 432 ) 25 - 30 2011年02月

　概要を見る

本稿ではVLIW型ASIPを対象としたハードウェア/ソフトウェア(HW/SW)ASIP協調合成システムSPADESにおける消費エネルギー削減手法を提案する.ASIPにおいて命令メモリが占める消費エネルギーの割合は大きく,命令メモリの消費エネルギー削減が課題となっている.そこで我々は,SPADESを対象としたスクラッチパッドメモリアーキテクチャと,コード配置最適化手法を提案する.提案するスクラッチパッドメモリアーキテクチャは,プログラムカウンタによりスクラッチパッドメモリへ配置するデータを判別する.コード配置最適化手法は,アプリケーションCFGから消費エネルギー最小となるコード配置とスクラッチパッドメモリのサイズを決定する.これにより命令メモリのアクセス数を削減し,消費エネルギーを削減することができる.計算機実験により,メモリを含むプロセッサ全体で平均47.9%の消費エネルギー削減を確認した.

CiNii
暗号回路における動的に構造変化するセキュアスキャンアーキテクチャ

跡部浩士, 奈良竜太, 史又華, 戸川望, 柳澤政生, 大附辰夫

情報処理学会研究報告システムLSI設計技術（SLDM） 2008 ( 111 ) 55 - 59 2008年11月

　概要を見る

スキャンテストはスキャンチェインを用いた手法で一般的かつ強力なテスト手法である．しかし，スキャンチェインは外部から回路内部の情報を取得できるため，暗号回路においては有効な攻撃手段となりえる．本稿ではスキャンベース攻撃に対するセキュアスキャンアーキテクチャを提案する．スキャンチェイン内のランダムな場所にインバータを挿入し，スキャンチェインの構造を複雑にする防御手法が提案されているが，回路設計時にインバータを挿入する場所が固定されてしまうため，その特徴を利用し攻撃される可能性がある．したがって，設計した後にもスキャンチェインの構造を動的に変化させる必要があると考えられる．我々はラッチを用いて過去の FF の状態を利用することで次のスキャン FF への出力を変化させる状態依存スキャン FF (SDSFF) を提案する．このスキャンＦＦを用いることでスキャンチェインの構造を動的に変化させることが可能であり，コントローラを必要としないため面積オーバーヘッドも少ない AES 暗号回路に提案手法を実装し，評価を行った．Scan test is a powerful and popular test technique because it can control and observe the internal states of the circuit under test. However, scan chains would be used to discover the internals of crypto hardware, which presents a significant security risk of information leakage. An interesting design-for-test technique by inserting in verters into the internal scan chains to complicate the scan structure has been recently presented. Unfortunately, it still carries the potential of being attacked through statistical analysis of the information scanned out from chips. Therefore, in this paper we propose secure scan architecture, called dynamic variable secure scan, against scan-based side channel attack. The modified scan flip-flops are state-dependent, which could cause the output of each SDSFF to be inverted or not so as to make it more difficult to discover the internal scan architecture. We made an analysis on an AES implementation to show the effectiveness of the proposed method and discussed how our approach is resistant to scan-based side channel attack.

CiNii
暗号回路における動的に構造変化するセキュアスキャンアーキテクチャ

跡部浩士, 奈良竜太, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. VLD, VLSI設計技術 108 ( 298 ) 55 - 59 2008年11月

　概要を見る

スキャンテストはスキャンチェインを用いた手法で一般的かつ強力なテスト手法である.しかし,スキャンチェインは外部から回路内部の情報を取得できるため,暗号回路においては有効な攻撃手段となりえる.本稿ではスキャンベース攻撃に対するセキュアスキャンアーキテクチャを提案する.スキャンチェイン内のランダムな場所にインバータを挿入し,スキャンチェインの構造を複雑にする防御手法が提案されているが,回路設計時にインバータを挿入する場所が固定されてしまうため,その特徴を利用し攻撃される可能性がある.したがって,設計した後にもスキャンチェインの構造を動的に変化させる必要があると考えられる.我々はラッチを用いて過去のFFの状態を利用することで次のスキャンFFへの出力を変化させる状態依存スキャンFF(SDSFF)を提案する.このスキャンFFを用いることでスキャンチェインの構造を動的に変化させることが可能であり,コントローラを必要としないため面積オーバーヘッドも少ない.AES暗号回路に提案手法を実装し,評価を行った.

CiNii
暗号回路における動的に構造変化するセキュアスキャンアーキテクチャ

跡部浩士, 奈良竜太, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. DC, ディペンダブルコンピューティング : IEICE technical report 108 ( 299 ) 55 - 59 2008年11月

　概要を見る

スキャンテストはスキャンチェインを用いた手法で一般的かつ強力なテスト手法である.しかし,スキャンチェインは外部から回路内部の情報を取得できるため,暗号回路においては有効な攻撃手段となりえる.本稿ではスキャンベース攻撃に対するセキュアスキャンアーキテクチャを提案する.スキャンチェイン内のランダムな場所にインバータを挿入し,スキャンチェインの構造を複雑にする防御手法が提案されているが,回路設計時にインバータを挿入する場所が固定されてしまうため,その特徴を利用し攻撃される可能性がある.したがって,設計した後にもスキャンチェインの構造を動的に変化させる必要があると考えられる.我々はラッチを用いて過去のFFの状態を利用することで次のスキャンFFへの出力を変化させる状態依存スキャンFF(SDSFF)を提案する.このスキャンFFを用いることでスキャンチェインの構造を動的に変化させることが可能であり,コントローラを必要としないため面積オーバーヘッドも少ない.AES暗号回路に提案手法を実装し,評価を行った.

CiNii
命令メモリビット幅削減に基づく低エネルギーASIP合成手法

小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. ICD, 集積回路 107 ( 509 ) 25 - 30 2008年03月

　概要を見る

本稿ではASIPを対象としたハードウェア/ソフトウェア協調合成システムにおける命令メモリビット幅削減に基づく低エネルギー化手法を提案する。VLIW型プロセッサは並列に命令を発行可能だが,命令メモリのビット幅が長くなり,消費電力・消費エネルギーを無駄に増加させてしまう.したがって,VLIW型プロセッサの命令メモリのビット幅の削減は,高性能でエネルギー効率の高いプロセッサを実現可能にすると考えられる.命令メモリのビット幅は命令エンコーディング形式に依存し,それはオペコードとオペランド群で構成される.オペコードのビット幅は命令セットにおける命令数に,オペランドのビット幅は汎用レジスタ数に依存する.また,我々はオペコードのビット幅を削減するために,結合命令の概念を導入した.結合命令は各VLIWスロットで同時に発行される複数の命令を1つの命令として取り扱った命令である.我々は,オペコードビット幅削減アルゴリズム,オペランドビット幅削減アルゴリズム,エネルギー最小化アルゴリズムの3つのアルゴリズムで構成されるASIP合成システムを構築した.実験結果では,メモリを含むプロセッサ全体で,9%〜12%の消費エネルギーを削減することを確認した.

CiNii
命令メモリビット幅削減に基づく低エネルギーASIP合成手法 (VLSI設計技術)

小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告 107 ( 506 ) 25 - 30 2008年03月

　概要を見る

本稿ではASIPを対象としたハードウェア/ソフトウェア協調合成システムにおける命令メモリビット幅削減に基づく低エネルギー化手法を提案する。VLIW型プロセッサは並列に命令を発行可能だが,命令メモリのビット幅が長くなり,消費電力・消費エネルギーを無駄に増加させてしまう.したがって,VLIW型プロセッサの命令メモリのビット幅の削減は,高性能でエネルギー効率の高いプロセッサを実現可能にすると考えられる.命令メモリのビット幅は命令エンコーディング形式に依存し,それはオペコードとオペランド群で構成される.オペコードのビット幅は命令セットにおける命令数に,オペランドのビット幅は汎用レジスタ数に依存する.また,我々はオペコードのビット幅を削減するために,結合命令の概念を導入した.結合命令は各VLIWスロットで同時に発行される複数の命令を1つの命令として取り扱った命令である.我々は,オペコードビット幅削減アルゴリズム,オペランドビット幅削減アルゴリズム,エネルギー最小化アルゴリズムの3つのアルゴリズムで構成されるASIP合成システムを構築した.実験結果では,メモリを含むプロセッサ全体で,9%〜12%の消費エネルギーを削減することを確認した.

CiNii
GF(2^n)及びGF(P)におけるスケーラブル双基数ユニファイド型モンゴメリ乗算器

谷村和幸, 奈良竜太, 小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. CAS, 回路とシステム 107 ( 101 ) 43 - 48 2007年06月

　概要を見る

公開鍵記号の1つである楕円曲線暗号の中で支配的な演算である剰余乗算には,モンゴメリ乗算が一般的に使われる.モンゴメリ乗算器には暗号強度によって扱うオペランドのビット数が異なるので,スケーラビリティが要求される.また,楕円曲線暗号はGF(2^n)もしくはGF(P)上で演算されるため,両フィールドを扱えるスケーラブルなユニファイド型乗算器も過去に提案されている.しかし, GF(P)を扱う回路の方が, GF(2^n)より遅延時間が長いため,フィールド毎に動作周波数を変えるか, GF(P)の時だけクロックサイクル数の増加と引き換えに基数を小さくする必要がある.本稿ではGF(2^n)及びGF(P)におけるスケーラブル双基数ユニファイド型モンゴメリ乗算器を提案する.提案アーキテクチャは基数2^<16>で4並列化したGF(P)乗算器と基数2^<64>のGF(2^n)乗算器を1つに統合するものである.双基数化によってGF(2^n)とGF(P)における遅延時間差を削減し,それに伴う低基数側のクロックサイクル数増加を,並列化によって削減する.その結果,最速のスケーラブルユニファイド型モンゴメリ乗算器となった.

CiNii
GF(2^n)及びGF(P)におけるスケーラブル双基数ユニファイド型モンゴメリ乗算器

谷村和幸, 奈良竜太, 小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. SIP, 信号処理 107 ( 105 ) 43 - 48 2007年06月

　概要を見る

公開鍵記号の1つである楕円曲線暗号の中で支配的な演算である剰余乗算には,モンゴメリ乗算が一般的に使われる.モンゴメリ乗算器には暗号強度によって扱うオペランドのビット数が異なるので,スケーラビリティが要求される.また,楕円曲線暗号はGF(2^n)もしくはGF(P)上で演算されるため,両フィールドを扱えるスケーラブルなユニファイド型乗算器も過去に提案されている.しかし, GF(P)を扱う回路の方が, GF(2^n)より遅延時間が長いため,フィールド毎に動作周波数を変えるか, GF(P)の時だけクロックサイクル数の増加と引き換えに基数を小さくする必要がある.本稿ではGF(2^n)及びGF(P)におけるスケーラブル双基数ユニファイド型モンゴメリ乗算器を提案する.提案アーキテクチャは基数2^<16>で4並列化したGF(P)乗算器と基数2^<64>のGF(2^n)乗算器を1つに統合するものである.双基数化によってGF(2^n)とGF(P)における遅延時間差を削減し,それに伴う低基数側のクロックサイクル数増加を,並列化によって削減する.その結果,最速のスケーラブルユニファイド型モンゴメリ乗算器となった.

CiNii
GF(2^n)及びGF(P)におけるスケーラブル双基数ユニファイド型モンゴメリ乗算器

谷村和幸, 奈良竜太, 小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. VLD, VLSI設計技術 107 ( 103 ) 43 - 48 2007年06月

　概要を見る

公開鍵記号の1つである楕円曲線暗号の中で支配的な演算である剰余乗算には,モンゴメリ乗算が一般的に使われる.モンゴメリ乗算器には暗号強度によって扱うオペランドのビット数が異なるので,スケーラビリティが要求される.また,楕円曲線暗号はGF(2^n)もしくはGF(P)上で演算されるため,両フィールドを扱えるスケーラブルなユニファイド型乗算器も過去に提案されている.しかし, GF(P)を扱う回路の方が, GF(2^n)より遅延時間が長いため,フィールド毎に動作周波数を変えるか, GF(P)の時だけクロックサイクル数の増加と引き換えに基数を小さくする必要がある.本稿ではGF(2^n)及びGF(P)におけるスケーラブル双基数ユニファイド型モンゴメリ乗算器を提案する.提案アーキテクチャは基数2^<16>で4並列化したGF(P)乗算器と基数2^<64>のGF(2^n)乗算器を1つに統合するものである.双基数化によってGF(2^n)とGF(P)における遅延時間差を削減し,それに伴う低基数側のクロックサイクル数増加を,並列化によって削減する.その結果,最速のスケーラブルユニファイド型モンゴメリ乗算器となった.

CiNii
XML をベースとした CDFG マニピュレーションフレームワーク： CoDaMa

小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

情報処理学会研究報告システムLSI設計技術（SLDM） 2007 ( 2 ) 73 - 78 2007年01月

　概要を見る

本稿では、ハードウェア／ソフトウェア（HW/SW）協調合成システムや高位合成システム構築のための、XML をベースとした CDFG (Control Data Flow Graph) 操作フレームワークを提案する。CDFG は制御の流れを表す CFG (Control Flow Graph) とデータの流れを表す DFG (Data Flow Graph) で構成される。HW/SW 協調合成や高位レベル合成では、アプリケーションプログラムの内部表現として CDFG が用いられることが多い。それらの合成システムは、要求性能と設計制約を満たす最適なハードウェアやソフトウェアを合成するため、さまざまな最適化アルゴリズムによって CDFG を操作し、自動的に設計探索を行う。近年の SoC (System On a Chip) アプリケーションの大規模化に伴い、合成システムに求められる機能も高度化しており、合成システム開発にかかる工数も増加している。提案フレームワークでは、合成システム開発の生産性を向上させるため、アルゴリズムをモジュール単位で実装し、各モジュールの組み合わせによって合成システムを構築する。アプリケーションプログラムの中間表現を XML で記述し、入出力インタフェースをライブラリとして提供することで、開発者は容易にアルゴリズムを実装し、合成システムを構築することができる。This paper proposes an XML-based framework to manipulate CDFGs (Control Data Flow Graphs) for HW/SW (Hardware / Software) co-synthesis systems or high-level synthesis systems. A CDFG is composed of CFG (Control Flow Graph) and DFGs (Data Flow Graphs). In HW/SW co-synthesis systems or high-level synthesis system, CDFGs are often adopted as an internal representation of input application programs. The systems explore design space automatically with various optimization algorithm in order to synthesize hardware and software which satisfy performance requirements and design constraints. However, with the increased scale of the recent SoC (System On a Chip) applications, synthesis systems require implemented more advanced functions, and it would result in increased development efforts. In the proposed framework, developers implement algorithm as modules and construct the synthesis systems by combination of the modules in order to improve development productivity. The developers can implement algorithm and construct the systems easily by using XML descriptions as intermediate representation of application programs and providing the input/output interface.

CiNii
XMLをベースとしたCDFGマニピュレーションフレームワーク : CoDaMa

小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. CPSY, コンピュータシステム 106 ( 456 ) 19 - 24 2007年01月

　概要を見る

本稿では,ハードウェア/ソフトウェア(HW/SW)協調合成システムや高位合成システム構築のための,XMLをベースとしたCDFG (Control Data Flow Graph)操作フレームワークを提案する.CDFGは制御の流れを表すCFG (Control Flow Graph)とデータの流れを表すDFG(Data Flow Graph)で構成される.HW/SW協調合成や高位レベル合成では,アプリケーションプログラムの内部表現としてCDFGが用いられることが多い.それらの合成システムは,要求性能と設計制約を満たす最適なハードウェアやソフトウェアを合成するため,さまざまな最適化アルゴリズムによってCDFGを操作し,自動的に設計探索を行う.近年のSoC (System On a Chip)アプリケーションの大規模化に伴い,合成システムに求められる機能も高度化しており,合成システム開発にかかる工数も増加している.提案フレームワークでは,合成システム開発の生産性を向上させるため,アルゴリズムをモジュール単位で実装し,各モジュールの組み合わせによって合成システムを構築する.アプリケーションプログラムの中間表現をXMLで記述し,入出力インタフェースをライブラリとして提供することで,開発者は容易にアルゴリズムを実装し,合成システムを構築することができる.

CiNii
XMLをベースとしたCDFGマニピュレーションフレームワーク : CoDaMa

小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. RECONF, リコンフィギャラブルシステム : IEICE technical report 106 ( 458 ) 19 - 24 2007年01月

　概要を見る

本稿では,ハードウェア/ソフトウェア(HW/SW)協調合成システムや高位合成システム構築のための,XMLをベースとしたCDFG (Control Data Flow Graph)操作フレームワークを提案する.CDFGは制御の流れを表すCFG (Control Flow Graph)とデータの流れを表すDFG(Data Flow Graph)で構成される.HW/SW協調合成や高位レベル合成では,アプリケーションプログラムの内部表現としてCDFGが用いられることが多い.それらの合成システムは,要求性能と設計制約を満たす最適なハードウェアやソフトウェアを合成するため,さまざまな最適化アルゴリズムによってCDFGを操作し,自動的に設計探索を行う.近年のSoC (System On a Chip)アプリケーションの大規模化に伴い,合成システムに求められる機能も高度化しており,合成システム開発にかかる工数も増加している.提案フレームワークでは,合成システム開発の生産性を向上させるため,アルゴリズムをモジュール単位で実装し,各モジュールの組み合わせによって合成システムを構築する.アプリケーションプログラムの中間表現をXMLで記述し,入出力インタフェースをライブラリとして提供することで,開発者は容易にアルゴリズムを実装し,合成システムを構築することができる.

CiNii
XMLをベースとしたCDFGマニピュレーションフレームワーク : CoDaMa

小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. VLD, VLSI設計技術 106 ( 454 ) 19 - 24 2007年01月

　概要を見る

本稿では,ハードウェア/ソフトウェア(HW/SW)協調合成システムや高位合成システム構築のための,XMLをベースとしたCDFG (Control Data Flow Graph)操作フレームワークを提案する.CDFGは制御の流れを表すCFG (Control Flow Graph)とデータの流れを表すDFG(Data Flow Graph)で構成される.HW/SW協調合成や高位レベル合成では,アプリケーションプログラムの内部表現としてCDFGが用いられることが多い.それらの合成システムは,要求性能と設計制約を満たす最適なハードウェアやソフトウェアを合成するため,さまざまな最適化アルゴリズムによってCDFGを操作し,自動的に設計探索を行う.近年のSoC (System On a Chip)アプリケーションの大規模化に伴い,合成システムに求められる機能も高度化しており,合成システム開発にかかる工数も増加している.提案フレームワークでは,合成システム開発の生産性を向上させるため,アルゴリズムをモジュール単位で実装し,各モジュールの組み合わせによって合成システムを構築する.アプリケーションプログラムの中間表現をXMLで記述し,入出力インタフェースをライブラリとして提供することで,開発者は容易にアルゴリズムを実装し,合成システムを構築することができる.

CiNii
アプリケーションプロセッサのフォワーディングユニット最適化手法

日浦敏宏, 小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

情報処理学会研究報告システムLSI設計技術（SLDM） 2006 ( 126 ) 181 - 186 2006年11月

　概要を見る

定用途向けプロセッサは近年小型，低コスト，高性能に加え，設計時間の短縮が求められている．我々の提案しているアプリケーションプロセッサコア向けＨＷ/ＳＷ協調設計システムSPADESは，アプリケーションの実行時間制約を満たす範囲で最小面積のアプリケーションプロセッサを合成することを目的とする．SPADESの面積削減手法は，不要なＨＷユニットの削減と命令セットの変更をベースとしているが，より面積の小さいプロセッサを合成するには，命令セットの変更を伴わないプロセッサアーキテクチャレベルでの最適化手法が有効であると考えられる．本稿ではフォワーデイングユニットについて着目する．一般的なフォワーデイングユニットは，プロセッサのパイプライン段数やスロット数が増えると比較対象のデータ数が増え，クリティカルパスになりやすい．そこで従来のフォワーディングユニットよりも１ステージ分早い段階で判定を行う先見判定型のフォワーディングユニットが提案されている．一般的な型と先見判定型は面積/遅延でトレードオフの関係にあり，提案手法はプロセッサアーキテクチャパラメータから最適な方式を選択しＨＤＬを自動生成する．それぞれの方式のフォワーディングユニットＨＤＬ記述自動生成システムを実装し，様々なパラメータを与えてフォワーディングユニットのＨＤＬ記述の生成を行い，それぞれの面積/遅延にトレードオフの関係にあることが示された.To meet the requirements in application specific processor designs, such as area, cost, performance and design time, we have been developing a HW/SW co-design system, called SPADES, which can generate an applica tion specific processor with minimum area on the constraint of the execution time of an application. In SPADES, we reduce the area by reducing unnecessary HW unit and then change the instruction set. However, to change the instruction set will affect the processor architecture. On the other hand, forwarding unit is easy to become the critical path in processors when the processor architecture becomes complex. Thus in this paper, we focus on the forwarding unit for optimization by making a tradeoff between area and delay while without any changes in the instruction set. We also propose a new forwarding unit architecture, called foresight judgment type forwarding unit, which can be incorporated into SPADES to generate HDL description automatically without any knowledge of our system. Experimental results show that the proposed method is more suitable in HW/SW co-design systems to generate the optimized forwarding unit.

CiNii
アプリケーションプロセッサのフォワーディングユニット最適化手法

日浦敏宏, 小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. VLD, VLSI設計技術 106 ( 389 ) 49 - 54 2006年11月

　概要を見る

特定用途向けプロセッサは近年小型,低コスト,高性能に加え,設計時間の短縮が求められている.我々の提案しているアプリケーションプロセッサコア向けHW/SW協調設計システムSPADESは,アプリケーションの実行時間制約を満たす範囲で最小面積のアプリケーションプロセッサを合成することを目的とする.SPADESの面積削減手法は,不要なHWユニットの削減と命令セットの変更をベースとしているが,より面積の小さいプロセッサを合成するには,命令セットの変更を伴わないプロセッサアーキテクチャレベルでの最適化手法が有効であると考えられる.本稿ではフォワーディングユニットについて着目する.一般的なフォワーディングユニットは,プロセッサのパイプライン段数やスロット数が増えると比較対象のデータ数が増え,クリティカルパスになりやすい.そこで従来のフォワーディングユニットよりも1ステージ分早い段階で判定を行う先見判定型のフォワーディングユニットが提案されている.一般的な型と先見判定型は面積/遅延でトレードオフの関係にあり,提案手法はプロセッサアーキテクチャパラメータから最適な方式を選択しHDLを自動生成する.それぞれの方式のフォワーディングユニットHDL記述自動生成システムを実装し,様々なパラメータを与えてフォワーディングユニットのHDL記述の生成を行い,それぞれの面積/遅延にトレードオフの関係にあることが示された.

CiNii
アプリケーションプロセッサのフォワーディングユニット最適化手法

日浦敏宏, 小原俊逸, 史又華, 戸川望, 柳澤政生, 大附辰夫

電子情報通信学会技術研究報告. DC, ディペンダブルコンピューティング 106 ( 392 ) 49 - 54 2006年11月

　概要を見る

特定用途向けプロセッサは近年小型,低コスト,高性能に加え,設計時間の短縮が求められている.我々の提案しているアプリケーションプロセッサコア向けHW/SW協調設計システムSPADESは,アプリケーションの実行時間制約を満たす範囲で最小面積のアプリケーションプロセッサを合成することを目的とする.SPADESの面積削減手法は,不要なHWユニットの削減と命令セットの変更をベースとしているが,より面積の小さいプロセッサを合成するには,命令セットの変更を伴わないプロセッサアーキテクチャレベルでの最適化手法が有効であると考えられる.本稿ではフォワーディングユニットについて着目する.一般的なフォワーディングユニットは,プロセッサのパイプライン段数やスロット数が増えると比較対象のデータ数が増え,クリティカルパスになりやすい.そこで従来のフォワーディングユニットよりも1ステージ分早い段階で判定を行う先見判定型のフォワーディングユニットが提案されている.一般的な型と先見判定型は面積/遅延でトレードオフの関係にあり,提案手法はプロセッサアーキテクチャパラメータから最適な方式を選択しHDLを自動生成する.それぞれの方式のフォワーディングユニットHDL記述自動生成システムを実装し,様々なパラメータを与えてフォワーディングユニットのHDL記述の生成を行い,それぞれの面積/遅延にトレードオフの関係にあることが示された.

CiNii

▼全件表示

産業財産権

信号処理装置および信号処理方法

史又華, 戸川望, 柳澤政生, 五十嵐博昭

特許権

J-GLOBAL
故障攻撃検出回路および暗号処理装置

戸川望, 五十嵐博昭, 史又華

特許権

J-GLOBAL

現在担当している科目

Master's Thesis (Department of Electronic and Physical Systems)

大学院基幹理工学研究科

2026年通年
修士論文（電子）

大学院基幹理工学研究科

2026年通年
IoTシステム設計

大学院基幹理工学研究科

2026年春学期
機械学習ハードウェア

大学院基幹理工学研究科

2026年秋学期
知能システム設計演習Ｄ

大学院基幹理工学研究科

2026年秋学期
知能システム設計演習Ｂ

大学院基幹理工学研究科

2026年秋学期
知能システム設計演習Ａ

大学院基幹理工学研究科

2026年春学期
Seminar on Intelligent System Design D

大学院基幹理工学研究科

2026年秋学期
Seminar on Intelligent System Design C

大学院基幹理工学研究科

2026年春学期
Seminar on Intelligent System Design A

大学院基幹理工学研究科

2026年春学期
Seminar on Intelligent System Design B

大学院基幹理工学研究科

2026年秋学期
Seminar on Integrated System Design C

大学院基幹理工学研究科

2026年春学期
Seminar on Integrated System Design B

大学院基幹理工学研究科

2026年秋学期
Seminar on Integrated System Design A

大学院基幹理工学研究科

2026年春学期
Seminar on Integrated System Design D

大学院基幹理工学研究科

2026年秋学期
Research on Integrated System Design

大学院基幹理工学研究科

2026年通年
Hardware for Machine Learning

大学院基幹理工学研究科

2026年秋学期
Research on Intelligent System Design

大学院基幹理工学研究科

2026年通年
知能システム設計演習Ｃ

大学院基幹理工学研究科

2026年春学期
集積システム設計演習Ｂ

大学院基幹理工学研究科

2026年秋学期
集積システム設計演習Ａ

大学院基幹理工学研究科

2026年春学期
集積システム設計演習Ｄ

大学院基幹理工学研究科

2026年秋学期
集積システム設計演習Ｃ

大学院基幹理工学研究科

2026年春学期
機械学習ハードウェア

大学院基幹理工学研究科

2026年秋学期
知能システム設計研究

大学院基幹理工学研究科

2026年通年
集積システム設計研究

大学院基幹理工学研究科

2026年通年
IoTシステム設計

大学院創造理工学研究科

2026年春学期
知能システム設計研究

大学院基幹理工学研究科

2026年通年
集積システム設計研究

大学院基幹理工学研究科

2026年通年
Hardware for Machine Learning

大学院基幹理工学研究科

2026年秋学期
IoTシステム設計

大学院先進理工学研究科

2026年春学期
Physical Electronics Laboratory B

基幹理工学部

2026年春クォーター
Physical Electronics Seminar B

基幹理工学部

2026年春クォーター
Physical Electronics Laboratory A

基幹理工学部

2026年冬クォーター
Physical Electronics Seminar A

基幹理工学部

2026年冬クォーター
電子物理システム特別概論　【前年度成績S評価者用】

基幹理工学部

2026年春学期
電子物理システム演習Ａ　【前年度成績S評価者用】

基幹理工学部

2026年春学期
電子物理システム演習B

基幹理工学部

2026年秋学期
電子物理システム演習Ａ

基幹理工学部

2026年春学期
論理回路

基幹理工学部

2026年春学期
論理回路　【前年度成績S評価者用】

基幹理工学部

2026年春学期
IoTシステム設計

基幹理工学部

2026年春学期
IoTシステム設計

基幹理工学部

2026年春学期
電子物理システム実験B

基幹理工学部

2026年春学期
IoTシステム設計

基幹理工学部

2026年春学期
卒業論文Ｂ

基幹理工学部

2026年春学期
LSIアーキテクチャ

基幹理工学部

2026年秋クォーター
卒業論文Ｂ

基幹理工学部

2026年秋学期
卒業論文Ｂ　　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
卒業論文Ａ　　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
卒業論文Ａ　　【前年度成績S評価者用】

基幹理工学部

2026年春学期
卒業論文Ａ

基幹理工学部

2026年秋学期
卒業論文Ａ

基幹理工学部

2026年春学期
電子物理システム実験B　【前年度成績S評価者用】

基幹理工学部

2026年春学期
電子物理システム特別概論

基幹理工学部

2026年春学期
電子物理システム特別演習

基幹理工学部

2026年秋学期
電子物理システム実験C　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
電子物理システム実験C

基幹理工学部

2026年秋学期
電子物理システム実験A　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
電子物理システム実験A

基幹理工学部

2026年秋学期
電子物理システム演習B　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
Electronic Circuits B [S Grade]

基幹理工学部

2026年夏クォーター
Electronic Circuits B

基幹理工学部

2026年夏クォーター
Electronic Circuits A [S Grade]

基幹理工学部

2026年春クォーター
Electronic Circuits A

基幹理工学部

2026年春クォーター

▼全件表示

他学部・他研究科等兼任情報

理工学術院大学院基幹理工学研究科

学内研究所・附属機関兼任歴

2024年

-

2026年

理工学術院総合研究所兼任研究員
2024年

-

2026年

カーボンニュートラル社会研究教育センター兼任センター員

特定課題制度（学内資金）

TENGを用いた海洋エネルギー発電システムの開発

2025年

　概要を見る

海洋は地球表面の約70％を占め、極めて大きなエネルギー生成ポテンシャルを有している。特に、海洋エネルギーは、地球・月・太陽の運動や重力相互作用に起因して発生し、安定かつ長期的な電力供給源として利用可能である。本研究では、摩擦帯電現象を利用したTriboelectric Nanogenerator（TENG）を基盤とする海洋エネルギー発電システムの開発に取り組んだ。特に、小型・軽量で、不規則かつ低周波な波浪および海流運動からエネルギーを回収可能な高効率ハーベスティングシステムの構築を行った。具体的には、回転型およびスライディング型TENGシステムを設計し、材料選択、電極構造、ならびに複数TENG素子の接続方式の最適化を行った。さらに、海洋環境下での使用を想定し、耐水・耐塩害性を考慮したブイ構造設計と、エネルギーハーベスティング・蓄積用のインタフェース回路を開発した。試作デバイスの評価では、低周波振動条件下において安定した電力出力が得られることを確認するとともに、複数TENGにおける動作周波数および位相差が出力特性に与える影響を解析した。その結果、不規則かつ低周波な波浪・海流環境において、回転型TENGを用いることで分散型Internet of Things (IoT)機器への自己電源供給の実現可能性を示した。
AIシステム脆弱性の解明とその対策手法の開発

2025年 GUO, Chao

　概要を見る

近年、人工知能（AI）技術の急速な発展に伴い、エッジ環境において高性能かつ低コスト・低消費電力な推論を実現するため、深層学習モデルに最適化されたAIアクセラレータの重要性が急速に高まっている。モデルの複雑化および多様化に伴い、従来の手作業によるアクセラレータ設計に代わり、AIアクセラレータ自動生成プラットフォームが広く用いられるようになっている。しかしながら、これらのプラットフォーム自体が内在するセキュリティ脆弱性に関する体系的な研究は極めて限定的であった。そこで本研究では、AIアクセラレータ自動生成プラットフォームのセキュリティ脆弱性に着目し、その脅威モデル、攻撃手法、ならびに防御手法に関する研究を行った。具体的には、勾配情報を用いず、推論過程における出力情報のみを利用して層横断的に攻撃に脆弱なパラメータを探索することで、極めて低オーバーヘッドかつ高い秘匿性を有するハードウェアトロイ（HT）を自動的に探索・生成・挿入可能であることを明らかにした。提案手法を複数の代表的AIモデルに適用した結果、既存手法と比較して攻撃耐性の向上と性能劣化の抑制を両立できることを確認した。さらに、単一FPGAにとどまらず、マルチFPGA設計への攻撃拡張が可能であることも実証した。加えて、攻撃者視点にとどまらず、防御者の立場から悪意ある生成プラットフォームに対抗可能なモデルレベルの防御手法についても提案した。本研究成果は、今後の高信頼・高安全AIシステム構築に向けた基盤技術として有用である。
発電・センシング一体化システムの開発

2024年

　概要を見る

現在、IoTデバイスはバッテリー依存からの脱却が求められる中、超低消費電力動作が可能なセンサーやウェアラブルデバイスにおいて、エネルギーハーベスティング技術の適用が注目されている。特に、摩擦電気ナノジェネレータ（TENG）は、人体動作に伴う機械エネルギーを電力へ変換する技術として、テキスタイルや生体埋め込みデバイスへの応用が期待されている。一方、TENGの非線形かつ時間依存性を持つ内部容量、非対称出力、高電圧・微小電流特性は、エネルギー変換効率の最大化を妨げる要因であり、回路設計上の課題となっている。そのため、本研究は、TENGの課題を解決し、エネルギーハーベスティング、センシング、情報処理を統合する新たな解決策の提案を目的とする。本研究では、垂直接触分離型TENG（CS-TENG）および回転型TENG（RS-TENG）の試作に加え、高効率インターフェース回路を設計し、発電・センシング統合システムを開発した。実証実験では、1.5 Hz/3 Hz動作時に1MΩ負荷条件下で、従来の全波整流回路（FWR）と比較し、それぞれ264倍および168倍の出力電力の向上を達成した。さらに、センサー駆動と無線通信機能の実装により、環境発電・センシング一体化システム実現へのブレークスルーを実証した。本成果はIEEE Transactions on Power Electronicsに投稿中であり、超低消費電力デバイス向けの新たなエネルギー自立ソリューションとして産業応用が期待される。
摩擦帯電型ナノ発電機の試作および自立駆動可能なインターフェース回路の設計

2023年

　概要を見る

Triboelectric nanogenerators (TENGs) are emerging as a promising, cost-effective energy harvesting approach for Internet of Things (IoT) applications. However, their practical deployment faces challenges due to extremely high output voltages, ultra-low intrinsic capacitance, the necessity for non-self-powered interface circuits, and ultra-low transfer efficiency due to output voltage asymmetry.Addressing these issues, this research introduces a novel dual-output rectifier (DOR)-based interface circuit designed to efficiently convert TENG outputs into two different voltage levels, optimizing energy harvesting and switching generation. Our approach leverages energy from the TENG's transition from separation to contact in the negative half cycle to produce a step-down switching control signal. Concurrently, energy generated during the positive half cycle, from contact to separation, is temporarily stored at a high voltage level. This energy is later stepped down and directed to the load via a flyback converter, upon reaching a threshold that activates the control module, optimizing energy transfer efficiency.The effectiveness of our approach was demonstrated using self-manufactured vertical contact-separation TENGs (CS-TENGs) featuring a spring-assisted separation structure comprising two copper sheets and a polytetrafluoroethylene (PTFE) sheet, which occupies a 120mm × 90mm effective contact area. The PTFE layer is 0.1mm thick, allowing for a maximum displacement of 1.2mm in our experiments. The experimental results demonstrate significant improvements, achieving 2.75 and 2.34 times the maximum output power compared to a full-wave rectifier (FWR)-based design at 2 Hz and 3 Hz, respectively. Additionally, under the same frequency and load conditions (1MΩ at 2 and 3 Hz), the output gains are 152 and 160 times greater than the FWR's. Our approach brings about a significant advancement in integrating TENGs for low-frequency and low-load IoT devices, demonstrating its potential for wider practical application. The corresponding achievements have been accepted for publication in IEEE Transactions on Power Electronics.
摩擦発電のための高効率インターフェース回路の設計

2022年蘇怡瑞

　概要を見る

エネルギーハーベスティング技術の一つである摩擦帯電型ナノ発電機(Triboelectric Nanogenerator：TENG)は安価に作製でき、出力電力密度が高く、低周波数帯域での安定性が良い。一方、TENGの出力は高電圧、低電流で、内部抵抗は極めて高いため、そのままInternet of Things (IoT)デバイスに電力供給には適しない。これらの課題を解決するために、本研究は、TENG素子の試作を行い、高い電力変換効率を保ちながら、必要な最適負荷を低減できる自立駆動できるインターフェース回路を提案した。実機で評価した結果では、既存研究に比べて低い負荷抵抗にしても2倍以上の出力電力を得ることが達成した。本研究成果は今後、様々なIoTデバイスへの応用が大いに期待できる。
エッジコンピューティングに向け高いエネルギー効率をもつDNN回路設計技術の創出

2021年葉静浩

　概要を見る

Driven by the explosive growth ofavailable data and powerful computing resources, deep neural networks (DNNs)have achieved remarkable breakthrough recently. As DNN models become morediverse for various applications, how to obtain an optimal accelerator designfor specific NN models while maintaining high energy efficiency with limitedhardware resources becomes an emerging challenge. Unfortunately, few systematicapproaches have been proposed yet. To address this design challenge, amodel-defined energy efficient DNN accelerator design through design spaceexploration and architecture optimization is proposed. Firstly, two dual datareuse approaches are proposed to improve on-chip data utilization efficiency.Secondly, a layer-wise design space exploration framework is developed toprecisely determine the optimal tiling configuration and the corresponding datareuse strategy for target neural network models even with on-chip hardwareresource constraints, which can minimize the amount of data movement betweenoff-chip DRAM and on-chip GLB. Thirdly, an energy efficient accelerator designwith on-chip dual data reuse, centered ifmap/weight buffers, distributed psumbuffers, and optimal resource configuration techniques is presented for GLBaccess reduction and energy efficiency improvement. Compared with thestate-of-the-art accelerators, the proposed design can leverage the energyefficiency by up to 2.7X and 3.6X for AlexNet and VGG, respectively.
ウェアラブルデバイスに適用するエナジーハーベスティングインターフェース回路の開発

2020年

　概要を見る

近年では，スマートフォン需要の拡大や技術発達に伴う機器の小型化・コスト低下などから Internet of Things(IoT)が様々な分野で急速に普及している。IoTデバイスの電源問題を解決するために，エナジーハーベスティング（Energy　Harvesting：EH）技術が大きな注目を集めている。しかし，個々のエナジーハーベスタ（例えば：摩擦帯電型素⼦・圧電素⼦）から得られるエネルギーは⾮常に微弱であるため，高効率なEHインターフェース回路設計技術が必要である。そのため，本研究はウェアラブルデバイスに適⽤するEHインターフェース回路の開発を行った。結果，提案回路を用いて，人体の動作を用いたバッテリーフリー無線送信可能なウェアラブルデバイスの実現を達成した。
デジタル社会に向け長期的に高信頼かつ超低消費電力メモリの研究開発

2019年

　概要を見る

デジタル社会において、データの量が爆発的に増加しているため、メモリ回路の重要性はますます重要になってきている。しかし微細化によってトランジスタの性能ばらつきやソフトエラーの発生率が増大した事と、SRAM (Static Random Access Memory) と呼ばれるメモリの容量が増大し歩留まりが悪くなったことでメモリの消費電力は増大している。そのため、今後のデジタル社会の実現のために、長期的高信頼化かつ超低消費電力化メモリ設計技術の開発が急務である。本研究ではメモリ回路（特にSRAM回路）の長期的に高信頼化・低消費電力化を目的とした回路設計技術の研究開発を行った。特に、低消費電力化かつ長期的な安定性向上の設計技術を提案し、その有効性を評価した。
ビッグデータ処理に向けたApproximate Computingを実現するLSI設計技術の研究開発

2018年

　概要を見る

　近年、IoT（Internet of things）・ビッグデータ・人工知能への注目が高まっている。このような膨大的なデータ解析・処理において大きな問題となるのは、その計算量の多さ、実行時間の長さからくる消費電力の大きさである。一方、ビッグデータ分野では潜在的にエラー耐性を持ち、完全な精度の計算が必要とされない場面が多数ある。そこで、本研究は膨大的かつ潜在的にエラー耐性を持つビッグデータ処理に向けて、Approximate Computingを実現するLSI設計技術に関する研究を行った。特に、①エラー距離を考慮した概算加算回路の性能・精度指標の定式化、②ビット幅削減による低消費電力化FIR 回路、および③CNN に対する算術オーバーフローを考慮したビット幅削減手法などを提案した。
自然エネルギー利用に向けたスマートケースLSI設計技術の創生

2014年

　概要を見る

　本研究ではLSI（大規模集積回路）の設計技術に焦点を当て、不安定且つ微弱な自然エネルギーに適合し、状況に応じた最適な動作を実現するスマートケースLSI設計技術の研究開発を行った。特に、既存LSI設計技術の問題点を解決する革新的技術として「I: 極低エネルギーLSI設計技術」と「II：動作中自己調整機能を持つ設計技術」を提案した。本研究は、既存のワーストケースに基づいたLSI設計方法ではなく、回路が動作時自己調整により処理性能・消費電力・信頼性を最大限引き出すことが可能なシステムLSI設計基盤技術を開発した。
ディペンタブルな低電圧ＬＳＩ設計技術に関する研究

2011年

　概要を見る

　情報通信機器が高性能化するにしたがい、消費電力の増大が大きな問題になりつつある。LSI回路の低消費電力化には、LSI の電源電圧を下げることが最も効果的である。CMOS回路の動作電力は電圧の自乗に比例するので、電圧を1/3にすれば、単純には消費電力がほぼ1/10 になる。しかし、低電圧の条件下ではCMOS回路の動作が不安定になり、LSIの製造ばらつきやノイズなどに影響され、動作マージン減少、誤動作などの障害が、現状と比較して極めて増大する。つまり将来安心かつエコなアンビエント情報社会を実現するためには、情報通信・処理の主要素子であるCMOS トランジスタの動作電圧をしきい値電圧以下に低減できるLSI自動化設計技術と高信頼化設計技術の統合・融合したディペンタブルな低電圧LSI設計基盤技術が強く求められると考える。　本研究は、高い信頼性を持つディペンタブルな超低電圧LSI設計技術の開発を目的とする。研究の目標としては、既存研究（カスタム設計）と異なり、自動化設計により、設計複雑度や設計周期を減らし、並びに回路全体の信頼性を高めることを目指す。また、実チップ設計により、既存研究と比較してエネルギーを低減し、並びに低電圧領域における設計タイミングのばらつきを改善することを目標とする。　今年度では、主に以下の研究項目を行ってきた。（１）超低電圧LSI自動化設計技術について　具体的には、低電圧領域（サブスレッショルド領域）で動作する回路設計のため、①サブスレッショルド領域での遅延・電力のモデルの構築；②サブスレッショルド領域で動作させるため、既存のプロセスライブラリを用いて、トランジスタレベルでシミュレーションを行い、エネルギーが最小な電源電圧を選択できる合成手法の提案、及び③提案した最適エネルギー電圧選択手法をベースに上位レベル（RTLレベル）から低電圧による低エネルギー指向LSI自動合成フローの構築などの研究を取り込んだ。様々なアルゴリズムをコンピュータに実装し、評価実験を行った。既存のカスタム設計と異なり、合成時自動でエネルギー最小な電源電圧の選択ができ、Benchmark回路に適用し有効性を確認した。また、自動化設計により、設計複雑度や設計周期を減らすごとができた。（２）ディペンタブルなLSI設計技術について　　具体的には、①LSI回路動作時の遅延、温度変化および電源電圧変化の解析、及び②電圧変動により、ディレイ変動を検出・制御する技術の研究を行った。研究成果として、理論面から、80％以上の論理パス上発生した遅延エーラの検出ができた。
システムオンチップのテスト容易化設計に関する研究

2005年

　概要を見る

LSIの超大規模化・超微細化により、情報システム全体をワン・チップ上に実現することが可能になった。しかし、高集積化により故障をチェックするべき点が増え、各点の故障をテストするパターンの数は増加し、製造されたチップが正常に動作するか否かを調べるテストは益々困難になってきている。1チップあたりのテスト時間はテスト・パターンの数に比例するので、機能モジュールを複数集積したシステムオンチップ（SoC，System-on-a-Chip）では、集積したモジュールの数に比例した時間がかかり、テストの時間が非常に長くなる。その結果、SoCのテスト・コストが製造コストを超える勢いで増加しており、テストの品質も低下しているため、テストは半導体産業の発展を阻害する要因になりかねない。そのために、SoCに関する低コスト、高品質なテスト容易化設計方法の研究が重要となってきた。上記背景のもと，本研究ではテスト・データの圧縮技術やテスト時間削減の容易化設計手法に関する研究を行う。提案手法ではデザインに挿入され、少ないスキャン・チャネルから多数の内部スキャン・チェーンを供給するデコンプレッサで構成される。最先端のスキャンおよびテスト・データの圧縮技術と比較し、テスト・データの量とテスト時間を最大20 分の1までに削減できる。その研究成果を学会において発表した。また、多種の故障タイプのテストに対応し、故障解析方法の詳細の検討を行った．

▼全件表示